lma-c4d

lma-c4d

Stars

Python 39 Updated Sep 11, 2024

Python 3 1 Updated Aug 19, 2024

CFBench: A Comprehensive Constraints-Following Benchmark for LLMs

Python 21 3 Updated Aug 26, 2024

The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization

Python 624 98 Updated Mar 23, 2024

A series of large language models developed by Baichuan Intelligent Technology

Python 4,078 293 Updated Jun 22, 2024

NeurIPS 2023: Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark

Python 383 53 Updated May 14, 2024

A 13B large language model developed by Baichuan Intelligent Technology

Python 2,983 237 Updated Sep 6, 2023

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

Python 5,670 506 Updated Jul 18, 2024