Skip to content
View lma-c4d's full-sized avatar

Block or report lma-c4d

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 39 Updated Sep 11, 2024

CFBench: A Comprehensive Constraints-Following Benchmark for LLMs

Python 21 3 Updated Aug 26, 2024

The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization

Python 624 98 Updated Mar 23, 2024

A series of large language models developed by Baichuan Intelligent Technology

Python 4,078 293 Updated Jun 22, 2024

NeurIPS 2023: Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark

Python 383 53 Updated May 14, 2024

A 13B large language model developed by Baichuan Intelligent Technology

Python 2,983 237 Updated Sep 6, 2023

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

Python 5,670 506 Updated Jul 18, 2024