Skip to content
View bitEEdh's full-sized avatar
  • Beijing Institute of Technology
  • Beijing Institute of Technology

Block or report bitEEdh

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Open source replication of Anthropic's Crosscoders for Model Diffing

Python 51 19 Updated Oct 27, 2024

The code for creating the iGSM datasets in papers "Physics of Language Models Part 2.1, Grade-School Math and the Hidden Reasoning Process" (arxiv 2407.20311) and "Physics of Language Models Part 2…

Python 41 2 Updated Jan 12, 2025

A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..

221 8 Updated Mar 20, 2025

A curated list of Large Language Model (LLM) Interpretability resources.

1,293 95 Updated Dec 21, 2024
Python 20 Updated Oct 19, 2024

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 10,734 1,546 Updated Apr 13, 2025
Jupyter Notebook 94 14 Updated Nov 17, 2024

RLHF implementation details of OAI's 2019 codebase

Python 186 9 Updated Jan 14, 2024

Robust recipes to align language models with human and AI preferences

Python 5,128 440 Updated Nov 21, 2024

Train transformer language models with reinforcement learning.

Python 13,222 1,800 Updated Apr 15, 2025

A course on aligning smol models.

Jupyter Notebook 5,733 2,008 Updated Jan 24, 2025

A curated list of reinforcement learning with human feedback resources (continually updated)

3,881 237 Updated Feb 19, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 6,666 715 Updated Apr 15, 2025

Simple RL training for reasoning

Python 3,442 256 Updated Apr 10, 2025

Scalable RL solution for advanced reasoning of language models

Python 1,481 89 Updated Mar 18, 2025

Training Large Language Model to Reason in a Continuous Latent Space

Python 1,053 93 Updated Jan 24, 2025

强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/

Jupyter Notebook 10,960 2,007 Updated Mar 28, 2025

Building Open LLM Web Agents with Self-Evolving Online Curriculum RL

Python 357 25 Updated Apr 7, 2025
Python 19 1 Updated Oct 29, 2024
Python 27 5 Updated Oct 29, 2024
Jupyter Notebook 8 3 Updated Nov 18, 2024
Python 1 Updated Oct 11, 2024
Python 170 22 Updated Apr 4, 2025

LLMs as Copilots for Theorem Proving in Lean

C++ 1,072 99 Updated Apr 13, 2025
Python 24 5 Updated Mar 1, 2025
Jupyter Notebook 45 3 Updated Jun 13, 2024

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

Python 11,917 1,241 Updated Apr 15, 2025

🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

Python 15,573 1,838 Updated Apr 15, 2025

Calculate perplexity on a text with pre-trained language models. Support MLM (eg. DeBERTa), recurrent LM (eg. GPT3), and encoder-decoder LM (eg. Flan-T5).

Python 155 13 Updated Oct 1, 2024

Code for CRATE (Coding RAte reduction TransformEr).

Python 1,214 95 Updated Oct 23, 2024
Next