bitEEdh

Hang Deng bitEEdh

I am Hang Deng, a student from Beijing Institute of Technology. I am learning to code on GitHub and trying to make some contributions.

1 follower · 6 following

Beijing Institute of Technology
Beijing Institute of Technology

Lists (2)

Sort

multi-cultural alignment

1 repository

semantic communication

4 repositories

Starred repositories

ckkissane / crosscoder-model-diff-replication

Open source replication of Anthropic's Crosscoders for Model Diffing

Python 51 19 Updated Oct 27, 2024

facebookresearch / iGSM

The code for creating the iGSM datasets in papers "Physics of Language Models Part 2.1, Grade-School Math and the Hidden Reasoning Process" (arxiv 2407.20311) and "Physics of Language Models Part 2…

Python 41 2 Updated Jan 12, 2025

cooperleong00 / Awesome-LLM-Interpretability

A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..

221 8 Updated Mar 20, 2025

JShollaj / awesome-llm-interpretability

A curated list of Large Language Model (LLM) Interpretability resources.

1,293 95 Updated Dec 21, 2024

ydyjya / SafetyHeadAttribution

Python 20 Updated Oct 19, 2024

SakanaAI / AI-Scientist

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 10,734 1,546 Updated Apr 13, 2025

jacobdunefsky / transcoder_circuits

Jupyter Notebook 94 14 Updated Nov 17, 2024

vwxyzjn / lm-human-preference-details

RLHF implementation details of OAI's 2019 codebase

Python 186 9 Updated Jan 14, 2024

huggingface / alignment-handbook

Robust recipes to align language models with human and AI preferences

Python 5,128 440 Updated Nov 21, 2024

huggingface / trl

Train transformer language models with reinforcement learning.

Python 13,222 1,800 Updated Apr 15, 2025

huggingface / smol-course

A course on aligning smol models.

Jupyter Notebook 5,733 2,008 Updated Jan 24, 2025

opendilab / awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

3,881 237 Updated Feb 19, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 6,666 715 Updated Apr 15, 2025

hkust-nlp / simpleRL-reason

Simple RL training for reasoning

Python 3,442 256 Updated Apr 10, 2025

PRIME-RL / PRIME

Scalable RL solution for advanced reasoning of language models

Python 1,481 89 Updated Mar 18, 2025

facebookresearch / coconut

Training Large Language Model to Reason in a Continuous Latent Space

Python 1,053 93 Updated Jan 24, 2025

datawhalechina / easy-rl

强化学习中文教程（蘑菇书🍄），在线阅读地址：https://datawhalechina.github.io/easy-rl/

Jupyter Notebook 10,960 2,007 Updated Mar 28, 2025

THUDM / WebRL

Building Open LLM Web Agents with Self-Evolving Online Curriculum RL

Python 357 25 Updated Apr 7, 2025

Scarelette / CulturePark

Python 19 1 Updated Oct 29, 2024

Scarelette / CultureLLM

Python 27 5 Updated Oct 29, 2024

AidaRamezani / cultural_inference

Jupyter Notebook 8 3 Updated Nov 18, 2024

MuMu-Lily / Moral-Beliefs

Python 1 Updated Oct 11, 2024

Goedel-LM / Goedel-Prover

Python 170 22 Updated Apr 4, 2025

lean-dojo / LeanCopilot

LLMs as Copilots for Theorem Proving in Lean

C++ 1,072 99 Updated Apr 13, 2025

lean-dojo / LeanAgent

Python 24 5 Updated Mar 1, 2025

ydyjya / LLM-IHS-Explanation

Jupyter Notebook 45 3 Updated Jun 13, 2024

camel-ai / camel

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

Python 11,917 1,241 Updated Apr 15, 2025

camel-ai / owl

🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

Python 15,573 1,838 Updated Apr 15, 2025

asahi417 / lmppl

Calculate perplexity on a text with pre-trained language models. Support MLM (eg. DeBERTa), recurrent LM (eg. GPT3), and encoder-decoder LM (eg. Flan-T5).

Hang Deng bitEEdh

Lists (2)

multi-cultural alignment

semantic communication

Starred repositories

successive-convex-approximation

integrated-sensing-and-communication

wireless-communication