-
-
oat Public
Forked from sail-sg/oat🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.
Python Apache License 2.0 UpdatedFeb 8, 2025 -
simpleRL-reason Public
Forked from hkust-nlp/simpleRL-reasonThis is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data
Python MIT License UpdatedFeb 7, 2025 -
oat-zero Public
Forked from sail-sg/oat-zeroA lightweight reproduction of DeepSeek-R1-Zero with indepth analysis of self-reflection behavior.
-
ai-cookbook Public
Forked from daveebbelaar/ai-cookbookExamples and tutorials to help developers build AI systems
Python MIT License UpdatedFeb 1, 2025 -
-
trl Public
Forked from huggingface/trlTrain transformer language models with reinforcement learning.
-
-
unsloth Public
Forked from unslothai/unslothFinetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
-
smol-course Public
Forked from huggingface/smol-courseA course on aligning smol models.
Jupyter Notebook Apache License 2.0 UpdatedDec 23, 2024 -
open-instruct Public
Forked from allenai/open-instructPython Apache License 2.0 UpdatedDec 20, 2024 -
-
textgrad Public
Forked from zou-group/textgradTextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.
Python MIT License UpdatedNov 2, 2024 -
al-folio Public template
Forked from alshedivat/al-folioA beautiful, simple, clean, and responsive Jekyll theme for academics
HTML MIT License UpdatedOct 2, 2024 -
pyvene Public
Forked from stanfordnlp/pyveneStanford NLP Python Library for Understanding and Improving PyTorch Models via Interventions
Python Apache License 2.0 UpdatedSep 29, 2024 -
agentic_patterns Public
Forked from neural-maze/agentic_patternsImplementing the 4 agentic patterns from scratch
Jupyter Notebook MIT License UpdatedSep 26, 2024 -
PySvelte Public
Forked from Mech-Interp/PySvelteA library for bridging Python and HTML/Javascript (via Svelte) for creating interactive visualizations
Jupyter Notebook Apache License 2.0 UpdatedSep 8, 2024 -
-
maia Public
Forked from multimodal-interpretability/maiaOfficial implementation of MAIA, A Multimodal Automated Interpretability Agent
Jupyter Notebook UpdatedAug 15, 2024 -
lm-evaluation-harness Public
Forked from EleutherAI/lm-evaluation-harnessA framework for few-shot evaluation of language models.
Python MIT License UpdatedJul 31, 2024 -
llm-course Public
Forked from mlabonne/llm-courseCourse to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Jupyter Notebook Apache License 2.0 UpdatedJul 28, 2024 -
-
sae-transfer Public
Forked from ckkissane/sae-transferCode to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models"
-
awesome-llm-security Public
Forked from corca-ai/awesome-llm-securityA curation of awesome tools, documents and projects about LLM Security.
UpdatedJul 16, 2024 -
refusal_direction Public
Forked from andyrdt/refusal_directionCode and results accompanying the paper "Refusal in Language Models Is Mediated by a Single Direction".
HTML Apache License 2.0 UpdatedJul 8, 2024 -
-
-
representation-engineering Public
Forked from andyzoujm/representation-engineeringRepresentation Engineering: A Top-Down Approach to AI Transparency
Jupyter Notebook MIT License UpdatedJun 28, 2024 -
-
chameleon Public
Forked from facebookresearch/chameleonRepository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
Python Other UpdatedJun 21, 2024