Stars
verl: Volcano Engine Reinforcement Learning for LLMs
Arena-Hard-Auto: An automatic LLM benchmark.
Fully open reproduction of DeepSeek-R1
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
Minimalistic 4D-parallelism distributed training framework for educational purposes
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Automation scripts for setting up a basic development environment.
Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using Llama mode…
Tools for merging pretrained large language models.
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2…
Submodule of evalverse forked from [google-research/instruction_following_eval](https://github.com/google-research/google-research/tree/master/instruction_following_eval)
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Large World Model -- Modeling Text and Video with Millions Context
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
Minimalistic large language model 3D-parallelism training
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
The official implementation of Self-Play Fine-Tuning (SPIN)
Ongoing research training transformer models at scale
Building a quick conversation-based search demo with Lepton AI.
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
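The tiny scalar-valued autograd engine listed above can be sketched in a few dozen lines. This is an illustrative reimplementation of the idea (a `Value` class that records the ops producing it, then applies the chain rule in reverse topological order), not the repo's actual API:

```python
class Value:
    """A scalar that remembers its parents so gradients can flow backward."""

    def __init__(self, data, _parents=()):
        self.data = data
        self.grad = 0.0
        self._parents = _parents
        self._backward = lambda: None  # set by the op that created this node

    def __add__(self, other):
        other = other if isinstance(other, Value) else Value(other)
        out = Value(self.data + other.data, (self, other))

        def _backward():
            # d(out)/d(self) = d(out)/d(other) = 1
            self.grad += out.grad
            other.grad += out.grad

        out._backward = _backward
        return out

    def __mul__(self, other):
        other = other if isinstance(other, Value) else Value(other)
        out = Value(self.data * other.data, (self, other))

        def _backward():
            # product rule: each input's grad scales by the other input
            self.grad += other.data * out.grad
            other.grad += self.data * out.grad

        out._backward = _backward
        return out

    def backward(self):
        # Topologically sort the graph, then run the chain rule in reverse.
        topo, seen = [], set()

        def build(v):
            if v not in seen:
                seen.add(v)
                for p in v._parents:
                    build(p)
                topo.append(v)

        build(self)
        self.grad = 1.0
        for v in reversed(topo):
            v._backward()


x = Value(3.0)
y = x * x + x  # y = x^2 + x, so dy/dx = 2x + 1 = 7 at x = 3
y.backward()
print(x.grad)  # 7.0
```

The key design choice is that each op closes over its inputs and accumulates (`+=`) into their `.grad`, which handles nodes used more than once, as with `x` here.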
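The Byte Pair Encoding item above refers to the merge loop at the heart of LLM tokenizers: repeatedly fuse the most frequent adjacent token pair into a new token. A minimal sketch under that assumption (the function name and greedy left-to-right merge are illustrative, not the repo's code):

```python
from collections import Counter


def train_bpe(text, num_merges):
    """Learn BPE merges by fusing the most frequent adjacent pair each round."""
    tokens = list(text)  # start from individual characters
    merges = []
    for _ in range(num_merges):
        pairs = Counter(zip(tokens, tokens[1:]))
        if not pairs:
            break
        best = max(pairs, key=pairs.get)  # most frequent adjacent pair
        merges.append(best)
        merged, i = [], 0
        while i < len(tokens):  # rewrite the sequence, fusing `best` greedily
            if i + 1 < len(tokens) and (tokens[i], tokens[i + 1]) == best:
                merged.append(tokens[i] + tokens[i + 1])
                i += 2
            else:
                merged.append(tokens[i])
                i += 1
        tokens = merged
    return tokens, merges


tokens, merges = train_bpe("aaabdaaabac", 3)
print(tokens)  # ['aaab', 'd', 'aaab', 'a', 'c']
```

Real tokenizers keep the learned `merges` list and replay it in order to encode new text; byte-level variants start from UTF-8 bytes rather than characters.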