yuanenming

verl: Volcano Engine Reinforcement Learning for LLMs

Python 4,618 441 Updated Mar 12, 2025

Arena-Hard-Auto: An automatic LLM benchmark.

Python 758 93 Updated Dec 29, 2024

Fully open reproduction of DeepSeek-R1

Python 22,612 2,028 Updated Mar 11, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,111 1,414 Updated Mar 10, 2025

Minimalistic 4D-parallelism distributed training framework for educational purposes

Python 927 69 Updated Mar 7, 2025

Code for BLT research paper

Python 1,433 111 Updated Mar 11, 2025

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

Jupyter Notebook 11,382 1,670 Updated Aug 8, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 5,541 544 Updated Mar 12, 2025

Automation scripts for setting up a basic development environment.

Shell 89 13 Updated Mar 10, 2025

Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using Llama mode…

Jupyter Notebook 16,425 2,368 Updated Mar 11, 2025

Tools for merging pretrained large language models.

Python 5,402 510 Updated Mar 12, 2025

Mamba SSM architecture

Python 14,198 1,238 Updated Jan 18, 2025

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

Python 6,485 792 Updated Mar 10, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 13,306 2,728 Updated Mar 12, 2025

LLM101n: Let's build a Storyteller

32,336 1,748 Updated Aug 1, 2024

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2…

Python 14,910 1,514 Updated Mar 11, 2025

Submodule of evalverse forked from [google-research/instruction_following_eval](https://github.com/google-research/google-research/tree/master/instruction_following_eval)

Python 13 4 Updated May 4, 2024

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 87,796 23,570 Updated Mar 12, 2025

Healthy Learning to Age 150: An Incomplete Guide to Tuning the Human Body System

13,539 992 Updated May 9, 2024

Large World Model -- Modeling Text and Video with Millions Context

Python 7,248 557 Updated Oct 19, 2024

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 2,289 170 Updated Mar 4, 2025

Minimalistic large language model 3D-parallelism training

Python 1,675 163 Updated Mar 10, 2025

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 6,936 624 Updated May 31, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,472 899 Updated Jul 1, 2024

The official implementation of Self-Play Fine-Tuning (SPIN)

Python 1,125 98 Updated May 8, 2024

Ongoing research training transformer models at scale

Python 11,713 2,636 Updated Mar 12, 2025

Building a quick conversation-based search demo with Lepton AI.

TypeScript 8,032 1,024 Updated Jan 14, 2025

Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI

Python 1,368 71 Updated Apr 11, 2024