My learning notes and code for ML systems.

Python 622 29 Updated Feb 10, 2025

veRL: Volcano Engine Reinforcement Learning for LLM

Python 2,754 227 Updated Feb 10, 2025

Open Thoughts: Fully Open Data Curation for Thinking Models

Python 681 45 Updated Feb 7, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 9,451 1,228 Updated Feb 1, 2025

Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory

Python 25,883 1,741 Updated Feb 10, 2025

Lightweight tool to identify Data Contamination in LLMs evaluation

Python 46 1 Updated Mar 8, 2024

Large Reasoning Models

Python 801 44 Updated Dec 3, 2024

Training and Benchmarking LLMs for Code Preference.

Python 30 2 Updated Nov 15, 2024

PyTorch implementation of Tree Preference Optimization (TPO)

Python 11 Updated Oct 22, 2024

Clustering results demo

HTML 1 Updated Jun 11, 2021

[SafeGenAi @ NeurIPS 2024] Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates

Jupyter Notebook 68 Updated Oct 23, 2024
Python 7 1 Updated Aug 5, 2024
Python 9 1 Updated Jun 25, 2022

Implementation of the MATRIX framework (ICML 2024)

Python 45 4 Updated May 6, 2024

Official code for the paper: WALL-E: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents

27 Updated Feb 2, 2025

Arena-Hard-Auto: An automatic LLM benchmark.

Python 732 90 Updated Dec 29, 2024

Command-line YAML, XML, TOML processor - jq wrapper for YAML/XML/TOML documents

Python 2,692 84 Updated Jan 1, 2025

Secrets of RLHF in Large Language Models Part I: PPO

Python 1,312 95 Updated Mar 3, 2024

PaL: Program-Aided Language Models (ICML 2023)

Python 481 61 Updated Jun 30, 2023
Python 22 2 Updated Jul 11, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 9,129 876 Updated Feb 10, 2025

DataComp for Language Models

HTML 1,219 110 Updated Dec 11, 2024

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Python 1,005 67 Updated Sep 25, 2024

Scrape from Twitter using Nitter instances

Python 190 31 Updated Sep 1, 2024

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Python 797 48 Updated Feb 3, 2025

It is said that Ilya Sutskever gave John Carmack this reading list of about 30 research papers on deep learning.

171 24 Updated Jun 4, 2024

Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models

Python 2,664 366 Updated Jan 7, 2025