Stars
My learning notes and code for ML systems.
veRL: Volcano Engine Reinforcement Learning for LLMs
Open Thoughts: Fully Open Data Curation for Thinking Models
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
Fine-tune Llama 3.3, DeepSeek-R1, and reasoning LLMs 2x faster with 70% less memory
Lightweight tool to identify data contamination in LLM evaluation
Training and Benchmarking LLMs for Code Preference.
PyTorch implementation of Tree Preference Optimization (TPO)
[SafeGenAi @ NeurIPS 2024] Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates
Implementation of the MATRIX framework (ICML 2024)
Official code for the paper: WALL-E: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents
Arena-Hard-Auto: An automatic LLM benchmark.
Command-line YAML, XML, TOML processor - jq wrapper for YAML/XML/TOML documents
Secrets of RLHF in Large Language Models Part I: PPO
PaL: Program-Aided Language Models (ICML 2023)
SGLang is a fast serving framework for large language models and vision language models.
Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"
Scrape from Twitter using Nitter instances
A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).
It is said that Ilya Sutskever gave John Carmack this reading list of ~30 research papers on deep learning.
Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models