Starred repositories
DeepEP: an efficient expert-parallel communication library
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
MoBA: Mixture of Block Attention for Long-Context LLMs
Train with GRPO using zero dataset and low resources; 8-bit/4-bit, LoRA/QLoRA, and multi-GPU supported ...
Dense Dilated Convolutions Merging Network for Semantic Segmentation
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)
Use PEFT or full-parameter training to fine-tune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, LLaVA, I…
Fully open data curation for reasoning models
The easiest tool for fine-tuning LLMs, generating synthetic data, and collaborating on datasets.
[arXiv 2024] Generalizable Humanoid Manipulation with 3D Diffusion Policies. Part 1: Training & Deployment of iDP3
[RSS 2024] 3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations
Fully open reproduction of DeepSeek-R1
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
verl: Volcano Engine Reinforcement Learning for LLMs
Janus-Series: Unified Multimodal Understanding and Generation Models
Unofficial implementation of Titans, SOTA memory for transformers, in PyTorch
Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"
Stanford NLP Python library for Representation Finetuning (ReFT)
Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
Unofficial implementation of "Simplifying, Stabilizing & Scaling Continuous-Time Consistency Models" for MNIST
An Open Large Reasoning Model for Real-World Solutions