Skip to content
View DRSY's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report DRSY

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

minimal-cost for training 0.5B R1-Zero

Python 490 61 Updated Feb 20, 2025

Ola: Pushing the Frontiers of Omni-Modal Language Model

Python 246 7 Updated Feb 19, 2025

MoBA: Mixture of Block Attention for Long-Context LLMs

Python 1,217 62 Updated Feb 19, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 10,895 1,068 Updated Feb 16, 2025

A jounery to real multimodel R1 ! We are doing on large-scale experiment

Python 207 2 Updated Feb 12, 2025

A generative speech model for daily dialogue.

Python 34,578 3,727 Updated Feb 18, 2025

RLHF experiments on a single A100 40G GPU. Support PPO, GRPO, REINFORCE, RAFT, RLOO, ReMax, DeepSeek R1-Zero reproducing.

Python 39 6 Updated Feb 19, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 4,902 485 Updated Feb 21, 2025

A fork to add multimodal model training to open-r1

Python 807 43 Updated Feb 8, 2025

Fully open reproduction of DeepSeek-R1

Python 21,029 1,840 Updated Feb 21, 2025
Python 2,199 152 Updated Feb 20, 2025

An Approach to Enhancing the Efficacy of Post-Training Using Synthetic Data by Iterative Data Selection

Python 5 Updated Dec 24, 2024

A series of technical report on Slow Thinking with LLM

Python 412 21 Updated Feb 12, 2025
Python 1 Updated Feb 21, 2025
Python 1,334 50 Updated Nov 21, 2024

Unified KV Cache Compression Methods for Auto-Regressive Models

Python 888 115 Updated Jan 4, 2025

LLM KV cache compression made easy

Python 399 26 Updated Feb 18, 2025

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Python 1,078 72 Updated Jan 23, 2025

Optimizing inference proxy for LLMs

Python 2,047 159 Updated Feb 16, 2025

OS-ATLAS: A Foundation Action Model For Generalist GUI Agents

Python 282 13 Updated Feb 20, 2025

Efficient Triton Kernels for LLM Training

Python 4,457 270 Updated Feb 21, 2025
2 Updated Oct 25, 2024

Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen

373 22 Updated Dec 18, 2024

Inpaint anything using Segment Anything and inpainting models.

Jupyter Notebook 6,895 586 Updated Feb 29, 2024

📖 Full Stack Practice of the Large Language Model Training @ RLChina 2024

Jupyter Notebook 37 4 Updated Oct 15, 2024

Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, I…

Python 5,714 487 Updated Feb 21, 2025
Next