R1-onevision, a visual language model capable of deep CoT reasoning.

274 5 Updated Feb 28, 2025

Muon optimizer: +>30% sample efficiency with <3% wallclock overhead

Python 432 24 Updated Feb 26, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 5,362 448 Updated Feb 28, 2025

A library for advanced large language model reasoning

Python 1,996 176 Updated Feb 21, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 4,474 392 Updated Feb 28, 2025

FlashMLA: Efficient MLA Decoding Kernel for Hopper GPUs

C++ 10,801 712 Updated Mar 1, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 899 44 Updated Feb 28, 2025

A CPU Realtime VLM in 500M. Surpassed Moondream2 and SmolVLM. Training from scratch with ease.

Python 129 14 Updated Feb 25, 2025

Solve Visual Understanding with Reinforced VLMs

Python 3,626 215 Updated Feb 28, 2025

[ICLR 2025] MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts

Python 186 6 Updated Oct 16, 2024

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

5,853 123 Updated Mar 1, 2025

Official Repo for Open-Reasoner-Zero

Python 1,412 60 Updated Mar 1, 2025

Python 132 5 Updated Feb 20, 2025

Train transformer language models with reinforcement learning.

Python 12,103 1,638 Updated Feb 28, 2025

Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.

Python 327 24 Updated Feb 26, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 3,988 364 Updated Mar 1, 2025

Collect every awesome work about r1!

Python 229 6 Updated Feb 28, 2025

Reasoning in LLMs: Papers and Resources, including Chain-of-Thought, OpenAI o1, and DeepSeek-R1 🍓

2,679 154 Updated Feb 21, 2025

LIMO: Less is More for Reasoning

Python 789 34 Updated Feb 24, 2025

A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).

Python 204 17 Updated Feb 20, 2025

A paper list of recent works on token compression for ViT and VLM

338 17 Updated Feb 9, 2025

Align Anything: Training All-modality Model with Feedback

Python 2,459 339 Updated Feb 28, 2025

minimal-cost for training 0.5B R1-Zero

Python 565 74 Updated Feb 26, 2025

✨✨ [ICLR 2025] MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?

Python 93 7 Updated Feb 14, 2025

A fork to add multimodal model training to open-r1

Python 902 49 Updated Feb 8, 2025

A simple framework for experimenting with Reinforcement Learning in Python.

Python 300 101 Updated Feb 27, 2024

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

TypeScript 41,638 3,679 Updated Feb 28, 2025

A collection of resources that investigate social agents.

120 12 Updated Feb 26, 2025