gyxxyg

🏠

Working from home

Yongxin Guo gyxxyg

🏠

Working from home

Ph.D. Student at CUHKSZ

50 followers · 69 following

https://gyxxyg.github.io/yongxinguo/

Achievements

Stars

Fancy-MLLM / R1-Onevision

R1-onevision, a visual language model capable of deep CoT reasoning.

274 5 Updated Feb 28, 2025

KellerJordan / Muon

Muon optimizer: +>30% sample efficiency with <3% wallclock overhead

Python 432 24 Updated Feb 26, 2025

Wan-Video / Wan2.1

Wan: Open and Advanced Large-Scale Video Generative Models

Python 5,362 448 Updated Feb 28, 2025

maitrix-org / llm-reasoners

A library for advanced large language model reasoning

Python 1,996 176 Updated Feb 21, 2025

deepseek-ai / DeepGEMM

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 4,474 392 Updated Feb 28, 2025

deepseek-ai / FlashMLA

FlashMLA: Efficient MLA Decoding Kernel for Hopper GPUs

C++ 10,801 712 Updated Mar 1, 2025

hiyouga / EasyR1

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 899 44 Updated Feb 28, 2025

lucasjinreal / Namo-R1

A CPU Realtime VLM in 500M. Surpassed Moondream2 and SmolVLM. Training from scratch with ease.

Python 129 14 Updated Feb 25, 2025

om-ai-lab / VLM-R1

Solve Visual Understanding with Reinforced VLMs

Python 3,626 215 Updated Feb 28, 2025

SkyworkAI / MoE-plus-plus

[ICLR 2025] MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts

Python 186 6 Updated Oct 16, 2024

deepseek-ai / open-infra-index

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

5,853 123 Updated Mar 1, 2025

Open-Reasoner-Zero / Open-Reasoner-Zero

Official Repo for Open-Reasoner-Zero

Python 1,412 60 Updated Mar 1, 2025

GAIR-NLP / LIMR

Python 132 5 Updated Feb 20, 2025

huggingface / trl

Train transformer language models with reinforcement learning.

Python 12,103 1,638 Updated Feb 28, 2025

TideDra / lmm-r1

Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.

Python 327 24 Updated Feb 26, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 3,988 364 Updated Mar 1, 2025

ZJU-LLMs / Foundations-of-LLMs

8,079 688 Updated Jan 14, 2025

modelscope / awesome-deep-reasoning

Collect every awesome work about r1!

Python 229 6 Updated Feb 28, 2025

atfortes / Awesome-LLM-Reasoning

Reasoning in LLMs: Papers and Resources, including Chain-of-Thought, OpenAI o1, and DeepSeek-R1 🍓

2,679 154 Updated Feb 21, 2025

GAIR-NLP / LIMO

LIMO: Less is More for Reasoning

Python 789 34 Updated Feb 24, 2025

facebookresearch / RAM

A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).

Python 204 17 Updated Feb 20, 2025

daixiangzi / Awesome-Token-Compress

A paper list of some recent works about Token Compress for Vit and VLM

338 17 Updated Feb 9, 2025

PKU-Alignment / align-anything

Align Anything: Training All-modality Model with Feedback

Python 2,459 339 Updated Feb 28, 2025

dhcode-cpp / X-R1

minimal-cost for training 0.5B R1-Zero

Python 565 74 Updated Feb 26, 2025

yfzhang114 / MME-RealWorld

✨✨ [ICLR 2025] MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?

Python 93 7 Updated Feb 14, 2025

EvolvingLMMs-Lab / open-r1-multimodal

A fork to add multimodal model training to open-r1

Python 902 49 Updated Feb 8, 2025

david-abel / simple_rl

A simple framework for experimenting with Reinforcement Learning in Python.

Python 300 101 Updated Feb 27, 2024

zhangfaen / finetune-Qwen2-VL

Python 326 36 Updated Feb 8, 2025

infiniflow / ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

TypeScript 41,638 3,679 Updated Feb 28, 2025

FudanDISC / SocialAgent

A collection of resources that investigate social agents.

120 12 Updated Feb 26, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Yongxin Guo gyxxyg

Achievements

Achievements

Block or report gyxxyg

Stars

Fancy-MLLM / R1-Onevision

KellerJordan / Muon

Wan-Video / Wan2.1

maitrix-org / llm-reasoners

deepseek-ai / DeepGEMM

deepseek-ai / FlashMLA

hiyouga / EasyR1

lucasjinreal / Namo-R1

om-ai-lab / VLM-R1

SkyworkAI / MoE-plus-plus

deepseek-ai / open-infra-index

Open-Reasoner-Zero / Open-Reasoner-Zero

GAIR-NLP / LIMR

huggingface / trl

TideDra / lmm-r1

volcengine / verl

ZJU-LLMs / Foundations-of-LLMs

modelscope / awesome-deep-reasoning

atfortes / Awesome-LLM-Reasoning

GAIR-NLP / LIMO

facebookresearch / RAM

daixiangzi / Awesome-Token-Compress

PKU-Alignment / align-anything

dhcode-cpp / X-R1

yfzhang114 / MME-RealWorld

EvolvingLMMs-Lab / open-r1-multimodal

david-abel / simple_rl

zhangfaen / finetune-Qwen2-VL

infiniflow / ragflow

FudanDISC / SocialAgent