Starred repositories
Explore the Multimodal “Aha Moment” on 2B Model
Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks
(CVPR 2025) PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
R1-onevision, a visual language model capable of deep CoT reasoning.
The official repo of Qwen-VL (通义千问-VL), the chat & pretrained large vision-language model proposed by Alibaba Cloud.
Frontier Multimodal Foundation Models for Image and Video Understanding
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
SALMONN: Speech Audio Language Music Open Neural Network
Rethinking Step-by-step Visual Reasoning in LLMs
This repository provides a valuable reference for researchers in the field of multimodality — start your exploration of RL-based Reasoning MLLMs here!
Understanding Why and How Instruction Tuning Changes Pre-trained Models
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
Video-R1: Towards Super Reasoning Ability in Video Understanding MLLMs
🚀🚀 Train a 26M-parameter GPT completely from scratch in just 2 hours!
🚀 Train a 26M-parameter vision-language model (VLM) from scratch in just 1 hour!
[Blog 1] Documenting a bug in the grpo_trainer of some R1 projects
LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models
[ICLR 2025] Enhancing Cognition and Explainability of Multimodal Foundation Models with Self-Synthesized Data
This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"
Official Repo for Open-Reasoner-Zero
Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling
How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training
This repo contains the code for the paper "Intuitive physics understanding emerges from self-supervised pretraining on natural videos"
PyTorch code and models for V-JEPA self-supervised learning from video.