Skip to content
View linhaojia13's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Xiamen University
  • Xiamen

Block or report linhaojia13

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This is the official implementation of our paper "QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehension"

Python 55 Updated Mar 10, 2025

auto sign cursor

Python 6,207 909 Updated Mar 8, 2025

Official Repo for Open-Reasoner-Zero

Python 1,559 73 Updated Mar 5, 2025

minimal-cost for training 0.5B R1-Zero

Python 624 80 Updated Feb 26, 2025

Solve Visual Understanding with Reinforced VLMs

Python 3,963 243 Updated Mar 9, 2025

✨✨Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy

Python 235 26 Updated Mar 8, 2025

The Next Step Forward in Multimodal LLM Alignment

Python 120 3 Updated Mar 5, 2025

Train your grpo with zero dataset and low resources, 8bit/4bit/lora/qlora supported, multi-gpu supported ...

Python 58 7 Updated Feb 26, 2025

This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python 3,108 229 Updated Feb 19, 2025

Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥

Python 34,111 2,482 Updated Mar 10, 2025

A fork to add multimodal model training to open-r1

Python 1,008 51 Updated Feb 8, 2025

A jounery to real multimodel R1 ! We are doing on large-scale experiment

Python 261 5 Updated Mar 8, 2025

Reproduce R1 Zero on Logic Puzzle

Python 2,072 134 Updated Mar 3, 2025

Repo for paper "T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs"

Jupyter Notebook 49 Updated Mar 7, 2025

MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models

Python 26 1 Updated Jan 22, 2025

Let your Claude able to think

TypeScript 14,665 1,706 Updated Mar 10, 2025

[Survey] Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

382 9 Updated Jan 17, 2025
Python 38 3 Updated Dec 30, 2024
Python 5 Updated Dec 14, 2024

使用alphazero算法打造属于你自己的象棋AI

Python 244 57 Updated Sep 1, 2022

ELF: a platform for game research with AlphaGoZero/AlphaZero reimplementation

C++ 3,388 567 Updated Jun 21, 2019

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

Python 3,440 983 Updated Apr 24, 2024

32 projects in the framework of Deep Reinforcement Learning algorithms: Q-learning, DQN, PPO, DDPG, TD3, SAC, A2C and others. Each project is provided with a detailed training log.

Jupyter Notebook 832 202 Updated Jun 17, 2021
Python 14 Updated May 14, 2024

Paper collections of multi-modal LLM for Math/STEM/Code.

80 3 Updated Feb 21, 2025

An Open Large Reasoning Model for Real-World Solutions

Python 1,472 78 Updated Mar 4, 2025

LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning

Python 1,887 70 Updated Jan 22, 2025

The official implement of VITA, VITA15 and LongVITA.

Python 18 1 Updated Dec 13, 2024
Next