linhaojia13

Follow

🎯

Focusing

林豪佳 linhaojia13

🎯

Focusing

Follow

Email: [email protected] xdm，冲！

38 followers · 85 following

Xiamen University
Xiamen

Achievements

Achievements

Stars

MAC-AutoML / QuoTA

This is the official implementation of our paper "QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehension"

Python 55 Updated Mar 10, 2025

chengazhen / cursor-auto-free

auto sign cursor

Python 6,207 909 Updated Mar 8, 2025

Open-Reasoner-Zero / Open-Reasoner-Zero

Official Repo for Open-Reasoner-Zero

Python 1,559 73 Updated Mar 5, 2025

dhcode-cpp / X-R1

minimal-cost for training 0.5B R1-Zero

Python 624 80 Updated Feb 26, 2025

om-ai-lab / VLM-R1

Solve Visual Understanding with Reinforced VLMs

Python 3,963 243 Updated Mar 9, 2025

VITA-MLLM / Long-VITA

✨✨Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy

Python 235 26 Updated Mar 8, 2025

Kwai-YuanQi / MM-RLHF

The Next Step Forward in Multimodal LLM Alignment

Python 120 3 Updated Mar 5, 2025

XU-YIJIE / grpo-flat

Train your grpo with zero dataset and low resources, 8bit/4bit/lora/qlora supported, multi-gpu supported ...

Python 58 7 Updated Feb 26, 2025

Mryangkaitong / deepseek-r1-gsm8k

Python 29 4 Updated Feb 10, 2025

hkust-nlp / simpleRL-reason

This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python 3,108 229 Updated Feb 19, 2025

unslothai / unsloth

Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥

Python 34,111 2,482 Updated Mar 10, 2025

EvolvingLMMs-Lab / open-r1-multimodal

A fork to add multimodal model training to open-r1

Python 1,008 51 Updated Feb 8, 2025

FanqingM / R1-Multimodal-Journey

A jounery to real multimodel R1 ! We are doing on large-scale experiment

Python 261 5 Updated Mar 8, 2025

Unakar / Logic-RL

Reproduce R1 Zero on Logic Puzzle

Python 2,072 134 Updated Mar 3, 2025

xjtupanda / Sparrow

Repo for paper "T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs"

Jupyter Notebook 49 Updated Mar 7, 2025

pengshuai-rin / MultiMath

MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models

Python 26 1 Updated Jan 22, 2025

richards199999 / Thinking-Claude

Let your Claude able to think

TypeScript 14,665 1,706 Updated Mar 10, 2025

LMM101 / Awesome-Multimodal-Next-Token-Prediction

[Survey] Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

382 9 Updated Jan 17, 2025

si0wang / VisVM

Python 38 3 Updated Dec 30, 2024

Nagi-ovo / alphazero-gomoku

Python 5 Updated Dec 14, 2024

tensorfly-gpu / aichess

使用alphazero算法打造属于你自己的象棋AI

Python 244 57 Updated Sep 1, 2022

pytorch / ELF

ELF: a platform for game research with AlphaGoZero/AlphaZero reimplementation

C++ 3,388 567 Updated Jun 21, 2019

junxiaosong / AlphaZero_Gomoku

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

Python 3,440 983 Updated Apr 24, 2024

Rafael1s / Deep-Reinforcement-Learning-Algorithms

32 projects in the framework of Deep Reinforcement Learning algorithms: Q-learning, DQN, PPO, DDPG, TD3, SAC, A2C and others. Each project is provided with a detailed training log.

Jupyter Notebook 832 202 Updated Jun 17, 2021

deepseek-ai / DeepSeek-V3

Python 91,632 14,833 Updated Feb 24, 2025

SCNU203 / GeoQA-Plus

Python 14 Updated May 14, 2024

InfiMM / Awesome-Multimodal-LLM-for-Math-STEM

Paper collections of multi-modal LLM for Math/STEM/Code.

80 3 Updated Feb 21, 2025

AIDC-AI / Marco-o1

An Open Large Reasoning Model for Real-World Solutions

Python 1,472 78 Updated Mar 4, 2025

PKU-YuanGroup / LLaVA-CoT

LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning

Python 1,887 70 Updated Jan 22, 2025

Tencent / VITA

The official implement of VITA, VITA15 and LongVITA.

Python 18 1 Updated Dec 13, 2024