Czi24

Follow

Czi. Czi24

Follow

6 followers · 1 following

Achievements

Achievements

Stars

PKU-Alignment / align-anything

Align Anything: Training All-modality Model with Feedback

Python 2,939 382 Updated Mar 21, 2025

Deep-Agent / R1-V

Witness the aha moment of VLM with less than $3.

Python 3,345 261 Updated Mar 1, 2025

EvolvingLMMs-Lab / open-r1-multimodal

A fork to add multimodal model training to open-r1

Python 1,097 59 Updated Feb 8, 2025

om-ai-lab / VLM-R1

Solve Visual Understanding with Reinforced VLMs

Python 4,257 264 Updated Mar 20, 2025

getAsterisk / deepclaude

A high-performance LLM inference API and Chat UI that integrates DeepSeek R1's CoT reasoning traces with Anthropic Claude models.

Rust 4,881 381 Updated Feb 4, 2025

Jiayi-Pan / TinyZero

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,297 1,432 Updated Mar 10, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 23,122 2,104 Updated Mar 21, 2025

modelscope / awesome-deep-reasoning

Collect every awesome work about r1!

Python 302 9 Updated Mar 21, 2025

agentica-project / deepscaler

Democratizing Reinforcement Learning for LLMs

Python 2,097 183 Updated Feb 16, 2025

schuy1er / EWF_official

An official code for "Endpoints Weight Fusion for Class Incremental Semantic Segmentation"

Python 33 5 Updated Sep 15, 2023

MrGiovanni / ContinualLearning

[MICCAI 2023] Continual Learning for Abdominal Multi-Organ and Tumor Segmentation

Python 69 9 Updated Jul 30, 2024

arthurdouillard / CVPR2021_PLOP

Official code of CVPR 2021's PLOP: Learning without Forgetting for Continual Semantic Segmentation

Python 151 22 Updated Feb 9, 2022

simplescaling / s1

s1: Simple test-time scaling

Python 6,032 706 Updated Mar 6, 2025

shawnricecake / Heima

Code for Heima

Python 37 3 Updated Feb 11, 2025

DAMO-NLP-SG / DiGIT

[NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective

Python 64 2 Updated Oct 31, 2024

The-AI-Alliance / GEO-Bench-VLM

GEOBench-VLM: Benchmarking Vision-Language Models for Geospatial Tasks

33 1 Updated Dec 3, 2024

SegmentationBLWX / cssegmentation

CSSegmentation: An Open Source Continual Semantic Segmentation Toolbox Based on PyTorch.

Python 33 4 Updated Feb 6, 2024

LMM101 / Awesome-Multimodal-Next-Token-Prediction

[Survey] Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

393 9 Updated Jan 17, 2025

meta-llama / llama

Inference code for Llama models

Python 57,913 9,718 Updated Jan 26, 2025

mbzuai-oryx / LlamaV-o1

Rethinking Step-by-step Visual Reasoning in LLMs

Python 277 17 Updated Jan 24, 2025

jzhang38 / TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,330 518 Updated May 3, 2024

lucidrains / transfusion-pytorch

Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

Python 982 43 Updated Mar 18, 2025

FoundationVision / LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,622 73 Updated Aug 15, 2024

FoundationVision / Infinity

Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 1,022 46 Updated Feb 23, 2025

PKU-YuanGroup / Next-Patch-Prediction

Python 132 3 Updated Jan 2, 2025

AILab-CVC / SEED-X

Multimodal Models in Real World

Jupyter Notebook 452 20 Updated Feb 24, 2025

mit-han-lab / vila-u

[ICLR 2025] VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation

Python 246 7 Updated Jan 22, 2025

FoundationVision / VAR

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 7,025 450 Updated Jan 12, 2025

ByteFlow-AI / TokenFlow

[CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".

Python 291 1 Updated Mar 5, 2025

deepcs233 / Visual-CoT

[Neurips'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning

Python 274 12 Updated Dec 22, 2024