Align Anything: Training All-modality Model with Feedback

Python 2,939 382 Updated Mar 21, 2025

Witness the aha moment of VLM with less than $3.

Python 3,345 261 Updated Mar 1, 2025

A fork to add multimodal model training to open-r1

Python 1,097 59 Updated Feb 8, 2025

Solve Visual Understanding with Reinforced VLMs

Python 4,257 264 Updated Mar 20, 2025

A high-performance LLM inference API and Chat UI that integrates DeepSeek R1's CoT reasoning traces with Anthropic Claude models.

Rust 4,881 381 Updated Feb 4, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,297 1,432 Updated Mar 10, 2025

Fully open reproduction of DeepSeek-R1

Python 23,122 2,104 Updated Mar 21, 2025

Collect every awesome work about r1!

Python 302 9 Updated Mar 21, 2025

Democratizing Reinforcement Learning for LLMs

Python 2,097 183 Updated Feb 16, 2025

Official code for "Endpoints Weight Fusion for Class Incremental Semantic Segmentation"

Python 33 5 Updated Sep 15, 2023

[MICCAI 2023] Continual Learning for Abdominal Multi-Organ and Tumor Segmentation

Python 69 9 Updated Jul 30, 2024

Official code of CVPR 2021's PLOP: Learning without Forgetting for Continual Semantic Segmentation

Python 151 22 Updated Feb 9, 2022

s1: Simple test-time scaling

Python 6,032 706 Updated Mar 6, 2025

Code for Heima

Python 37 3 Updated Feb 11, 2025

[NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective

Python 64 2 Updated Oct 31, 2024

GEOBench-VLM: Benchmarking Vision-Language Models for Geospatial Tasks

33 1 Updated Dec 3, 2024

CSSegmentation: An Open Source Continual Semantic Segmentation Toolbox Based on PyTorch.

Python 33 4 Updated Feb 6, 2024

[Survey] Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

393 9 Updated Jan 17, 2025

Inference code for Llama models

Python 57,913 9,718 Updated Jan 26, 2025

Rethinking Step-by-step Visual Reasoning in LLMs

Python 277 17 Updated Jan 24, 2025

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,330 518 Updated May 3, 2024

PyTorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

Python 982 43 Updated Mar 18, 2025

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,622 73 Updated Aug 15, 2024

Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 1,022 46 Updated Feb 23, 2025

Multimodal Models in Real World

Jupyter Notebook 452 20 Updated Feb 24, 2025

[ICLR 2025] VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation

Python 246 7 Updated Jan 22, 2025

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 7,025 450 Updated Jan 12, 2025

[CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".

Python 291 1 Updated Mar 5, 2025

[NeurIPS'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning

Python 274 12 Updated Dec 22, 2024