techmonsterwang

Starred repositories

Official Implementation of Rectified Flow (ICLR2023 Spotlight)

Python 1,138 64 Updated Jul 20, 2024

Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"

Python 471 58 Updated Jan 17, 2025

Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning'

Python 1,159 52 Updated Mar 12, 2025

Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models

Python 267 21 Updated Feb 23, 2025

HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo

Python 1,015 74 Updated Mar 12, 2025

Large World Model -- Modeling Text and Video with Millions Context

Python 7,248 557 Updated Oct 19, 2024

MM-EUREKA: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning

Python 289 6 Updated Mar 12, 2025

Paper List of Inference/Test Time Scaling/Computing

Python 89 2 Updated Mar 10, 2025

[ICLR 2025] Autoregressive Video Generation without Vector Quantization

Python 406 11 Updated Mar 3, 2025

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

4,828 502 Updated Sep 25, 2024

FlashMLA: Efficient MLA decoding kernels

C++ 11,271 790 Updated Mar 1, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,145 623 Updated Mar 12, 2025

A very simple GRPO implementation for reproducing r1-like LLM thinking.

Python 721 59 Updated Feb 28, 2025

📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.

397 16 Updated Jan 18, 2025

s1: Simple test-time scaling

Python 5,939 684 Updated Mar 6, 2025

🚀 Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton

Python 5 Updated Jan 27, 2025

code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Python 767 39 Updated Mar 6, 2025

Fully open reproduction of DeepSeek-R1

Python 22,680 2,037 Updated Mar 12, 2025

[ICLR 2025] Reconstructive Visual Instruction Tuning

Python 68 3 Updated Mar 1, 2025
Python 2,331 165 Updated Mar 6, 2025
Python 1,043 79 Updated Jan 8, 2025

Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 1,001 42 Updated Feb 23, 2025
Python 15 Updated Dec 11, 2024

Model Compression Toolbox for Large Language Models and Diffusion Models

Python 370 29 Updated Feb 21, 2025

Code for the NeurIPS 2024 paper: QuaRot, end-to-end 4-bit inference of large language models.

Python 356 33 Updated Nov 26, 2024

Code repo for the paper "SpinQuant LLM quantization with learned rotations"

Python 228 29 Updated Feb 14, 2025
Python 79 3 Updated Nov 27, 2024

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 3,543 213 Updated Mar 12, 2025

[CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient

Python 86 4 Updated Mar 2, 2025