The University of Hong Kong
Pokfulam, Hong Kong, PRC
(UTC +08:00) - https://www.zhihu.com/people/wang-jia-hao-hku
Starred repositories
Official Implementation of Rectified Flow (ICLR 2023 Spotlight)
Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"
Official repository of "Visual-RFT: Visual Reinforcement Fine-Tuning"
Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models
HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo
Large World Model -- Modeling Text and Video with Millions Context
MM-EUREKA: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning
Paper List of Inference/Test Time Scaling/Computing
[ICLR 2025] Autoregressive Video Generation without Vector Quantization
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
DeepEP: an efficient expert-parallel communication library
A very simple GRPO implementation for reproducing R1-like LLM thinking.
A repository organizing papers, code, and other resources related to unified multimodal models.
Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton
Code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"
Fully open reproduction of DeepSeek-R1
[ICLR 2025] Reconstructive Visual Instruction Tuning
Infinity ∞: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
Model Compression Toolbox for Large Language Models and Diffusion Models
Code for the NeurIPS 2024 paper QuaRot: end-to-end 4-bit inference for large language models.
Code repo for the paper "SpinQuant: LLM Quantization with Learned Rotations"
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
[CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient