-
PKU->University of Amsterdam-> LMU
- Munich
- http://taohu.me
- https://scholar.google.com/citations?user=EchdyZEAAAAJ&hl=en
- in/taohu620
- @vtaohu
Highlights
- Pro
Lists (2)
Sort Name ascending (A-Z)
Stars
Inference-time scaling of Flux beyond denoising steps.
[ICLR 2025] Official PyTorch Implementation of Gated Delta Networks: Improving Mamba2 with Delta Rule
Janus-Series: Unified Multimodal Understanding and Generation Models
verl: Volcano Engine Reinforcement Learning for LLMs
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
[NeurIPS2024] Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…
A lightweight framework for building LLM-based agents
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
Unsupervised text tokenizer for Neural Network-based text generation.
[NeurIPS 2024] Boosting the performance of consistency models with PCM!
Does VLM Classification Benefit from LLM Description Semantics? (AAAI 2025)
The official Pytorch implementation of “BAD: Bidirectional Auto-regressive Diffusion for Text-to-Motion Generation”
A framework for few-shot evaluation of language models.
[ICLR 2025][arXiv:2406.07548] Image and Video Tokenization with Binary Spherical Quantization
Official Jax Implementation of MD4 Masked Diffusion Models
official code for Diff-Instruct algorithm for one-step diffusion distillation
Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis
[NeurIPS 2024] Official implementation of "Unlocking the Capabilities of Masked Generative Models for Image Synthesis via Self-Guidance"