Tao Hu dongzhuoyao

💭

I may be slow to respond.

Ommer-Lab Postdoc; Building something new in stealth mode

145 followers · 551 following

PKU -> University of Amsterdam -> LMU
Munich
http://taohu.me
https://scholar.google.com/citations?user=EchdyZEAAAAJ&hl=en
in/taohu620
@vtaohu

Lists (2)

🔮 Future ideas

ToRead

1 repository

Stars

sayakpaul / tt-scale-flux

Inference-time scaling of Flux beyond denoising steps.

Python · 88 stars · 8 forks · Updated Mar 3, 2025

jerpint / arxiv-txt

Fetch arxiv data to LLM-friendly text

JavaScript · 75 stars · 13 forks · Updated Feb 26, 2025

NVlabs / GatedDeltaNet

[ICLR 2025] Official PyTorch Implementation of Gated Delta Networks: Improving Mamba2 with Delta Rule

Python · 137 stars · 9 forks · Updated Feb 23, 2025

simplescaling / s1

s1: Simple test-time scaling

Python · 5,805 stars · 659 forks · Updated Feb 23, 2025

deepseek-ai / Janus

Janus-Series: Unified Multimodal Understanding and Generation Models

Python · 16,530 stars · 2,169 forks · Updated Feb 1, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python · 4,111 stars · 377 forks · Updated Mar 3, 2025

Jiayi-Pan / TinyZero

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python · 10,874 stars · 1,390 forks · Updated Feb 1, 2025

All-Hands-AI / OpenHands

🙌 OpenHands: Code Less, Make More

Python · 48,604 stars · 5,343 forks · Updated Mar 3, 2025

shawntan / stickbreaking-attention

Stick-breaking attention

Python · 44 stars · 1 fork · Updated Jan 12, 2025

CompVis / tread

48 stars · Updated Jan 22, 2025

eloialonso / diamond

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

Python · 1,745 stars · 123 forks · Updated Dec 6, 2024

YuHengsss / MSVMamba

[NeurIPS2024] Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model

Python · 67 stars · 3 forks · Updated Dec 25, 2024

squidfunk / mkdocs-material

Documentation that simply works

Python · 22,304 stars · 3,672 forks · Updated Mar 3, 2025

NVIDIA / Cosmos

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python · 7,600 stars · 486 forks · Updated Feb 28, 2025

InternLM / lagent

A lightweight framework for building LLM-based agents

Python · 2,054 stars · 216 forks · Updated Feb 10, 2025

facebookresearch / lingua

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python · 4,455 stars · 242 forks · Updated Feb 20, 2025

huggingface / tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Rust · 9,431 stars · 847 forks · Updated Feb 16, 2025

google / sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation.

C++ · 10,645 stars · 1,200 forks · Updated Mar 1, 2025

G-U-N / Phased-Consistency-Model

[NeurIPS 2024] Boosting the performance of consistency models with PCM!

Python · 440 stars · 17 forks · Updated Dec 11, 2024

CompVis / DisCLIP

Does VLM Classification Benefit from LLM Description Semantics? (AAAI 2025)

Python · 14 stars · Updated Jan 6, 2025

RohollahHS / BAD

The official Pytorch implementation of “BAD: Bidirectional Auto-regressive Diffusion for Text-to-Motion Generation”

Python · 40 stars · 3 forks · Updated Oct 22, 2024

EleutherAI / lm-evaluation-harness

A framework for few-shot evaluation of language models.

Python · 8,069 stars · 2,160 forks · Updated Mar 3, 2025

facebookresearch / blt

Code for BLT research paper

Python · 1,420 stars · 108 forks · Updated Mar 1, 2025

zhaoyue-zephyrus / bsq-vit

[ICLR 2025][arXiv:2406.07548] Image and Video Tokenization with Binary Spherical Quantization

Python · 136 stars · Updated Jun 12, 2024

google-deepmind / md4

Official Jax Implementation of MD4 Masked Diffusion Models

Python · 61 stars · 5 forks · Updated Feb 27, 2025

pkulwj1994 / diff_instruct

official code for Diff-Instruct algorithm for one-step diffusion distillation

Python · 69 stars · 3 forks · Updated Jan 9, 2025

tgxs002 / HPSv2

Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis

Jupyter Notebook · 455 stars · 16 forks · Updated May 24, 2024

uclaml / DNDM

1 star · Updated Oct 31, 2024

JiwanHur / UnlockMGM

[NeurIPS 2024] Official implementation of "Unlocking the Capabilities of Masked Generative Models for Image Synthesis via Self-Guidance"

Python · 10 stars · 2 forks · Updated Dec 4, 2024

elizaOS / eliza

Autonomous agents for everyone

TypeScript · 14,815 stars · 4,746 forks · Updated Mar 3, 2025