Skip to content
View dongzhuoyao's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.

Highlights

  • Pro

Organizations

@CompVis

Block or report dongzhuoyao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Inference-time scaling of Flux beyond denoising steps.

Python 88 8 Updated Mar 3, 2025

Fetch arxiv data to LLM-friendly text

JavaScript 75 13 Updated Feb 26, 2025

[ICLR 2025] Official PyTorch Implementation of Gated Delta Networks: Improving Mamba2 with Delta Rule

Python 137 9 Updated Feb 23, 2025

s1: Simple test-time scaling

Python 5,805 659 Updated Feb 23, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 16,530 2,169 Updated Feb 1, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 4,111 377 Updated Mar 3, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 10,874 1,390 Updated Feb 1, 2025

🙌 OpenHands: Code Less, Make More

Python 48,604 5,343 Updated Mar 3, 2025

Stick-breaking attention

Python 44 1 Updated Jan 12, 2025
48 Updated Jan 22, 2025

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

Python 1,745 123 Updated Dec 6, 2024

[NeurIPS2024] Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model

Python 67 3 Updated Dec 25, 2024

Documentation that simply works

Python 22,304 3,672 Updated Mar 3, 2025

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 7,600 486 Updated Feb 28, 2025

A lightweight framework for building LLM-based agents

Python 2,054 216 Updated Feb 10, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,455 242 Updated Feb 20, 2025

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Rust 9,431 847 Updated Feb 16, 2025

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 10,645 1,200 Updated Mar 1, 2025

[NeurIPS 2024] Boosting the performance of consistency models with PCM!

Python 440 17 Updated Dec 11, 2024

Does VLM Classification Benefit from LLM Description Semantics? (AAAI 2025)

Python 14 Updated Jan 6, 2025

The official Pytorch implementation of “BAD: Bidirectional Auto-regressive Diffusion for Text-to-Motion Generation”

Python 40 3 Updated Oct 22, 2024

A framework for few-shot evaluation of language models.

Python 8,069 2,160 Updated Mar 3, 2025

Code for BLT research paper

Python 1,420 108 Updated Mar 1, 2025

[ICLR 2025][arXiv:2406.07548] Image and Video Tokenization with Binary Spherical Quantization

Python 136 Updated Jun 12, 2024

Official Jax Implementation of MD4 Masked Diffusion Models

Python 61 5 Updated Feb 27, 2025

official code for Diff-Instruct algorithm for one-step diffusion distillation

Python 69 3 Updated Jan 9, 2025

Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis

Jupyter Notebook 455 16 Updated May 24, 2024
1 Updated Oct 31, 2024

[NeurIPS 2024] Official implementation of "Unlocking the Capabilities of Masked Generative Models for Image Synthesis via Self-Guidance"

Python 10 2 Updated Dec 4, 2024

Autonomous agents for everyone

TypeScript 14,815 4,746 Updated Mar 3, 2025
Next