Stars
(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"
teowu / A-Bench
Forked from Q-Future/A-Bench[LMM + AIGC] What do we expect from LMMs as AIGI evaluators and how do they perform?
Lumina-T2X is a unified framework for Text to Any Modality Generation
Video-Infinity generates long videos quickly using multiple GPUs without extra training.
The public code for "PromptIQA: Boosting the Performance and Generalization for No-Reference Image Quality Assessment via Prompts"
③[ICML2024] [IQA, IAA, VQA] All-in-one Foundation Model for visual scoring. Can efficiently fine-tune to downstream datasets.
[TMM-2024] Pytorch implementation of "Opinion-Unaware Blind Image Quality Assessment using Multi-Scale Deep Feature Statistics".
Open-Sora: Democratizing Efficient Video Production for All
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising