Stars
FlagEval is an evaluation toolkit for AI large foundation models.
[CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition
VideoGen-Eval: Agent-based System for Video Generation Evaluation
roop extension for StableDiffusion web-ui
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
[NeurIPS 2024] Official implementation of "Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models"
DeepFaceLab is the leading software for creating deepfakes.
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
The benchmark of SOTA text-to-image diffusion models with a new benchmarking strategy based on MiniGPT-4, namely X-IQE.
[CSUR] A Survey on Video Diffusion Models
A curated list of recent diffusion models for video generation, editing, and various other applications.
A collection of awesome video generation studies.
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
AcadHomepage: A Modern and Responsive Academic Personal Homepage
A minimal Jekyll Theme to host your resume (CV) on GitHub with a few clicks.
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).
Open-Sora: Democratizing Efficient Video Production for All
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
[CVPR 2024] Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation