Lists (7)
Sort Name ascending (A-Z)
Stars
Official repository of In-Context LoRA for Diffusion Transformers
Training-free Regional Prompting for Diffusion Transformers 🔥
Code for NeurIPS 2024 paper - The GAN is dead; long live the GAN! A Modern Baseline GAN - by Huang et al.
Scalable RL solution for advanced reasoning of language models
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…
Official Implementations for Paper - AniDoc: Animation Creation Made Easier
[Siggraph Asia 2024] Follow-Your-Emoji: This repo is the official implementation of "Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation"
[ARXIV'24] SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints
利用AI大模型,一键解说并剪辑视频; Using AI models to automatically provide commentary and edit videos with a single click.
Taming FLUX for Image Inversion & Editing; OpenSora for Video Inversion & Editing! (Official implementation for Taming Rectified Flow for Inversion and Editing.)
Official PyTorch implementation of "Framer: Interactive Frame Interpolation".
This Git offers a faster and easy-to-use 3DMM tracking pipeline with FaceVerse V4 (CVPR 2022), which is a full head model that includes separate eyeballs, teeth, and tongue.
An inference server for your machine learning models, including support for multiple frameworks, multi-model serving and more
A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.
Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
Easy-to-Use RAG Framework; CCF AIOps International Challenge 2024 Top3 Solution; CCF AIOps 国际挑战赛 2024 季军方案
SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.
[arXiv 2024] InstantSwap: This repo is the official implementation of "InstantSwap: Fast Customized Concept Swapping across Sharp Shape Differences"
HunyuanVideo: A Systematic Framework For Large Video Generation Model
AniGS: Animatable Gaussian Avatar from a Single Image with Inconsistent Gaussian Reconstruction
Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (…
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer