Lists (1)
Sort Name ascending (A-Z)
Stars
Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
The web framework for content-driven websites. ⭐️ Star to support our work!
A set of beautifully-designed, accessible components and a code distribution platform. Works with your favorite frameworks. Open Source. Open Code.
A curated list of awesome things related to shadcn/ui.
A tiny (286 bytes) state manager for React/RN/Preact/Vue/Svelte with many atomic tree-shakable stores
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Bring portraits to life in Real Time!onnx/tensorrt support!实时肖像驱动!
[NeurIPS 2024] The official implementation of HairFastGAN. A framework for virtual hairstyle fitting.
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks" (TMLR 2024)
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
A simple, high-quality voice conversion tool focused on ease of use and performance.
[CVPR2024] StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On
[AAAI 2025] Official implementation of "OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on"
[AAAI 2025] Official codes of "ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models".
Replicate the design of mobbin.com to test the news components of shadcn-ui. #builtinpublic
Official implementation of CVPR 2024 paper: "FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition"
Official implementation of "DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization" (CVPR 2024)
Custom Nodes for ComfyUI that perform color grading based on the latent tensor value range
Auto detecting, masking and inpainting with detection model.
Unofficial implementation of BRIA RMBG Model for ComfyUI
Foundational model for human-like, expressive TTS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
LAVIS - A One-stop Library for Language-Vision Intelligence
Official implementations for paper: Anydoor: zero-shot object-level image customization