Lists (1)
Sort Name ascending (A-Z)
Starred repositories
[arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
Text and image to video generation: Kandinsky 4.0 (2024)
Official Implementations for Paper - AniDoc: Animation Creation Made Easier
Inference and training library for high-quality TTS models.
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Convert your videos to densepose and use it on MagicAnimate
Open-Sora: Democratizing Efficient Video Production for All
DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion
Webui for using XTTS and for finetuning it
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
A python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.
Sagemaker and Kaggle notebook for Stable Diffusion WebUI, using Pinggy and Zrok for tunneling.
Generate video from text using AI