Stars
Official Code for DragGAN (SIGGRAPH 2023)
Instant voice cloning by MIT and MyShell. Audio foundation model.
Industry-leading face manipulation platform
Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or person from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.
Realtime Voice Changer
⚡️HivisionIDPhotos: a lightweight and efficient AI tool and algorithm for making ID photos.
Netflix-level subtitle cutting, translation, alignment, and even dubbing: a one-click, fully automated AI subtitle team for videos
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".
Use LLMs to dig out what you care about from massive amounts of information and a variety of sources daily.
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
Understand Human Behavior to Align True Needs
roop extension for StableDiffusion web-ui
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2E, F5-TTS, CosyVoice), with Whisper audio processing, RVC voice changer, YouTube downlo…
Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation
An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Conversations with GenAI
Just draw a bounding box around the object you want to remove.
Custom node pack for ComfyUI. This pack helps to conveniently enhance images through Detector, Detailer, Upscaler, Pipe, and more.
Improve the access speed and stability in China of web pages hosted on Cloudflare, Vercel, or Netlify by merely changing your CNAME record. Cloudflare preferred domains | Cloudflare preferred…
Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"
[CVPR 2025] Learning Flow Fields in Attention for Controllable Person Image Generation
ComfyUI nodes for the Ultimate Stable Diffusion Upscale script by Coyote-A.
MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision
SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images
[768 Resolution] [Any "SDXL" Model] [Various Conditions] [Arbitrary Views] Official impl. of "MV-Adapter: Multi-view Consistent Image Generation Made Easy"