Stars
Stable Diffusion web UI
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
Official Code for DragGAN (SIGGRAPH 2023)
A generative speech model for daily dialogue.
Open-Sora: Democratizing Efficient Video Production for All
Simple, unified interface to multiple Generative AI providers
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组
[arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution