Starred repositories
Robust Speech Recognition via Large-Scale Weak Supervision
real time face swap and one-click video deepfake with only a single image
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
Instant voice cloning by MIT and MyShell. Audio foundation model.
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
Easily train a good VC model with voice data <= 10 mins!
The best and simplest free open source web page change detection, website watcher, restock monitor and notification service. Restock Monitor, change detection. Designed for simplicity - Simply moni…
リアルタイムボイスチェンジャー Realtime Voice Changer
A collection of familiar, friendly, and modern emoji from Microsoft
[ECCV 2024] codes of DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior
Extract files from any kind of container formats
🎞️ Subtitles generation tool (Web-UI + CLI + Python package) powered by OpenAI's Whisper and its variants 🎞️
Using OpenAI's Whisper to automatically generate YouTube subtitles
An extremely fast implementation of whisper optimized for Apple Silicon using MLX.
My usage of Real-ESRGAN to upscale anime, some test and results in the test_img folder