![c logo](https://raw.githubusercontent.com/github/explore/f3e22f0dca2be955676bc70d6214b95b13354ee8/topics/c/c.png)
Starred repositories
Stable Diffusion web UI
A feature-rich command-line audio/video downloader
Robust Speech Recognition via Large-Scale Weak Supervision
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
Real-time face swap for PC streaming or video calls
A modular graph-based Retrieval-Augmented Generation (RAG) system
GUI for a Vocal Remover that uses Deep Neural Networks.
Rembg is a tool to remove images background
Bringing Old Photo Back to Life (CVPR 2020 oral)
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
Easy-to-use image segmentation library with awesome pre-trained model zoo, supporting wide-range of practical tasks in Semantic Segmentation, Interactive Segmentation, Panoptic Segmentation, Image …
The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."
Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, Wav2Lip, picture repair, image editing, photo2cartoon, image style transfer, GPEN, and so on.
Real-Time High-Resolution Background Matting
ECCV18 Workshops - Enhanced SRGAN. Champion PIRM Challenge on Perceptual Super-Resolution. The training codes are in BasicSR.
🔎 Super-scale your images and run experiments with Residual Dense and Adversarial Networks.
A social networking service scraper in Python
Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts
Official tensorflow implementation for CVPR2020 paper “Learning to Cartoonize Using White-box Cartoon Representations”
A Trimap-Free Portrait Matting Solution in Real Time [AAAI 2022]