Lists (5)
Sort Name ascending (A-Z)
Stars
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
SoftVC VITS Singing Voice Conversion
GUI for a Vocal Remover that uses Deep Neural Networks.
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
🔥 2D and 3D Face alignment library build using pytorch
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
BoxMOT: pluggable SOTA tracking modules for segmentation, object detection and pose estimation models
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
一些关于目标检测的脚本的改进思路代码,详细请看readme.md
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
An anomaly detection library comprising state-of-the-art algorithms and features such as experiment management, hyper-parameter optimization, and edge inference.
openvpi / DiffSinger
Forked from MoonInTheRiver/DiffSingerAn advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
Effortless AI-assisted data labeling with AI support from YOLO, Segment Anything (SAM+SAM2), MobileSAM!!
Using modified BiSeNet for face parsing in PyTorch
Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).
A large-scale face dataset for face parsing, recognition, generation and editing.
YOLO5Face: Why Reinventing a Face Detector (https://arxiv.org/abs/2105.12931) ECCV Workshops 2022)
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
[ICCV 2019] "DeblurGAN-v2: Deblurring (Orders-of-Magnitude) Faster and Better" by Orest Kupyn, Tetiana Martyniuk, Junru Wu, Zhangyang Wang