Starred repositories
Command-line program to download videos from YouTube.com and other video sites
An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
An open-source NLP research library, built on PyTorch.
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
StyleGAN2 - Official TensorFlow Implementation
vits2 backbone with multilingual-bert
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
Python client for Baidu Yun (Personal Cloud Storage) 百度云/百度网盘Python客户端
PULSE: Self-Supervised Photo Upsampling via Latent Space Exploration of Generative Models
PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, Wav2Lip, picture repair, image editing, photo2cartoon, image style transfer, GPEN, and so on.
Differentiable ODE solvers with full GPU support and O(1)-memory backpropagation.
A PyTorch implementation of NeRF (Neural Radiance Fields) that reproduces the results.
实战🐍多种网站、电商数据爬虫🕷。包含🕸:淘宝商品、微信公众号、大众点评、企查查、招聘网站、闲鱼、阿里任务、博客园、微博、百度贴吧、豆瓣电影、包图网、全景网、豆瓣音乐、某省药监局、搜狐新闻、机器学习文本采集、fofa资产采集、汽车之家、国家统计局、百度关键词收录数、蜘蛛泛目录、今日头条、豆瓣影评、携程、小米应用商店、安居客、途家民宿❤️❤️❤️。微信爬虫展示项目:
Pretrained Pytorch face detection (MTCNN) and facial recognition (InceptionResnet) models
Code for robust monocular depth estimation described in "Ranftl et. al., Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer, TPAMI 2022"
A Python framework for high performance GPU simulation and graphics
FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs o…
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
Python library for loading and using triangular meshes.
Resources and Implementations of Generative Adversarial Nets: GAN, DCGAN, WGAN, CGAN, InfoGAN
FILM: Frame Interpolation for Large Motion, In ECCV 2022.
GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code
This library provides common speech features for ASR including MFCCs and filterbank energies.
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
ICCV2019 - Learning to Paint With Model-based Deep Reinforcement Learning
DECA: Detailed Expression Capture and Animation (SIGGRAPH 2021)