Stars
Stable Diffusion web UI
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, with automatic mixed precision (including fp8) and easy-to-configure FSDP and DeepSpeed support
Neural net that relights portrait images given a target lighting condition
Extension of Wav2Lip repository for processing high-quality videos.
(CVPR'20 Oral) Unsupervised Learning of Probably Symmetric Deformable 3D Objects from Images in the Wild
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
[ECCV 2022] StyleLight: HDR Panorama Generation for Lighting Estimation and Editing
[ICLR'23 Spotlight & IJCV'24] MapTR: Structured Modeling and Learning for Online Vectorized HD Map Construction
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
EVA Series: Visual Representation Fantasies from BAAI
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
Online HD Map Construction (CVPR 2023)
Deep learning model converter for PaddlePaddle.
Real-time face swap for PC streaming or video calls
DeepFaceLab is the leading software for creating deepfakes.
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Out of time: automated lip sync in the wild
VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, please try out Sync Labs
PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, Wav2Lip, picture repair, image editing, photo2cartoon, image style transfer, GPEN, and so on.
GRAM: Generative Radiance Manifolds for 3D-Aware Image Generation (CVPR 2022 Oral)