Stars
😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Clone a voice in 5 seconds to generate arbitrary speech in real-time
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
Implementation of Korean FastSpeech2
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
An unofficial PyTorch implementation of the audio LM VALL-E
This github contains the network architectures of NeuralVoicePuppetry.
This repository contains a PyTorch implementation of "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis".
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
Instant neural graphics primitives: lightning fast NeRF and more
Duck-themed multi-user virtual spaces in WebVR. Built with A-Frame.
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
This is an official GitHub repository for the paper, "Engorgio: Neural Enhancement at Scale"
Simple MPL-2.0-licensed C++ geometry processing library.
A collaboration friendly studio for NeRFs
Yet another PyTorch implementation of Stable Diffusion (probably easy to read)
ICASSP 2022: "Text2Video: text-driven talking-head video synthesis with phonetic dictionary".
A C++ OBJ Model Loader that will parse .obj & .mtl Files into Indices, Vertices, Materials, and Mesh Structures.
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
My personal website - built with React, React-Router, React-Snap for Static-Export, and GitHub Pages.
Official repository for CVPR 2022 paper: I M Avatar: Implicit Morphable Head Avatars from Videos
SwinIR: Image Restoration Using Swin Transformer (official repository)
High-Resolution 3D Human Digitization from A Single Image.
Real-time face swap for PC streaming or video calls