-
Beijing Jiaotong University
- Beijing, China
Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Stars
A PyTorch implementation of the Transformer model in "Attention is All You Need".
Python client for Baidu Yun (Personal Cloud Storage) 百度云/百度网盘Python客户端
2024 up-to-date list of DATASETS, CODEBASES and PAPERS on Multi-Task Learning (MTL), from Machine Learning perspective.
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."
Download and preprocess voxceleb datasets.
Character Animation (AnimateAnyone, Face Reenactment)
You can easily calculate FVD, PSNR, SSIM, LPIPS for evaluating the quality of generated or predicted videos.
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…
本项目基于SadTalkers实现视频唇形合成的Wav2lip。通过以视频文件方式进行语音驱动生成唇形,设置面部区域可配置的增强方式进行合成唇形(人脸)区域画面增强,提高生成唇形的清晰度。使用DAIN 插帧的DL算法对生成视频进行补帧,补充帧间合成唇形的动作过渡,使合成的唇形更为流畅、真实以及自然。
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
papers about Face Reenactment/Talking Face Generation
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
An efficient video loader for deep learning with smart shuffling that's super easy to digest
The source code of the ICCV2021 paper "PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering"
ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
📖 A curated list of resources dedicated to talking face.
Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models