[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…

Python 4,053 304 Updated Oct 6, 2024

ajay-sainy / Wav2Lip-GFPGAN

High quality Lip sync

Python 1,009 261 Updated Jul 30, 2024

Zz-ww / SadTalker-Video-Lip-Sync

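This project implements Wav2Lip-style video lip synthesis based on SadTalker. It generates lip shapes driven by speech, taking a video file as input, and applies a configurable enhancement method to the facial region to sharpen the synthesized lip (face) area and improve the clarity of the generated lips. It uses DAIN, a deep learning frame interpolation algorithm, to add frames to the generated video and fill in the motion transitions between synthesized lip frames, making the result smoother, more realistic, and more natural.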

Python 1,833 316 Updated Jun 4, 2023

primepake / wav2lip_288x288

Python 563 144 Updated Mar 1, 2024

bytedance / music_source_separation

Python 1,260 194 Updated Apr 18, 2024

PKU-YuanGroup / Open-Sora-Plan

This project aims to reproduce Sora (OpenAI's T2V model). We hope the open-source community will contribute to this project.

Python 11,309 1,009 Updated Oct 8, 2024

labmlai / annotated_deep_learning_paper_implementations

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 54,639 5,637 Updated Aug 24, 2024

DaddyJin / awesome-faceReenactment

papers about Face Reenactment/Talking Face Generation

446 45 Updated Jan 20, 2024

showlab / Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

3,242 193 Updated Oct 9, 2024

dmlc / decord

An efficient video loader for deep learning with smart shuffling that's super easy to digest

C++ 1,838 160 Updated Jul 17, 2024

guoyww / AnimateDiff

Official implementation of AnimateDiff.

Python 10,380 852 Updated Jul 31, 2024

RenYurui / PIRender

The source code of the ICCV2021 paper "PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering"

Python 514 65 Updated Jan 5, 2022

hzwer / ECCV2022-RIFE

ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation

Python 4,408 438 Updated Sep 9, 2024

sczhou / CodeFormer

[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer

Python 15,507 3,269 Updated Oct 9, 2024

Rudrabha / Wav2Lip

This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, please try out Sync Labs.

Python 10,419 2,235 Updated Sep 24, 2024

JosephPai / Awesome-Talking-Face

📖 A curated list of resources dedicated to talking face.

1,296 110 Updated Oct 3, 2024

ali-vilab / dreamtalk

Official implementation of the paper "DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models"

Python 1,571 191 Updated Jan 15, 2024

soumik-kanad / diff2lip

Python 302 36 Updated Aug 16, 2024

AliaksandrSiarohin / video-preprocessing

Python 509 135 Updated Dec 8, 2022