Skip to content
View yamand16's full-sized avatar
  • Germany

Block or report yamand16

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
388 results for source starred repositories
Clear filter

talking-face video editing

Python 235 32 Updated Jan 20, 2025

EmoStyle project page

Python 39 2 Updated Mar 11, 2024

KETI project

Python 6 Updated Nov 27, 2023

📖 A curated list of resources dedicated to talking face.

1,447 117 Updated Dec 23, 2024

Video Generation Foundation Models: https://saiyan-world.github.io/goku/

Python 2,407 243 Updated Feb 19, 2025

Code release for "LLMs can see and hear without any training"

Python 194 17 Updated Feb 20, 2025

The VoxTube dataset official repository

HTML 68 1 Updated Feb 14, 2024

A Comprehensive Survey of Forgetting in Deep Learning Beyond Continual Learning. TPAMI, 2024.

272 12 Updated Feb 23, 2025

[ACM MM Award] AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dataset

Python 106 9 Updated Feb 15, 2025

Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models

Python 1,676 207 Updated Jan 15, 2024
Python 990 131 Updated Oct 3, 2022

Famous Vision Language Models and Their Architectures

Markdown 651 34 Updated Feb 23, 2025

[ECCV 2024] All You Need is Your Voice: Emotional Face Representation with Audio Perspective for Emotional Talking Face Generation

4 Updated Jul 10, 2024

[ECCV 2024 Oral] EDTalk - Official PyTorch Implementation

Python 400 38 Updated Dec 31, 2024

[CVPR2020] "Detecting Attended Visual Targets in Video"

Python 189 48 Updated May 24, 2021

Face Editor for Stable Diffusion

Python 1,049 88 Updated Sep 15, 2024

Pytorch implementation of our method for high-resolution (e.g. 2048x1024) photorealistic video-to-video translation.

Python 8,647 1,205 Updated May 17, 2022

[ECCV 2022] CelebV-HQ: A Large-Scale Video Facial Attributes Dataset

Python 416 31 Updated Jan 4, 2023

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Python 4,849 601 Updated Jul 2, 2024

Face super resolution based on ESRGAN

Python 261 65 Updated Nov 7, 2020

Arbitrary-Scale Video Super-Resolution with Structural and Textural Priors (ECCV2024)

Python 148 3 Updated Aug 20, 2024

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 68,210 7,333 Updated Feb 23, 2025

Open source implementation of CVPR 2020 "Video to Events: Recycling Video Dataset for Event Cameras"

Python 340 82 Updated Mar 4, 2024

[ICML 2024] EvTexture: Event-driven Texture Enhancement for Video Super-Resolution

Python 1,086 72 Updated Sep 17, 2024

Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone

Python 904 45 Updated Feb 9, 2025

[Interspeech 2024] Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation

Jupyter Notebook 129 8 Updated Feb 12, 2025
30 Updated May 26, 2023

RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models [CVPR 2024]

Python 288 20 Updated Feb 11, 2025
Python 23 3 Updated Feb 21, 2025
Next