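This project implements Wav2lip video lip synthesis based on SadTalkers. It drives lip generation from speech using a video file as input, and applies a configurable enhancement method to the face region to enhance the synthesized lip (face) area and improve the sharpness of the generated lips. It uses the DAIN deep learning frame interpolation algorithm to add frames to the generated video, filling in the motion transitions between frames so that the synthesized lips look smoother, more realistic, and more natural.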

Python 1,931 333 Updated Jun 4, 2023

High-Fidelity Lip-Syncing with Wav2Lip and Real-ESRGAN

Python 430 90 Updated Mar 27, 2024

Mainly documents the entire er-nerf deployment process from scratch.

Python 42 11 Updated Aug 28, 2024

🎨ComfyUI standalone pack with 40+ custom nodes. | Large ComfyUI bundle with many custom nodes preinstalled (SD models not included)

Shell 230 31 Updated Feb 11, 2025

🧊ComfyUI-3D-Pack pre-built for Windows. | Comfy3D bundle

Python 117 29 Updated Feb 6, 2025

🐳Dockerfile for 🎨ComfyUI. | Container images and startup scripts

Dockerfile 627 110 Updated Feb 18, 2025

Text-to-Music Generation with Rectified Flow Transformers

Python 1,667 133 Updated Dec 10, 2024

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 10,795 1,054 Updated Feb 16, 2025

Multilingual Voice Understanding Model

Python 4,509 401 Updated Jan 8, 2025

InspireMusic: A Unified Framework for Music, Song, Audio Generation.

Python 813 70 Updated Feb 18, 2025

Kolors Team

Python 4,188 316 Updated Nov 13, 2024
Python 483 28 Updated Jan 20, 2025

Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.

Python 212 13 Updated Feb 9, 2025

✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 2,086 158 Updated Feb 13, 2025

MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation

Python 2,446 177 Updated Aug 7, 2024

High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance

Python 2,199 186 Updated Sep 23, 2024

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Python 10,682 1,091 Updated Jun 21, 2024

MagicAvatar: Multimodal Avatar Generation and Animation

622 34 Updated Aug 29, 2023

[ICLR 2025] CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2) Parameter-Efficient Training (49.57M parameters trainable) …

Python 1,184 145 Updated Jan 24, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 38,496 5,766 Updated Feb 19, 2025

Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.

Python 5,832 513 Updated Jan 24, 2025

Bring portraits to life!

Python 14,072 1,512 Updated Feb 13, 2025

JoyHallo: Digital human model for Mandarin

Python 439 43 Updated Nov 21, 2024

EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation

Python 2,795 327 Updated Jan 27, 2025

MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes; NeurIPS 2024; Official code

Python 560 67 Updated Oct 16, 2024

Official implementation of AnimateDiff.

Python 11,014 892 Updated Jul 31, 2024

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Python 4,840 600 Updated Jul 2, 2024

PantoMatrix: Generating Face and Body Animation from Speech

Python 968 160 Updated Jan 16, 2025
C++ 4,211 623 Updated Feb 18, 2025

Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Diffusion Transformer Networks

Python 1,057 145 Updated Jan 29, 2025