UmarMJW

Umar UmarMJW

Stars

Zz-ww / SadTalker-Video-Lip-Sync

本项目基于SadTalkers实现视频唇形合成的Wav2lip。通过以视频文件方式进行语音驱动生成唇形，设置面部区域可配置的增强方式进行合成唇形（人脸）区域画面增强，提高生成唇形的清晰度。使用DAIN 插帧的DL算法对生成视频进行补帧，补充帧间合成唇形的动作过渡，使合成的唇形更为流畅、真实以及自然。

Python 1,931 333 Updated Jun 4, 2023

saifhassan / Wav2Lip-HD

High-Fidelity Lip-Syncing with Wav2Lip and Real-ESRGAN

Python 430 90 Updated Mar 27, 2024

notiom / ER-nerf

主要写er-nerf从零到一所有部署过程

Python 42 11 Updated Aug 28, 2024

YanWenKun / ComfyUI-Windows-Portable

🎨ComfyUI standalone pack with 40+ custom nodes. | ComfyUI 大号整合包，预装大量自定义节点（不含SD模型）

Shell 230 31 Updated Feb 11, 2025

YanWenKun / Comfy3D-WinPortable

🧊ComfyUI-3D-Pack pre-built for Windows. | Comfy3D 整合包

Python 117 29 Updated Feb 6, 2025

YanWenKun / ComfyUI-Docker

🐳Dockerfile for 🎨ComfyUI. | 容器镜像与启动脚本

Dockerfile 627 110 Updated Feb 18, 2025

feizc / FluxMusic

Text-to-Music Generation with Rectified Flow Transformers

Python 1,667 133 Updated Dec 10, 2024

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 10,795 1,054 Updated Feb 16, 2025

FunAudioLLM / SenseVoice

Multilingual Voice Understanding Model

Python 4,509 401 Updated Jan 8, 2025

FunAudioLLM / InspireMusic

InspireMusic: A Unified Framework for Music, Song, Audio Generation.

Python 813 70 Updated Feb 18, 2025

Kwai-Kolors / Kolors

Kolors Team

Python 4,188 316 Updated Nov 13, 2024

IamCreateAI / Ruyi-Models

Python 483 28 Updated Jan 20, 2025

bytedance / Valley

Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.

Python 212 13 Updated Feb 9, 2025

VITA-MLLM / VITA

✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 2,086 158 Updated Feb 13, 2025

TMElyralab / MusePose

MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation

Python 2,446 177 Updated Aug 7, 2024

Tencent / MimicMotion

High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance

Python 2,199 186 Updated Sep 23, 2024

magic-research / magic-animate

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Python 10,682 1,091 Updated Jun 21, 2024

magic-research / magic-avatar

MagicAvatar: Multimodal Avatar Generation and Animation

622 34 Updated Aug 29, 2023

Zheng-Chong / CatVTON

[ICLR 2025] CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2) Parameter-Efficient Training (49.57M parameters trainable) …

Python 1,184 145 Updated Jan 24, 2025