wan-h

🎯

Focusing

wan-h wan-h

🎯

Focusing

Just do what U want

40 followers · 9 following

成都

Achievements

Stars

数字人

38 repositories

xszyou / Fay

Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guid…

JavaScript 9,661 1,843 Updated Dec 31, 2024

yakami129 / VirtualWife

VirtualWife是一个虚拟数字人项目，支持B站直播，支持openai、ollama

Python 2,159 329 Updated Oct 27, 2024

Ikaros-521 / AI-Vtuber

AI Vtuber是一个由【ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/闻达/千问/kimi/ollama】驱动的虚拟主播【Live2D/UE/xuniren】，可以在【Bilibili/抖音/快手/微信视频号/拼多多/斗鱼/YouTube/twitch/TikTok】直播中与观众实时互动或直接在本地进行聊…

Python 3,312 509 Updated Jan 7, 2025

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 36,639 4,524 Updated Aug 16, 2024

FACEGOOD / FACEGOOD-Audio2Face

http://www.facegood.cc

Python 1,835 361 Updated Feb 8, 2023

Rudrabha / Wav2Lip

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

Python 11,107 2,351 Updated Nov 26, 2024

OpenTalker / video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Python 6,803 997 Updated Aug 5, 2024

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 73,964 8,837 Updated Jan 4, 2025

wenet-e2e / wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Python 4,256 1,096 Updated Jan 8, 2025

rany2 / edge-tts

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

Python 6,742 659 Updated Dec 26, 2024

Uberi / speech_recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Python 8,518 2,406 Updated Jan 4, 2025

Plachtaa / VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/

Python 7,756 769 Updated Feb 11, 2024

yxlllc / DDSP-SVC

Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)

Python 1,952 250 Updated Jan 2, 2025

svc-develop-team / so-vits-svc

SoftVC VITS Singing Voice Conversion

Python 26,258 4,880 Updated Nov 11, 2023

Plachtaa / VITS-fast-fine-tuning

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion

Python 4,790 720 Updated Jul 3, 2024

mozilla / DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

C++ 25,574 3,992 Updated Sep 3, 2024

babysor / MockingBird

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 35,608 5,221 Updated Nov 15, 2024

imuncle / live2d

live2d模型收集+展示，可直接用于静态网站

JavaScript 735 175 Updated Jun 17, 2022

PaddlePaddle / PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 11,356 1,865 Updated Jan 6, 2025

kaldi-asr / kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Shell 14,422 5,332 Updated Nov 29, 2024

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 7,634 802 Updated Dec 29, 2024

TMElyralab / MuseTalk

MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting

Python 3,253 415 Updated Nov 27, 2024

2noise / ChatTTS

A generative speech model for daily dialogue.

Python 33,475 3,635 Updated Jan 7, 2025

lipku / LiveTalking

Real time interactive streaming digital human

Python 4,266 621 Updated Jan 1, 2025

Live2D / CubismWebSamples

304 99 Updated Dec 19, 2024

KwaiVGI / LivePortrait

Bring portraits to life!

Python 13,554 1,451 Updated Jan 1, 2025

OpenTalker / SadTalker

[CVPR 2023] SadTalker：Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Python 12,183 2,272 Updated Jun 26, 2024

huggingface / speech-to-speech

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Python 3,660 388 Updated Dec 4, 2024

TMElyralab / MuseV

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

Python 2,550 270 Updated Jun 28, 2024

MyNiuuu / MOFA-Video

[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.

Python 684 44 Updated Dec 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

wan-h wan-h

Achievements

Achievements

Block or report wan-h

数字人

xszyou / Fay

yakami129 / VirtualWife

Ikaros-521 / AI-Vtuber

coqui-ai / TTS

FACEGOOD / FACEGOOD-Audio2Face

Rudrabha / Wav2Lip

OpenTalker / video-retalking

openai / whisper

wenet-e2e / wenet

rany2 / edge-tts

Uberi / speech_recognition

Plachtaa / VALL-E-X

yxlllc / DDSP-SVC

svc-develop-team / so-vits-svc

Plachtaa / VITS-fast-fine-tuning

mozilla / DeepSpeech

babysor / MockingBird

imuncle / live2d

PaddlePaddle / PaddleSpeech

kaldi-asr / kaldi

modelscope / FunASR

TMElyralab / MuseTalk

2noise / ChatTTS

lipku / LiveTalking

Live2D / CubismWebSamples

KwaiVGI / LivePortrait

OpenTalker / SadTalker

huggingface / speech-to-speech

TMElyralab / MuseV

MyNiuuu / MOFA-Video