Skip to content
View wan-h's full-sized avatar
🎯
Focusing
🎯
Focusing
  • 成都

Block or report wan-h

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

数字人

38 repositories

Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guid…

JavaScript 9,661 1,843 Updated Dec 31, 2024

VirtualWife是一个虚拟数字人项目,支持B站直播,支持openai、ollama

Python 2,159 329 Updated Oct 27, 2024

AI Vtuber是一个由 【ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/闻达/千问/kimi/ollama】 驱动的虚拟主播【Live2D/UE/xuniren】,可以在 【Bilibili/抖音/快手/微信视频号/拼多多/斗鱼/YouTube/twitch/TikTok】 直播中与观众实时互动 或 直接在本地进行聊…

Python 3,312 509 Updated Jan 7, 2025

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 36,639 4,524 Updated Aug 16, 2024

http://www.facegood.cc

Python 1,835 361 Updated Feb 8, 2023

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

Python 11,107 2,351 Updated Nov 26, 2024

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Python 6,803 997 Updated Aug 5, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 73,964 8,837 Updated Jan 4, 2025

Production First and Production Ready End-to-End Speech Recognition Toolkit

Python 4,256 1,096 Updated Jan 8, 2025

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

Python 6,742 659 Updated Dec 26, 2024

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Python 8,518 2,406 Updated Jan 4, 2025

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/

Python 7,756 769 Updated Feb 11, 2024

Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)

Python 1,952 250 Updated Jan 2, 2025

SoftVC VITS Singing Voice Conversion

Python 26,258 4,880 Updated Nov 11, 2023

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion

Python 4,790 720 Updated Jul 3, 2024

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

C++ 25,574 3,992 Updated Sep 3, 2024

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 35,608 5,221 Updated Nov 15, 2024

live2d模型收集+展示,可直接用于静态网站

JavaScript 735 175 Updated Jun 17, 2022

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 11,356 1,865 Updated Jan 6, 2025

kaldi-asr/kaldi is the official location of the Kaldi project.

Shell 14,422 5,332 Updated Nov 29, 2024

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 7,634 802 Updated Dec 29, 2024

MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting

Python 3,253 415 Updated Nov 27, 2024

A generative speech model for daily dialogue.

Python 33,475 3,635 Updated Jan 7, 2025

Real time interactive streaming digital human

Python 4,266 621 Updated Jan 1, 2025

Bring portraits to life!

Python 13,554 1,451 Updated Jan 1, 2025

[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Python 12,183 2,272 Updated Jun 26, 2024

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Python 3,660 388 Updated Dec 4, 2024

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

Python 2,550 270 Updated Jun 28, 2024

[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.

Python 684 44 Updated Dec 5, 2024