Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 8,562 661 Updated Feb 24, 2025

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 8,369 866 Updated Feb 18, 2025

fishaudio / Bert-VITS2

vits2 backbone with multilingual-bert

Python 8,266 1,171 Updated Feb 10, 2025

Plachtaa / VALL-E-X

An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io/vallex/

Python 7,801 774 Updated Feb 11, 2024

TingsongYu / PyTorch_Tutorial

Companion code for the book "A Practical Tutorial on PyTorch Model Training" (《Pytorch模型训练实用教程》)

Python 7,757 1,757 Updated Jan 4, 2025

netease-youdao / EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Python 7,681 656 Updated Aug 13, 2024

jaywalnut310 / vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 7,169 1,313 Updated Dec 6, 2023

tkipf / pygcn

Graph Convolutional Networks in PyTorch

Python 5,252 1,230 Updated Sep 20, 2020

mozillazg / python-pinyin

Convert Chinese characters to pinyin (pypinyin)

Python 4,983 619 Updated Jan 3, 2025

timesler / facenet-pytorch

Pretrained Pytorch face detection (MTCNN) and facial recognition (InceptionResnet) models

Python 4,715 971 Updated Aug 2, 2024

wenet-e2e / wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Python 4,317 1,104 Updated Feb 22, 2025

Ikaros-521 / AI-Vtuber

Forked from sandboxdream/AI-Vtuber

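AI Vtuber is a virtual streamer [Live2D/UE/xuniren] driven by [ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/Wenda/Qianwen/kimi/ollama] that can interact with viewers in real time during live streams on [Bilibili/Douyin/Kuaishou/WeChat Channels/Pinduoduo/Douyu/YouTube/twitch/TikTok], or chat directly on the local machine…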

Python 3,514 548 Updated Feb 24, 2025

ChengBen-Xu

Stars

scikit-learn / scikit-learn

ageitgey / face_recognition

RVC-Boss / GPT-SoVITS

google-research / bert

svc-develop-team / so-vits-svc

pyg-team / pytorch_geometric

microsoft / unilm

fishaudio / fish-speech

FunAudioLLM / CosyVoice

vipstone / faceai

AIGC-Audio / AudioGPT

ymcui / Chinese-BERT-wwm

speechbrain / speechbrain

open-mmlab / Amphion

modelscope / FunASR

fishaudio / Bert-VITS2

Plachtaa / VALL-E-X

TingsongYu / PyTorch_Tutorial

netease-youdao / EmotiVoice

jaywalnut310 / vits

tkipf / pygcn

mozillazg / python-pinyin

timesler / facenet-pytorch

wenet-e2e / wenet

Ikaros-521 / AI-Vtuber

enhuiz / vall-e

PlayVoice / whisper-vits-svc

usefulsensors / moonshine

Kyubyong / wordvectors

coneypo / Dlib_face_recognition_from_camera