Skip to content
View ChengBen-Xu's full-sized avatar

Block or report ChengBen-Xu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
92 stars written in Python
Clear filter

scikit-learn: machine learning in Python

Python 61,218 25,622 Updated Feb 24, 2025

The world's simplest facial recognition api for Python and the command line

Python 54,197 13,562 Updated Aug 21, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 41,122 4,588 Updated Feb 24, 2025

TensorFlow code and pre-trained models for BERT

Python 38,710 9,665 Updated Jul 23, 2024

SoftVC VITS Singing Voice Conversion

Python 26,593 4,915 Updated Nov 11, 2023

Graph Neural Network Library for PyTorch

Python 21,914 3,761 Updated Feb 24, 2025

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,788 2,596 Updated Feb 6, 2025

SOTA Open Source TTS

Python 19,473 1,506 Updated Feb 18, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 11,021 1,076 Updated Feb 16, 2025

一款入门级的人脸、视频、文字检测以及识别的项目.

Python 10,860 2,521 Updated Apr 16, 2020

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Python 10,098 860 Updated Jul 6, 2024

Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)

Python 9,825 1,392 Updated Jul 31, 2023

A PyTorch-based Speech Toolkit

Python 9,395 1,438 Updated Feb 13, 2025

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 8,562 661 Updated Feb 24, 2025

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 8,369 866 Updated Feb 18, 2025

vits2 backbone with multilingual-bert

Python 8,266 1,171 Updated Feb 10, 2025

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/

Python 7,801 774 Updated Feb 11, 2024

《Pytorch模型训练实用教程》中配套代码

Python 7,757 1,757 Updated Jan 4, 2025

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Python 7,681 656 Updated Aug 13, 2024

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 7,169 1,313 Updated Dec 6, 2023

Graph Convolutional Networks in PyTorch

Python 5,252 1,230 Updated Sep 20, 2020

汉字转拼音(pypinyin)

Python 4,983 619 Updated Jan 3, 2025

Pretrained Pytorch face detection (MTCNN) and facial recognition (InceptionResnet) models

Python 4,715 971 Updated Aug 2, 2024

Production First and Production Ready End-to-End Speech Recognition Toolkit

Python 4,317 1,104 Updated Feb 22, 2025

AI Vtuber是一个由 【ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/闻达/千问/kimi/ollama】 驱动的虚拟主播【Live2D/UE/xuniren】,可以在 【Bilibili/抖音/快手/微信视频号/拼多多/斗鱼/YouTube/twitch/TikTok】 直播中与观众实时互动 或 直接在本地进行聊…

Python 3,514 548 Updated Feb 24, 2025

An unofficial PyTorch implementation of the audio LM VALL-E

Python 2,986 417 Updated May 10, 2023

Core Engine of Singing Voice Conversion & Singing Voice Clone

Python 2,731 925 Updated Apr 23, 2024

Fast and accurate automatic speech recognition (ASR) for edge devices

Python 2,579 131 Updated Feb 4, 2025

Pre-trained word vectors of 30+ languages

Python 2,221 392 Updated Oct 11, 2018

Detect and recognize the faces from camera / 调用摄像头进行人脸识别,支持多张人脸同时识别

Python 2,156 580 Updated Dec 10, 2024
Next