ChenHuangrong chr2117216003

I am a machine learning enthusiast.

https://hacknical.com/chr2117216003/resume?locale=zh

Starred repositories

k2-fsa / sherpa-onnx

Speech-to-text, text-to-speech, speaker diarization, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC…

C++ 4,243 492 Updated Jan 15, 2025

csukuangfj / vits_chinese

Forked from UEhQZXI/vits_chinese

vits chinese, tts chinese, tts mandarin: the easiest-to-train, best-sounding speech synthesis system ever

Python 8 2 Updated Nov 5, 2023

Ailln / cn2an

📦 Quickly convert between Chinese numerals and Arabic numerals (latest features: conversion of fractions, dates, temperatures, etc.)

Python 693 79 Updated Dec 21, 2024

k2-fsa / icefall

Python 979 310 Updated Jan 9, 2025

mapull / chinese-dictionary

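Chinese pinyin dictionary: character pinyin dictionary, word dictionary, idiom dictionary, and a dictionary database of common and polyphonic characters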

526 125 Updated Jan 15, 2024

PaddlePaddle / PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 11,385 1,870 Updated Jan 15, 2025

TensorSpeech / TensorFlowTTS

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supports English, French, Korean, Chinese, and German, and is easy to adapt to other languages)

Python 3,876 814 Updated Jul 5, 2024

bytedance / LatentSync

Taming Stable Diffusion for Lip Sync!

Python 1,776 196 Updated Jan 15, 2025

wenet-e2e / WeTextProcessing

Text Normalization & Inverse Text Normalization

Python 512 70 Updated Nov 11, 2024

vanna-ai / vanna

🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.

Python 12,661 1,040 Updated Nov 21, 2024

NVIDIA / TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 9,151 1,064 Updated Jan 13, 2025

DakeQQ / F5-TTS-ONNX

Running the F5-TTS by ONNX Runtime

Python 79 12 Updated Jan 14, 2025

Bigfishering / f5-tts-trtllm

Python 25 4 Updated Jan 13, 2025

ZJU-LLMs / Foundations-of-LLMs

1,712 170 Updated Jan 14, 2025

huakunyang / SummerTTS

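SummerTTS is an independently compiled, C++-based Chinese and English speech synthesis project that runs locally without a network connection, has no additional dependencies, and is ready for Chinese and English speech synthesis after a one-step build. SummerTTS is a standalone Chinese and English speech synthesis (TTS) project that has almost no dependency and could be…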

C++ 428 74 Updated Dec 14, 2024

modelscope / ClearerVoice-Studio

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 2,010 143 Updated Jan 15, 2025

Tencent / HunyuanVideo

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 7,407 571 Updated Jan 14, 2025

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 13,359 1,451 Updated Jan 14, 2025

km1994 / LLMs_interview_notes

This repository mainly collects interview questions for large language model (LLM) algorithm engineers

1,622 113 Updated Dec 26, 2024

Lightricks / LTX-Video

Official repository for LTX-Video

Python 2,553 207 Updated Jan 3, 2025

THUDM / GLM-4-Voice

GLM-4-Voice | End-to-end Chinese-English spoken dialogue model

Python 2,562 207 Updated Dec 5, 2024

Plachtaa / seed-vc

zero-shot voice conversion & singing voice conversion, with real-time support

Python 899 113 Updated Jan 4, 2025

JeffC0628 / awesome-voice-conversion

A curated list of awesome voice conversion projects and communities.

214 13 Updated Jan 13, 2025

richards199999 / Thinking-Claude

Let your Claude be able to think

TypeScript 13,505 1,590 Updated Jan 9, 2025

lenML / Speech-AI-Forge

🍦 Speech-AI-Forge is a project developed around TTS generation models, implementing an API Server and a Gradio-based WebUI.

Python 975 129 Updated Dec 29, 2024

gemelo-ai / vocos

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Python 863 99 Updated Aug 7, 2024

fferflo / einx

Universal Tensor Operations in Einstein-Inspired Notation for Python.

Python 341 10 Updated Nov 29, 2024

SYSTRAN / faster-whisper

Faster Whisper transcription with CTranslate2

Python 13,479 1,134 Updated Jan 1, 2025

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
