实时语音交互数字人，支持端到端语音方案（GLM-4-Voice - THG）和级联方案（ASR-LLM-TTS-THG）。可自定义形象与音色，无须训练，支持音色克隆，首包延迟低至3s。Real-time voice interactive digital human, supporting end-to-end voice solutions (GLM-4-Voice - THG) and …

Python 642 90 Updated Nov 15, 2024

twilio-samples / speech-assistant-openai-realtime-api-python

Python 176 87 Updated Jan 15, 2025

FoloToy / folotoy-server-self-hosting

Config files for self-hosting the FoloToy Community Server. Documents: https://docs.folotoy.com

Dockerfile 493 88 Updated Nov 12, 2024

THUDM / GLM-4-Voice

GLM-4-Voice | 端到端中英语音对话模型

Python 2,589 210 Updated Dec 5, 2024

xushengfeng / eSearch

截屏离线OCR 搜索翻译以图搜图贴图录屏万向滚动截屏屏幕翻译 Screenshot Offline OCR Search Translate Search for picture Paste the picture on the screen Screen recorder Omnidirectional scrolling screenshot Screen translator

TypeScript 5,218 394 Updated Jan 23, 2025

openai / evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 15,376 2,644 Updated Dec 18, 2024

kyutai-labs / moshi

Python 7,189 564 Updated Jan 14, 2025

lovemefan / SenseVoice-python

SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime

Python 80 12 Updated Sep 24, 2024

Olney1 / ChatGPT-OpenAI-Smart-Speaker

This AI Smart Speaker uses speech recognition, TTS (text-to-speech), and STT (speech-to-text) to enable voice and vision-driven conversations, with additional web search capabilities via OpenAI and…

Python 265 28 Updated Nov 13, 2024

lTbgykio / Books-Free-Books

免费书籍汇总。　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　…

11,118 1,175 Updated Nov 11, 2024

FunAudioLLM / FunAudioLLM-APP

Python 320 61 Updated Jul 22, 2024

ricky0123 / vad

Voice activity detector (VAD) for the browser with a simple API

TypeScript 1,042 164 Updated Jan 19, 2025

lhl / voicechat2

Local SRT/LLM/TTS Voicechat

Python 599 65 Updated Oct 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Hamburg miandai

Block or report miandai

Starred repositories

rasbt / LLMs-from-scratch

SWivid / F5-TTS

LMM101 / Awesome-Multimodal-Next-Token-Prediction

appl-team / appl

mdagost / openai-realtime-streamlit

unitreerobotics / unitree_guide

LqNoob / Neural-Codec-and-Speech-Language-Models

hello-robot / stretch_ai

78 / xiaozhi-esp32

liusongxiang / Large-Audio-Models

janhq / ichigo

Standard-Intelligence / hertz-dev

opendilab / CleanS2S

ga642381 / speech-trident

ddlBoJack / Awesome-Speech-Language-Model

freddyaboulton / gradio-webrtc

TMElyralab / MuseTalk

Henry-23 / VideoChat