Stars
Make websites accessible for AI agents
🚀 DeepSeek-V3 R1大模型逆向API【特长:良心厂商】(官方贼便宜,建议直接走官方),支持高速流式输出、多轮对话,联网搜索,R1深度思考,零配置部署,多路token支持,仅供测试,如需商用请前往官方开放平台。
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
基于树莓派和GPT实现的多功能语音家庭助手 A multifunctional voice home assistant based on Raspberry Pi and GPT
Fast-TTS 是一个基于异步框架的文本到语音转换(TTS)生成器项目。该项目利用了异步编程技术来高效处理请求和响应,实现了快速、秒级的流式生成长文本语音播放服务。Fast-TTS 可以快速地将长文本转换为语音流,并实时播放,适用于多种应用场景,如语音合成、智能助手、内容朗读等。
实时语音交互数字人,支持端到端语音方案(GLM-4-Voice - THG)和级联方案(ASR-LLM-TTS-THG)。可自定义形象与音色,无须训练,支持音色克隆,首包延迟低至3s。Real-time voice interactive digital human, supporting end-to-end voice solutions (GLM-4-Voice - THG) and …
百聆 是一个类似GPT-4o的语音对话机器人,通过ASR+LLM+TTS实现,时延低至800ms,低配置也可运行,支持打断
This is a speech interaction system built on an open-source model, integrating ASR, LLM, and TTS in sequence. The ASR model is SenceVoice, the LLM models are QWen2.5-0.5B/1.5B, and there are three …
智能开关平台,包含服务端、硬件端、安卓端和前端。关键词:物联网开关、远程开关、红外线遥控开关、WIFI继电器、MQTT协议、ESP8266、Iot smart switch、Android、Spring boot、Vue、Arduino
Support for Armbian in Amlogic, Rockchip and Allwinner boxes. Support a311d, s922x, s905x3, s905x2, s912, s905d, s905x, s905w, s905, s905l, rk3588, rk3568, rk3399, rk3328, h6, etc.
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
本项目使用esp32、esp32s3接入Chatgpt、Claude、讯飞星火、豆包等15款大模型,实现语音对话聊天,支持语音唤醒、连续对话、音乐播放等功能,同时外接了一块显示屏实时显示对话的内容。
简单实现VAD+声纹锁+SenseVoice完成类语音实时转录的小项目
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition, and speaker verification.
基于SenseVoice的funasr版本进行的api发布,可以无缝对接oneapi
An open-source audio wake word (or phrase) detection framework with a focus on performance and simplicity.
OpenAI 接口管理 & 分发系统,支持 Azure、Anthropic Claude、Google PaLM 2 & Gemini、智谱 ChatGLM、百度文心一言、讯飞星火认知、阿里通义千问、360 智脑以及腾讯混元,可用于二次分发管理 key,仅单可执行文件,已打包好 Docker 镜像,一键部署,开箱即用. OpenAI key management & redistributi…