Stars
✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
🔥🔥 Kokoro in Rust. https://huggingface.co/hexgrad/Kokoro-82M Insanely fast, real-time, high-quality TTS.
Dockerized FastAPI wrapper for the Kokoro-82M text-to-speech model with CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching
First base model for full-duplex conversational audio
✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM
TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loudness normalization operations.
A lightweight, object-oriented finite state machine implementation in Python with many extensions
API server and Web GUI for FreeSwitch written in Golang and Angular
Production First and Production Ready End-to-End Speech Recognition Toolkit
Multilingual Voice Understanding Model
Composio equips your AI agents & LLMs with 100+ high-quality integrations via function calling
Multilingual large voice generation model, providing full-stack inference, training, and deployment capabilities.
Inference and training library for high-quality TTS models.
Smart load balancing for Azure OpenAI endpoints
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
🐜🐜🐜 ants is the most powerful and reliable pooling solution for Go.
Intelligent gateway for AI agents. Designed with (fast) LLMs for task routing, rich observability, and seamless integration of prompts with your APIs for agentic tasks. Built by the contributors of…
Beautifully designed components that you can copy and paste into your apps. Accessible. Customizable. Open Source.
Demo of scalable Asterisk on Kubernetes
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, try Sync Labs.
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by the OpenAI Solution team.