Kamailio evapi connector from Go

Go 11 17 Updated Mar 7, 2024

✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 1,994 145 Updated Jan 21, 2025

🔥🔥 Kokoro in Rust. https://huggingface.co/hexgrad/Kokoro-82M Insanely fast, real-time TTS with the highest quality you have ever had.

Rust 252 17 Updated Jan 22, 2025

Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching

Python 825 104 Updated Jan 22, 2025

Go SIP UA library for client/b2bua

Go 216 86 Updated Aug 2, 2024

first base model for full-duplex conversational audio

Python 1,689 113 Updated Jan 5, 2025

✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM

Python 257 16 Updated Jan 2, 2025

TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loudness normalization operations.

Python 92 16 Updated Dec 20, 2024

A lightweight, object-oriented finite state machine implementation in Python with many extensions

Python 5,890 532 Updated Aug 23, 2024

API server and Web GUI for FreeSwitch written in Golang and Angular

TypeScript 61 23 Updated Dec 22, 2024

Production First and Production Ready End-to-End Speech Recognition Toolkit

Python 4,276 1,098 Updated Jan 10, 2025

Multilingual Voice Understanding Model

Python 4,151 368 Updated Jan 8, 2025

Composio equips your AI agents & LLMs with 100+ high-quality integrations via function calling

Python 14,247 4,254 Updated Jan 22, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 9,874 954 Updated Jan 15, 2025

GLM-4-Voice | End-to-end Chinese-English spoken dialogue model

Python 2,586 210 Updated Dec 5, 2024

Inference and training library for high-quality TTS models.

Python 4,926 508 Updated Dec 10, 2024

Smart load balancing for Azure OpenAI endpoints

Bicep 77 26 Updated Apr 19, 2024

This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.

Python 1,144 420 Updated Jul 25, 2024

🐜🐜🐜 ants is the most powerful and reliable pooling solution for Go.

Go 13,211 1,376 Updated Jan 17, 2025

Preprocess Audio for training

Python 293 52 Updated Oct 7, 2024

Intelligent gateway for AI agents. Designed with (fast) LLMs for task routing, rich observability, and seamless integration of prompts with your APIs for agentic tasks. Built by the contributors of…

Rust 1,380 61 Updated Jan 22, 2025

SOTA Open Source TTS

Python 18,562 1,404 Updated Jan 18, 2025

Beautifully designed components that you can copy and paste into your apps. Accessible. Customizable. Open Source.

TypeScript 79,181 5,113 Updated Jan 16, 2025

Code and Data for Tau-Bench

Python 256 35 Updated Jan 22, 2025

Demo of scalable Asterisk on Kubernetes

Go 170 76 Updated Jul 29, 2023

async FreeSWITCH cluster control

Python 73 18 Updated Jan 27, 2024

This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, please try out Sync Labs

Python 11,192 2,364 Updated Nov 26, 2024

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 9,121 1,212 Updated Jan 22, 2025

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 18,108 1,887 Updated Oct 15, 2024