Stars
- All languages
- AGS Script
- C
- C#
- C++
- CSS
- Clojure
- CoffeeScript
- Cuda
- Dockerfile
- Eagle
- Elixir
- Go
- HTML
- Haskell
- Java
- JavaScript
- Jupyter Notebook
- Kotlin
- Less
- Lua
- MATLAB
- MDX
- Makefile
- Markdown
- Nix
- Objective-C
- Objective-C++
- OpenSCAD
- PHP
- Python
- R
- Roff
- Ruby
- Rust
- SCSS
- Scala
- Shell
- Smarty
- Solidity
- Svelte
- Swift
- TeX
- TypeScript
- Vim Script
- Zig
A generative world for general-purpose robotics & embodied AI learning.
This is the template I use to start new full-stack projects.
Text to speech alignment using CTC forced alignment
Flexible and powerful framework for managing multiple AI agents and handling complex conversations
A generative speech model for daily dialogue.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
idiap / coqui-ai-TTS
Forked from coqui-ai/TTS🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
SoftVC VITS Singing Voice Conversion
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
18582088138 / GPT-SoVITS-OpenVINO
Forked from RVC-Boss/GPT-SoVITS[OpenVINO Enable]1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
リアルタイムボイスチェンジャー Realtime Voice Changer
The Multi-Faceted Optimizer for GenAI Workflows
Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜
Inference and training library for high-quality TTS models.
A lightweight task engine for building stateful AI agents that prioritizes simplicity and flexibility.
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
Simple proxy worker for using ollama in cursor
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Beautiful and customizable React Native components
AudioBench: A Universal Benchmark for Audio Large Language Models
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
📎 ZSH plugin that reminds you to use existing aliases for commands you just typed
A control plane to oversee agents operating in the wild
The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework Join our Community: https://discord.com/servers/agora-999382051935506503
Python binding for curl-impersonate fork via cffi. A http client that can impersonate browser tls/ja3/http2 fingerprints.
curl-impersonate: A special build of curl that can impersonate Chrome & Firefox