-
01:05
(UTC +01:00) - https://www.youtube.com/@Linguflex
- @LonLigrin
- https://discord.gg/f556hqRjpv
Stars
Magnificent app which corrects your previous console command.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
A generative speech model for daily dialogue.
We write your reusable computer vision tools. 💜
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
LLM based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.
Faster Whisper transcription with CTranslate2
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
Multilingual Voice Understanding Model
A nearly-live implementation of OpenAI's Whisper.
Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"
Transcription, forced alignment, and audio indexing with OpenAI's Whisper
A realtime sketch to image demo using LCM and the gradio library.
WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.
👻 Experimental library for scraping websites using OpenAI's GPT API.
Concatenate a directory full of files into a single prompt for use with LLMs
A terminal assistant for the hopelessly confused
idiap / coqui-ai-TTS
Forked from coqui-ai/TTS🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Python API for Tuya WiFi smart devices using a direct local area network (LAN) connection or the cloud (TuyaCloud API).
A pattern for an always on AI Assistant powered by Deepseek-V3, RealtimeSTT, and Typer for engineering
Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯
Webui for using XTTS and for finetuning it
Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be available for every project without any license struggling.
Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with Coqui XTTS for synthesis.
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest