-
06:40
(UTC +01:00) - https://www.youtube.com/@Linguflex
- @LonLigrin
- https://discord.gg/f556hqRjpv
Stars
Run Orpheus 3B Locally With LM Studio
Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯
A generative speech model for daily dialogue.
Multilingual Voice Understanding Model
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
A pattern for an always on AI Assistant powered by Deepseek-V3, RealtimeSTT, and Typer for engineering
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
vitaliy-sn / RealtimeSTT
Forked from KoljaB/RealtimeSTTA robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
A terminal assistant for the hopelessly confused
Magnificent app which corrects your previous console command.
A Python utility for managing package versions with an interactive CLI interface. This tool helps you monitor, update, and backup your Python packages, whether they're project-specific or globally …
Simulates talk with an AI that can express emotions
Overide (pronounced over·ide) is a lightweight, yet powerful CLI tool that seamlessly integrates AI-powered code generation into your development workflow. It works platform-agnostically with OpenA…
All-in-one LLM CLI tool featuring Shell Assistant, Chat-REPL, RAG, AI Tools & Agents, with access to OpenAI, Claude, Gemini, Ollama, Groq, and more.
Concatenate a directory full of files into a single prompt for use with LLMs
This project demonstrates a basic chain-of-thought interaction with any LLM (Large Language Model)
A local AI companion that uses a collection of free, open source AI models in order to create two virtual companions that will follow your computer journey wherever you go!
We write your reusable computer vision tools. 💜
idiap / coqui-ai-TTS
Forked from coqui-ai/TTS🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Android App for Well Being of a person with features like step counter, water reminder,etc
An open-source audio wake word (or phrase) detection framework with a focus on performance and simplicity.
Efficient approach to speaker diarization using voice characteristics extraction
A financial agent, built entirely with LangChain!
Transcription, forced alignment, and audio indexing with OpenAI's Whisper