50 stars written in Python

Magnificent app which corrects your previous console command.

Python 91,176 3,659 Updated Jul 19, 2024

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 38,880 4,885 Updated Aug 16, 2024

A generative speech model for daily dialogue.

Python 35,417 3,833 Updated Mar 14, 2025

The first real AI developer

Python 32,528 3,303 Updated Mar 4, 2025

We write your reusable computer vision tools. 💜

Python 26,311 1,986 Updated Mar 24, 2025

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Python 23,626 2,062 Updated Jan 23, 2025

LLM-based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.

Python 20,551 2,662 Updated Mar 27, 2025

Faster Whisper transcription with CTranslate2

Python 15,054 1,266 Updated Mar 20, 2025

Structured outputs for LLMs

Python 9,918 762 Updated Mar 27, 2025

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.

Python 6,482 522 Updated Mar 23, 2025

Agent Zero AI framework

Python 6,411 1,393 Updated Mar 18, 2025

Multilingual Voice Understanding Model

Python 5,130 466 Updated Mar 23, 2025

TTS Towards Human-Sounding Speech

Python 3,059 218 Updated Mar 27, 2025

A nearly-live implementation of OpenAI's Whisper.

Python 2,627 343 Updated Feb 26, 2025

Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"

Python 2,327 166 Updated Dec 11, 2024

Transcription, forced alignment, and audio indexing with OpenAI's Whisper

Python 1,802 195 Updated Mar 26, 2025

A real-time sketch-to-image demo using LCM and the Gradio library.

Python 1,791 150 Updated Dec 2, 2023

WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide seamless conversations with an AI.

Python 1,589 117 Updated Jul 31, 2024

👻 Experimental library for scraping websites using OpenAI's GPT API.

Python 1,431 86 Updated Oct 9, 2024

Concatenate a directory full of files into a single prompt for use with LLMs

Python 1,424 102 Updated Feb 19, 2025

A terminal assistant for the hopelessly confused

Python 1,284 70 Updated Dec 20, 2024

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 1,166 128 Updated Mar 24, 2025

Python API for Tuya WiFi smart devices using a direct local area network (LAN) connection or the cloud (TuyaCloud API).

Python 1,166 207 Updated Mar 16, 2025

A pattern for an always-on AI assistant powered by Deepseek-V3, RealtimeSTT, and Typer for engineering

Python 882 198 Updated Jan 12, 2025

Omni SenseVoice: High-Speed Speech Recognition with word timestamps 🗣️🎯

Python 827 33 Updated Mar 7, 2025

Web UI for using XTTS and for fine-tuning it

Python 758 149 Updated Jan 17, 2025

Command Your World with Voice

Python 619 58 Updated Dec 8, 2024

Thorsten-Voice: A free-to-use, offline, high-quality German TTS voice that should be available to every project without any licensing struggles.

Python 599 54 Updated Jan 8, 2025

Local AI talk with a custom voice, based on the Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with Coqui XTTS for synthesis.

Python 598 69 Updated Aug 12, 2024

GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest

Python 525 27 Updated Jun 11, 2024