Highlights
- Pro
Stars
A react-based starter app for using the Multimodal Live API over websockets with Gemini
Low latency ai companion voice talk in 60 lines of code using faster_whisper and elevenlabs input streaming
Open Source framework for voice and multimodal conversational AI
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
[CCS'24] A dataset consists of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).
We write your reusable computer vision tools. 💜
RobustSAM: Segment Anything Robustly on Degraded Images (CVPR 2024 Highlight)
A global shortcut/hotkey for Desktop Qt-Applications
llama3 implementation one matrix multiplication at a time
A framework for building cross platform GUI interfaces in Go and QML
Qt binding for Go (Golang) with support for Windows / macOS / Linux / FreeBSD / Android / iOS / Sailfish OS / Raspberry Pi / AsteroidOS / Ubuntu Touch / JavaScript / WebAssembly
Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"
Streamlines and simplifies prompt design for both developers and non-technical users with a low code approach.
The fastest way to create an HTML app
grep for words with similar meaning to the query
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, an…
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
Minimal and clean examples of machine learning algorithms implementations
Distribute and run LLMs with a single file.
Interact with your SQL database, Natural Language to SQL using LLMs
An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).