Stars
A natural language interface for computers
💻 A fully functional local AWS cloud stack. Develop and test your cloud & Serverless apps offline
Langflow is a low-code app builder for RAG and multi-agent AI applications. It’s Python-based and agnostic to any model, API, or database.
Real-time face swap and one-click video deepfake with only a single image
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper
We write your reusable computer vision tools. 💜
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
A generative world for general-purpose robotics & embodied AI learning.
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
A modular graph-based Retrieval-Augmented Generation (RAG) system
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Simple, unified interface to multiple Generative AI providers
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques
Low-code web framework for real-world applications, in Python and JavaScript
Composable building blocks to build Llama Apps
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
High-quality multi-lingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese and Korean.
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Wo…
The #1 open-source voice interface for desktop, mobile, and ESP32 chips.
Data processing with ML, LLM and Vision LLM
Simple, online, and realtime tracking of multiple objects in a video sequence.