Lists (6)
Sort Name ascending (A-Z)
Starred repositories
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Robust Speech Recognition via Large-Scale Weak Supervision
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Interact with your documents using the power of GPT, 100% privately, no data leaks
Clone a voice in 5 seconds to generate arbitrary speech in real-time
LlamaIndex is the leading framework for building LLM-powered agents over your data.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
Memory for AI Agents; SOTA in AI Agent Memory; Announcing OpenMemory MCP - local and secure memory management.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
An open-source RAG-based tool for chatting with your documents.
Faker is a Python package that generates fake data for you.
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.
AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
DALL·E Mini - Generate images from a text prompt
Gel supercharges Postgres with a modern data model, graph queries, Auth & AI solutions, and much more.
Gen-AI Chat for Teams - Think ChatGPT if it had access to your team's unique knowledge.
the first library to let you embed a developer agent in your own app!
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…
💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows
thumbor is an open-source photo thumbnail service by globo.com
Pythonic AI generation of images and videos
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
Pretrained language model with 100B parameters
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal is…