Stars
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
img2table is a table identification and extraction Python Library for PDF and images, based on OpenCV image processing
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
AI's query engine - Platform for building AI that can learn and answer questions over federated data.
💫 Industrial-strength Natural Language Processing (NLP) in Python
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous …
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
LlamaIndex is the leading framework for building LLM-powered agents over your data.
Stable Diffusion web UI
The recursive internet scanner for hackers. 🧡
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Open source platform for the machine learning lifecycle
FastAPI framework, high performance, easy to learn, fast to code, ready for production
Composio equip's your AI agents & LLMs with 100+ high-quality integrations via function calling
Turns Data and AI algorithms into production-ready web applications in no time.
LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment.
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS ev…
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Jobs_Applier_AI_Agent_AIHawk aims to easy job hunt process by automating the job application process. Utilizing artificial intelligence, it enables users to apply for multiple jobs in a tailored way.
Build Multimodal AI Agents with memory, knowledge and tools. Simple, fast and model-agnostic.
Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!
Instant is a modern Firebase. We make you productive by giving your frontend a real-time database.
Readyset is a MySQL and Postgres wire-compatible caching layer that sits in front of existing databases to speed up queries and horizontally scale read throughput. Under the hood, ReadySet caches t…