Stars
A portable SQL query and AI compute engine, written in Rust, for data-grounded apps and agents.
The easiest tool for fine-tuning LLM models, synthetic data generation, and collaborating on datasets.
code for training & evaluating Contextual Document Embedding models
MTEB: Massive Text Embedding Benchmark
Agentless🐱: an agentless approach to automatically solve software development problems
NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other enterprise documents into metadata and text to embed into retri…
The official Python SDK for Model Context Protocol servers and clients
Cognitive Architectures for Multi-Agent Teams
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
The fastest way to create an HTML app
TUI explorer application for Amazon S3 (AWS S3) 🪣
Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali
A fast image processing library with low memory needs.
Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024
Bayesian inference with probabilistic programming.
Modeling language for Mathematical Optimization (linear, mixed-integer, conic, semidefinite, nonlinear)
An acausal modeling framework for automatically parallelized scientific machine learning (SciML) in Julia. A computer algebra system for integrated symbolics for physics-informed machine learning a…
A simple, high-throughput file client for mounting an Amazon S3 bucket as a local file system.
potato: portable text annotation tool
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate, LanceDB).
ShellSage saves sysadmins’ sanity by solving shell script snafus super swiftly
tests for cohort-level heterogeneity in panel regression
SpiderFoot automates OSINT for threat intelligence and mapping your attack surface.
Tookie is a advanced OSINT information gathering tool that finds social media accounts based on inputs.