Skip to content
View dsumpter's full-sized avatar

Block or report dsumpter

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A portable SQL query and AI compute engine, written in Rust, for data-grounded apps and agents.

Rust 2,001 84 Updated Jan 16, 2025

The easiest tool for fine-tuning LLM models, synthetic data generation, and collaborating on datasets.

Python 700 34 Updated Jan 16, 2025

code for training & evaluating Contextual Document Embedding models

Python 160 9 Updated Jan 13, 2025

Fast Semantic Text Deduplication

Python 293 10 Updated Jan 15, 2025

MTEB: Massive Text Embedding Benchmark

Jupyter Notebook 2,089 298 Updated Jan 16, 2025

Agentless🐱: an agentless approach to automatically solve software development problems

Python 1,281 118 Updated Dec 22, 2024

NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other enterprise documents into metadata and text to embed into retri…

Python 2,042 175 Updated Jan 16, 2025

The official Python SDK for Model Context Protocol servers and clients

Python 1,414 140 Updated Jan 15, 2025
Go 1,634 279 Updated Jan 15, 2025

Cognitive Architectures for Multi-Agent Teams

Python 366 27 Updated Jan 9, 2025

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 966 119 Updated Jan 15, 2025

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python

Python 16,826 2,377 Updated Jan 10, 2025

The fastest way to create an HTML app

Jupyter Notebook 5,909 252 Updated Jan 12, 2025

TUI explorer application for Amazon S3 (AWS S3) 🪣

Rust 261 11 Updated Jan 12, 2025

Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali

Python 1,710 120 Updated Jan 15, 2025

🙌 OpenHands: Code Less, Make More

Python 43,505 4,825 Updated Jan 16, 2025

A fast image processing library with low memory needs.

C 9,957 688 Updated Jan 13, 2025

Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024

Python 1,645 147 Updated Jan 11, 2025

Bayesian inference with probabilistic programming.

Julia 2,078 219 Updated Jan 15, 2025

Modeling language for Mathematical Optimization (linear, mixed-integer, conic, semidefinite, nonlinear)

Julia 2,255 398 Updated Jan 16, 2025

An acausal modeling framework for automatically parallelized scientific machine learning (SciML) in Julia. A computer algebra system for integrated symbolics for physics-informed machine learning a…

Julia 1,452 211 Updated Jan 13, 2025

A simple, high-throughput file client for mounting an Amazon S3 bucket as a local file system.

Rust 4,804 177 Updated Jan 16, 2025

potato: portable text annotation tool

Jupyter Notebook 310 51 Updated Oct 23, 2024

Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.

Python 12,411 283 Updated Jan 16, 2025

Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate, LanceDB).

Python 2,759 234 Updated Aug 15, 2024

LLM image classification library using ollama in R

R 10 1 Updated Jan 16, 2025

ShellSage saves sysadmins’ sanity by solving shell script snafus super swiftly

Jupyter Notebook 264 23 Updated Jan 3, 2025

tests for cohort-level heterogeneity in panel regression

Python 3 Updated Jan 8, 2025

SpiderFoot automates OSINT for threat intelligence and mapping your attack surface.

Python 13,509 2,337 Updated Dec 15, 2024

Tookie is a advanced OSINT information gathering tool that finds social media accounts based on inputs.

Python 979 45 Updated Jan 9, 2025
Next