Lists (1)
Sort Name ascending (A-Z)
Stars
An HTTP server with APIs useful in testing HTTP clients. Inspired by httpbin, but isn't a clone.
PgAssistant is an open-source tool designed to help developers understand and optimize their PostgreSQL database performance.
vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization
LeaderWorkerSet: An API for deploying a group of pods as a unit of replication
KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale
Pass for iOS - an iOS client compatible with Pass command line application.
The official Python SDK for Model Context Protocol servers and clients
A simple zero-config tool to make locally trusted development certificates with any names you'd like.
A high-throughput and memory-efficient inference and serving engine for LLMs
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
The fastai book, published as Jupyter Notebooks
(WIP) A small but powerful, homemade PyTorch from scratch.
🔬 A fast, interactive web-based viewer for performance profiles.
[EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.
The commands and processes I run to setup a new Mac computer
eBPF distributed networking observability tool for Kubernetes
A Bulletproof Way to Generate Structured JSON from Language Models
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.
Complete code for the larger example programs from the book.
kubectl plugin for spinning up netshoot container for network troubleshooting
Shell-operator is a tool for running event-driven scripts in a Kubernetes cluster
Store application configuration files in Docker/OCI registries