Stars
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 15+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
Websockify is a WebSocket to TCP proxy/bridge. This allows a browser to connect to any application/server/service.
No fortress, purely open ground. OpenManus is Coming.
Open-source vector similarity search for Postgres
vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Example models using DeepSpeed
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.
LeaderWorkerSet: An API for deploying a group of pods as a unit of replication
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
A streamlined and customizable framework for efficient large model evaluation and performance benchmarking
SGLang is a fast serving framework for large language models and vision language models.
deep learning for image processing including classification and object-detection etc.
DeepSeek Coder: Let the Code Write Itself
Repository for sample controller. Complements sample-apiserver
Ingress NGINX Controller for Kubernetes
contaiNERD CTL - Docker-compatible CLI for containerd, with support for Compose, Rootless, eStargz, OCIcrypt, IPFS, ...
Robyn is a Super Fast Async Python Web Framework with a Rust runtime.