Stars
Helm Chart & Documentation for deploying JupyterHub on Kubernetes
SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, cross-DC ac…
Virtual whiteboard for sketching hand-drawn like diagrams
Ready-to-run Docker images containing Jupyter applications
LeaderWorkerSet: An API for deploying a group of pods as a unit of replication
A high-throughput and memory-efficient inference and serving engine for LLMs
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
An extremely fast Python package and project manager, written in Rust.
A lightweight data processing framework built on DuckDB and 3FS.
FUSE-based file system backed by Amazon S3
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
DeepEP: an efficient expert-parallel communication library
Visualizer for neural network, deep learning and machine learning models
Common source, scripts and utilities for creating Triton backends.
Distributed Task Queue (development branch)
This repository contains tutorials and examples for Triton Inference Server
A suite of hands-on training materials that show how to scale CV, NLP, and time-series forecasting workloads with Ray.
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Open deep learning compiler stack for CPUs, GPUs, and specialized accelerators
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
A toolkit to run Ray applications on Kubernetes
Style guides for Google-originated open-source projects
Reformats Java source code to comply with Google Java Style.