Stars
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
SGLang is a fast serving framework for large language models and vision language models.
A repository for research on medium-sized language models.
Prometheus Operator creates/configures/manages Prometheus clusters atop Kubernetes
This repository contains demos I made with the Transformers library by HuggingFace.
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 14+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
Modern columnar data format for ML and LLMs implemented in Rust. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, Du…
Fast and memory-efficient exact attention
A high-throughput and memory-efficient inference and serving engine for LLMs
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
Redis is an in-memory database that persists on disk. The data model is key-value, but many different kinds of values are supported: Strings, Lists, Sets, Sorted Sets, Hashes, Streams, HyperLogLogs,…
A fault-tolerant, protocol-agnostic RPC system
Distributed data engine for Python/SQL designed for the cloud, powered by Rust
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
LLM training code for Databricks foundation models
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…
TFX is an end-to-end platform for deploying production ML pipelines
Python 3.8+ toolbox for submitting jobs to Slurm
A fast, effective data attribution method for neural networks in PyTorch
A simple yet powerful tool to turn traditional container/OS images into unprivileged sandboxes.
A Data Streaming Library for Efficient Neural Network Training