Skip to content
View fp2302's full-sized avatar
:shipit:
:shipit:

Block or report fp2302

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 6,921 584 Updated Mar 4, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 11,256 1,129 Updated Mar 4, 2025

A repository for research on medium sized language models.

Python 492 70 Updated Jan 13, 2025

DataComp for Language Models

HTML 1,247 115 Updated Dec 11, 2024

Prometheus Operator creates/configures/manages Prometheus clusters atop Kubernetes

Go 9,312 3,753 Updated Mar 3, 2025

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 10,074 1,529 Updated Jan 13, 2025

SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 14+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.

Python 7,401 582 Updated Mar 4, 2025

Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, Du…

Rust 4,231 261 Updated Mar 3, 2025

Fast and memory-efficient exact attention

Python 47 48 Updated Feb 27, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 40,089 6,002 Updated Mar 4, 2025

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,860 128 Updated Oct 30, 2024

Embedding Vector Oriented Clustering

Python 132 6 Updated Feb 28, 2025

Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics

C++ 15,055 3,637 Updated Mar 3, 2025

Apache DataFusion SQL Query Engine

Rust 6,831 1,353 Updated Mar 3, 2025

Redis is an in-memory database that persists on disk. The data model is key-value, but many different kind of values are supported: Strings, Lists, Sets, Sorted Sets, Hashes, Streams, HyperLogLogs,…

C 68,185 23,913 Updated Mar 3, 2025

Redis Python client

Python 12,906 2,564 Updated Mar 1, 2025

A fault tolerant, protocol-agnostic RPC system

Scala 8,807 1,450 Updated Jan 27, 2025

Distributed data engine for Python/SQL designed for the cloud, powered by Rust

Rust 2,579 186 Updated Mar 4, 2025

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 5,556 423 Updated Aug 7, 2024

MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW

Python 2,651 297 Updated Jun 4, 2024

LLM training code for Databricks foundation models

Python 4,172 547 Updated Mar 3, 2025

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…

Python 2,236 369 Updated Mar 3, 2025

TFX is an end-to-end platform for deploying production ML pipelines

Python 2,131 722 Updated Feb 26, 2025

Python 3.8+ toolbox for submitting jobs to Slurm

Python 1,383 131 Updated Sep 18, 2024

Supercharge Your Model Training

Python 5,300 434 Updated Mar 4, 2025

A fast, effective data attribution method for neural networks in PyTorch

Python 196 26 Updated Nov 18, 2024

A simple yet powerful tool to turn traditional container/OS images into unprivileged sandboxes.

Shell 707 98 Updated Dec 17, 2024

A Data Streaming Library for Efficient Neural Network Training

Python 1,237 155 Updated Mar 3, 2025
Next