Skip to content
View zhumqs's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Alibaba
  • HangZhou China

Block or report zhumqs

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Helm Chart & Documentation for deploying JupyterHub on Kubernetes

Python 1,593 813 Updated Mar 19, 2025

SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, cross-DC ac…

Go 23,949 2,368 Updated Mar 21, 2025

Virtual whiteboard for sketching hand-drawn like diagrams

TypeScript 95,164 9,171 Updated Mar 21, 2025
2 Updated Oct 29, 2024

Ready-to-run Docker images containing Jupyter applications

Python 8,147 2,992 Updated Mar 21, 2025

LeaderWorkerSet: An API for deploying a group of pods as a unit of replication

Go 344 58 Updated Mar 21, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 42,239 6,383 Updated Mar 21, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 8,215 777 Updated Mar 20, 2025

An extremely fast Python package and project manager, written in Rust.

Rust 45,461 1,277 Updated Mar 21, 2025

A lightweight data processing framework built on DuckDB and 3FS.

Python 4,354 376 Updated Mar 5, 2025

FUSE-based file system backed by Amazon S3

C++ 8,993 1,035 Updated Mar 19, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 5,051 518 Updated Mar 16, 2025

FlashMLA: Efficient MLA decoding kernels

C++ 11,350 806 Updated Mar 1, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,273 665 Updated Mar 18, 2025

Visualizer for neural network, deep learning and machine learning models

JavaScript 29,700 2,863 Updated Mar 21, 2025

Common source, scripts and utilities for creating Triton backends.

C++ 310 95 Updated Mar 17, 2025

Distributed Task Queue (development branch)

Python 25,862 4,751 Updated Mar 20, 2025

This repository contains tutorials and examples for Triton Inference Server

Python 671 112 Updated Mar 19, 2025

This is suite of the hands-on training materials that shows how to scale CV, NLP, time-series forecasting workloads with Ray.

Jupyter Notebook 381 72 Updated Feb 13, 2024

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 8,941 1,543 Updated Mar 21, 2025

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Python 12,132 3,536 Updated Mar 21, 2025

LLM101n: Let's build a Storyteller

32,836 1,797 Updated Aug 1, 2024

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 1,193 84 Updated Mar 19, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,296 1,432 Updated Mar 10, 2025
Jupyter Notebook 17 5 Updated Jul 10, 2023

A toolkit to run Ray applications on Kubernetes

Go 1,602 495 Updated Mar 21, 2025

Style guides for Google-originated open-source projects

HTML 37,978 13,305 Updated Mar 7, 2025

Reformats Java source code to comply with Google Java Style.

Java 5,734 872 Updated Mar 20, 2025
Next