zhumqs

🎯

Focusing

duckduck zhumqs

🎯

Focusing

6 followers · 27 following

Alibaba
HangZhou China

Lists (1)

Sort

🚀 My stack

1 repository

Stars

jupyterhub / zero-to-jupyterhub-k8s

Helm Chart & Documentation for deploying JupyterHub on Kubernetes

Python 1,593 813 Updated Mar 19, 2025

seaweedfs / seaweedfs

SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, cross-DC ac…

Go 23,949 2,368 Updated Mar 21, 2025

excalidraw / excalidraw

Virtual whiteboard for sketching hand-drawn like diagrams

TypeScript 95,164 9,171 Updated Mar 21, 2025

s09g / leetcode-fast-pass

2 Updated Oct 29, 2024

jupyter / docker-stacks

Ready-to-run Docker images containing Jupyter applications

Python 8,147 2,992 Updated Mar 21, 2025

kubernetes-sigs / lws

LeaderWorkerSet: An API for deploying a group of pods as a unit of replication

Go 344 58 Updated Mar 21, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 42,239 6,383 Updated Mar 21, 2025

deepseek-ai / 3FS

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 8,215 777 Updated Mar 20, 2025

astral-sh / uv

An extremely fast Python package and project manager, written in Rust.

Rust 45,461 1,277 Updated Mar 21, 2025

deepseek-ai / smallpond

A lightweight data processing framework built on DuckDB and 3FS.

Python 4,354 376 Updated Mar 5, 2025

s3fs-fuse / s3fs-fuse

FUSE-based file system backed by Amazon S3

C++ 8,993 1,035 Updated Mar 19, 2025

deepseek-ai / DeepGEMM

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 5,051 518 Updated Mar 16, 2025

deepseek-ai / FlashMLA

FlashMLA: Efficient MLA decoding kernels

C++ 11,350 806 Updated Mar 1, 2025

deepseek-ai / DeepEP

DeepEP: an efficient expert-parallel communication library

Cuda 7,273 665 Updated Mar 18, 2025

lutzroeder / netron

Visualizer for neural network, deep learning and machine learning models

JavaScript 29,700 2,863 Updated Mar 21, 2025

triton-inference-server / backend

Common source, scripts and utilities for creating Triton backends.

C++ 310 95 Updated Mar 17, 2025

celery / celery

Distributed Task Queue (development branch)

Python 25,862 4,751 Updated Mar 20, 2025

triton-inference-server / tutorials

This repository contains tutorials and examples for Triton Inference Server

Python 671 112 Updated Mar 19, 2025

ray-project / ray-educational-materials

This is suite of the hands-on training materials that shows how to scale CV, NLP, time-series forecasting workloads with Ray.

Jupyter Notebook 381 72 Updated Feb 13, 2024

triton-inference-server / server

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 8,941 1,543 Updated Mar 21, 2025

apache / tvm

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Python 12,132 3,536 Updated Mar 21, 2025

karpathy / LLM101n

LLM101n: Let's build a Storyteller

32,836 1,797 Updated Aug 1, 2024

RAGEN-AI / RAGEN

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 1,193 84 Updated Mar 19, 2025

deepseek-ai / DeepSeek-V3

Python 92,743 15,077 Updated Mar 16, 2025

deepseek-ai / DeepSeek-R1

87,081 11,240 Updated Feb 24, 2025

Jiayi-Pan / TinyZero

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,296 1,432 Updated Mar 10, 2025

amogkam / batch-inference-benchmarks

Jupyter Notebook 17 5 Updated Jul 10, 2023

ray-project / kuberay

A toolkit to run Ray applications on Kubernetes

Go 1,602 495 Updated Mar 21, 2025

google / styleguide

Style guides for Google-originated open-source projects

HTML 37,978 13,305 Updated Mar 7, 2025

google / google-java-format

Reformats Java source code to comply with Google Java Style.

Java 5,734 872 Updated Mar 20, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly