Skip to content
View ysjyx7's full-sized avatar

Block or report ysjyx7

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

SGLang is a fast serving framework for large language models and vision language models.

Python 11,487 1,158 Updated Mar 7, 2025

A lightweight data processing framework built on DuckDB and 3FS.

Python 3,851 312 Updated Mar 5, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 7,555 652 Updated Mar 6, 2025

A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.

Python 2,505 239 Updated Mar 5, 2025

Analyze computation-communication overlap in V3/R1.

886 113 Updated Mar 3, 2025

Expert Parallelism Load Balancer

Python 1,019 146 Updated Feb 27, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 4,799 464 Updated Mar 5, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,029 600 Updated Mar 6, 2025

FlashMLA: Efficient MLA decoding kernels

C++ 11,166 774 Updated Mar 1, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

6,635 186 Updated Mar 4, 2025

Fast Multimodal LLM on Mobile Devices

C++ 728 87 Updated Mar 3, 2025

Grok open release

Python 50,214 8,369 Updated Aug 30, 2024

Github Pages template based upon HTML and Markdown for personal, portfolio-based websites.

JavaScript 13,442 46,008 Updated Mar 2, 2025

High-speed Large Language Model Serving for Local Deployment

C++ 8,138 424 Updated Feb 19, 2025

The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text

C++ 3,248 316 Updated Mar 6, 2025

A composable and fully extensible C++ execution engine library for data management systems.

C++ 3,642 1,224 Updated Mar 7, 2025

Falcon: Fast OLTP Engine for Persistent Cache and Non-Volatile Memory

Rust 11 Updated Nov 1, 2023

Distributed reliable key-value store for the most critical data of a distributed system

Go 48,564 9,892 Updated Mar 6, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 40,516 6,097 Updated Mar 7, 2025

A persistent key-value storage in rust.

Rust 856 77 Updated Apr 16, 2024

the champagne of beta embedded databases

Rust 8,342 392 Updated Dec 27, 2024

AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents

Python 14,993 2,035 Updated Mar 6, 2025

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 12,760 1,842 Updated Mar 1, 2025

Official Code for DragGAN (SIGGRAPH 2023)

Python 35,879 3,453 Updated May 18, 2024

Must-read Papers for File System (FS)

270 25 Updated Mar 7, 2025

Database system for AI-powered apps

Python 2,658 264 Updated May 17, 2024

A list of papers about distributed consensus.

2,553 214 Updated Aug 8, 2024

Simple key-value store abstraction and implementations for Go (Redis, Consul, etcd, bbolt, BadgerDB, LevelDB, Memcached, DynamoDB, S3, PostgreSQL, MongoDB, CockroachDB and many more)

Go 772 72 Updated Dec 11, 2024
Next