Stars
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
The Official Implementation of Ada-KV: Optimizing KV Cache Eviction by Adaptive Budget Allocation for Efficient LLM Inference
A guidance language for controlling large language models.
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
An experimentation platform for LLM inference optimisation
A 16× reduction in memory accesses with nearly no accuracy loss
This is the implementation repository of our OSDI'23 paper: SMART: A High-Performance Adaptive Radix Tree for Disaggregated Memory.
[Start here!] Flow-IPC - Modern C++ toolkit for high-speed inter-process communication (IPC)
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
SGLang is a fast serving framework for large language models and vision language models.
This project shares the technical principles behind large language models along with practical experience (LLM engineering and real-world application deployment).
A General-purpose Task-parallel Programming System using Modern C++
InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)
A large-scale simulation framework for LLM inference
Disaggregated serving system for Large Language Models (LLMs).
Since the emergence of ChatGPT in 2022, accelerating Large Language Models has become increasingly important. Here is a list of papers on accelerating LLMs, currently focusing mainly on inference.
📖A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, Flash-Attention, Paged-Attention, MLA, Parallelism, Prefix-Cache, Chunked-Prefill, etc. 🎉🎉
A pure C++ cross-platform LLM acceleration library with Python bindings; ChatGLM-6B-class models can reach 10,000+ tokens/s on a single GPU; supports GLM, LLaMA, and MOSS base models and runs smoothly on mobile devices
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.