Fixes the following messages that Cursor shows during the free subscription period: You've reached your trial request limit. / Too many free trial accounts used on this machine. Please upgrade to pro. We have this limit in place to prevent abuse. Please l…

Shell 21,635 2,670 Updated May 23, 2025

gpu-mode / lectures

Material for gpu-mode lectures

Jupyter Notebook 4,471 450 Updated Feb 9, 2025

xlite-dev / Awesome-LLM-Inference

📚A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, FlashAttention, PagedAttention, Parallelism, MLA, etc.

Python 4,027 277 Updated May 18, 2025

HFAiLab / hai-platform

A high-performance deep learning training platform with task-level time-sharing scheduling of GPU compute

Python 646 89 Updated Oct 24, 2023

withinmiaov / A-Survey-on-Mixture-of-Experts-in-LLMs

The official GitHub page for the survey paper "A Survey on Mixture of Experts in Large Language Models".

357 20 Updated Mar 12, 2025

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 14,572 1,820 Updated May 23, 2025

efeslab / Nanoflow

A throughput-oriented high-performance serving framework for LLMs

Cuda 812 37 Updated May 10, 2025

zhaochenyang20 / Awesome-ML-SYS-Tutorial

My learning notes/codes for ML SYS.

Python 2,244 139 Updated May 22, 2025

deepseek-ai / awesome-deepseek-integration

Integrate the DeepSeek API into popular software

32,423 3,565 Updated May 13, 2025

deepseek-ai / open-infra-index

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,776 277 Updated May 15, 2025

kvcache-ai / Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 3,312 259 Updated May 22, 2025

decodingml / llm-twin-course

🤖 Learn for free how to build an end-to-end production-ready LLM & RAG system using LLMOps best practices: ~ source code + 12 hands-on lessons

Python 3,892 645 Updated Apr 26, 2025

kvcache-ai / ktransformers

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 14,123 997 Updated May 23, 2025

ollama / ollama

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.

Go 141,379 11,839 Updated May 23, 2025

datawhalechina / easy-rl

A Chinese-language reinforcement learning tutorial (the "Mushroom Book" 🍄). Read online at: https://datawhalechina.github.io/easy-rl/

Jupyter Notebook 11,349 2,029 Updated May 13, 2025

Starred topics

golang

Server

Shell

Scala

Linux

Kubernetes

Java

Go

Git

unity

Oneal65

Lists (13)

AI

Big_data

Book_and-paper

C++/C

Code_pratice

DB

FL

K8s

math_or_basics

other

parallel_compute

Systerm

tool

Starred repositories