Skip to content
View Oneal65's full-sized avatar

Block or report Oneal65

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Official inference repo for FLUX.1 models

Python 21,753 1,541 Updated Feb 6, 2025

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 13,660 1,958 Updated May 20, 2025

[TMLR 2024] Efficient Large Language Models: A Survey

1,155 94 Updated Apr 1, 2025

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA/Tensor Cores Kernels, HGEMM, FA-2 MMA etc.🔥

Cuda 4,427 466 Updated May 17, 2025

Xray panel supporting multi-protocol multi-user expire day & traffic & IP limit (Vmess & Vless & Trojan & ShadowSocks & Wireguard)

JavaScript 19,624 4,096 Updated May 22, 2025

FlashInfer: Kernel Library for LLM Serving

Cuda 3,019 310 Updated May 22, 2025

Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators

C++ 400 187 Updated May 22, 2025

how to optimize some algorithm in cuda.

Cuda 2,200 192 Updated May 23, 2025

A fast communication-overlapping library for tensor/expert parallelism on GPUs.

C++ 941 60 Updated Apr 15, 2025

✔(已完结)最全面的 深度学习 笔记【土堆 Pytorch】【李沐 动手学深度学习】【吴恩达 深度学习】

Jupyter Notebook 10,237 1,249 Updated May 17, 2025

中文的C++ Template的教学指南。与知名书籍C++ Templates不同,该系列教程将C++ Templates作为一门图灵完备的语言来讲授,以求帮助读者对Meta-Programming融会贯通。(正在施工中)

C++ 10,145 1,599 Updated Aug 20, 2024

Perplexity GPU Kernels

C++ 307 33 Updated May 21, 2025

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 21,180 2,479 Updated Apr 30, 2025

📝A simple and elegant markdown editor, available for Linux, macOS and Windows.

JavaScript 49,758 3,653 Updated Aug 18, 2024

An extremely fast Python package and project manager, written in Rust.

Rust 55,293 1,553 Updated May 22, 2025

解决Cursor在免费订阅期间出现以下提示的问题: You've reached your trial request limit. / Too many free trial accounts used on this machine. Please upgrade to pro. We have this limit in place to prevent abuse. Please l…

Shell 21,635 2,670 Updated May 23, 2025

Material for gpu-mode lectures

Jupyter Notebook 4,471 450 Updated Feb 9, 2025

📚A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, FlashAttention, PagedAttention, Parallelism, MLA, etc.

Python 4,027 277 Updated May 18, 2025

一种任务级GPU算力分时调度的高性能深度学习训练平台

Python 646 89 Updated Oct 24, 2023

The official GitHub page for the survey paper "A Survey on Mixture of Experts in Large Language Models".

357 20 Updated Mar 12, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 14,572 1,820 Updated May 23, 2025

A throughput-oriented high-performance serving framework for LLMs

Cuda 812 37 Updated May 10, 2025

My learning notes/codes for ML SYS.

Python 2,244 139 Updated May 22, 2025

Integrate the DeepSeek API into popular softwares

32,423 3,565 Updated May 13, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,776 277 Updated May 15, 2025

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 3,312 259 Updated May 22, 2025

🤖 𝗟𝗲𝗮𝗿𝗻 for 𝗳𝗿𝗲𝗲 how to 𝗯𝘂𝗶𝗹𝗱 an end-to-end 𝗽𝗿𝗼𝗱𝘂𝗰𝘁𝗶𝗼𝗻-𝗿𝗲𝗮𝗱𝘆 𝗟𝗟𝗠 & 𝗥𝗔𝗚 𝘀𝘆𝘀𝘁𝗲𝗺 using 𝗟𝗟𝗠𝗢𝗽𝘀 best practices: ~ 𝘴𝘰𝘶𝘳𝘤𝘦 𝘤𝘰𝘥𝘦 + 12 𝘩𝘢𝘯𝘥𝘴-𝘰𝘯 𝘭𝘦𝘴𝘴𝘰𝘯𝘴

Python 3,892 645 Updated Apr 26, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 14,123 997 Updated May 23, 2025

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.

Go 141,379 11,839 Updated May 23, 2025

强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/

Jupyter Notebook 11,349 2,029 Updated May 13, 2025
Next