CSAPP (Computer Systems: A Programmer's Perspective) study notes and code for the end-of-chapter exercises

94 12 Updated Jun 22, 2018

The official GitHub page for the survey paper "A Survey on Mixture of Experts in Large Language Models".

270 18 Updated Jan 21, 2025

Lab for Parallel computing (USTC COMP6201P)

C 22 Updated Nov 24, 2023

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 2,751 168 Updated Mar 6, 2025

My learning notes/codes for ML SYS.

Python 1,292 68 Updated Mar 7, 2025

Homework solutions for CSAPP (a.k.a. Computer Systems: A Programmer's Perspective), Third Edition.

C 42 6 Updated Mar 28, 2018

AIOS: AI Agent Operating System

Python 3,896 475 Updated Mar 6, 2025

Simple example of how to write an Implicit GEMM Convolution in CUDA using the tensor core WMMA API and bindings for PyTorch.

Cuda 15 1 Updated Jun 29, 2023

My personal vim/neovim configuration files, dotfiles, docs and other scripts.

Vim Script 13 Updated Mar 6, 2025

LaTeX thesis template designed to meet the 2024 graduation thesis (design) formatting requirements

TeX 7 Updated Mar 14, 2024

📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).

Cuda 2,733 283 Updated Mar 4, 2025

A course of building an LSM-Tree storage engine (database) in a week.

Rust 3,184 447 Updated Mar 2, 2025

A large collection of CUDA/TensorRT cases for learning CUDA/TensorRT

C++ 126 145 Updated Jul 24, 2022

play gemm with tvm

Cuda 89 10 Updated Jul 22, 2023

Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity

Cuda 201 16 Updated Sep 24, 2023

Chinese edition of the MIT xv6 documentation

3,376 758 Updated Nov 12, 2023

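Training program for new students of the Distributed Storage and Computing Lab at the University of Electronic Science and Technology of China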

895 165 Updated Sep 9, 2024

Notes I made while reading papers, covering databases, distributed systems, and HPC.

164 24 Updated Sep 25, 2022

CS149 xmake version

C++ 43 1 Updated Nov 30, 2023

Learning materials for Stanford CS149 : Parallel Computing

C 206 32 Updated Jul 31, 2021

Vim, but haunted

Python 16 Updated Nov 7, 2023
Cuda 32 12 Updated Aug 24, 2022

A library of GPU kernels for sparse matrix operations.

C++ 259 52 Updated Nov 24, 2020

A GPU-driven system framework for scalable AI applications

C++ 112 17 Updated Feb 5, 2025

An Easy-to-understand TensorOp Matmul Tutorial

C++ 324 36 Updated Sep 21, 2024

A parallel image-blurring program written in CUDA

Python 3 Updated Oct 31, 2023

Personal blog

HTML 1 Updated Mar 5, 2025
Cuda 109 29 Updated Apr 11, 2024

IPADS Lab newcomer training, lecture 2: CMake (2021.11.3)

C++ 627 83 Updated Feb 16, 2025