Skip to content
View HanLi05869's full-sized avatar

Highlights

  • Pro

Block or report HanLi05869

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

CSAPP 《深入理解计算机系统》 学习笔记 课后习题代码

95 12 Updated Jun 22, 2018

The official GitHub page for the survey paper "A Survey on Mixture of Experts in Large Language Models".

201 13 Updated Jan 15, 2025

Lab for Parallel computing (USTC COMP6201P)

C 23 Updated Nov 24, 2023

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 2,374 137 Updated Jan 17, 2025

My learning notes/codes for ML SYS.

Python 372 13 Updated Jan 14, 2025

Homework solutions for CSAPP (a.k.a. Computer System A Programmer's Perspective) Third Edition.

C 41 6 Updated Mar 28, 2018

AIOS: AI Agent Operating System

Python 3,680 448 Updated Jan 17, 2025

Simple example of how to write an Implicit GEMM Convolution in CUDA using the tensor core WMMA API and bindings for PyTorch.

Cuda 14 1 Updated Jun 29, 2023

My personal vim/neovim configuration files, dotfiles, docs and other scripts.

Vim Script 13 Updated Jan 16, 2025

根据2024年毕业论文(设计)格式要求设计过的Latex论文模板

TeX 7 Updated Mar 14, 2024

📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).

Cuda 1,984 207 Updated Jan 17, 2025

A tutorial of building an LSM-Tree storage engine (database) in a week.

Rust 3,098 436 Updated Jan 14, 2025

A large number of cuda/tensorrt cases . 大量案例来学习cuda/tensorrt

C++ 118 140 Updated Jul 24, 2022

play gemm with tvm

Cuda 85 10 Updated Jul 22, 2023

Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity

Cuda 195 16 Updated Sep 24, 2023

中文版的 MIT xv6 文档

3,359 762 Updated Nov 12, 2023

电子科技大学分布式存储与计算实验室新生训练计划

891 167 Updated Sep 9, 2024

notes i made while reading the papers. Including database, distributed systems and HPC.

155 24 Updated Sep 25, 2022

CS149 xmake version

C++ 41 1 Updated Nov 30, 2023

Learning materials for Stanford CS149 : Parallel Computing

C 195 28 Updated Jul 31, 2021

Vim, but haunted

Python 16 Updated Nov 7, 2023
Cuda 32 12 Updated Aug 24, 2022

A library of GPU kernels for sparse matrix operations.

C++ 251 52 Updated Nov 24, 2020

A GPU-driven system framework for scalable AI applications

C++ 111 16 Updated Oct 8, 2024

A Easy-to-understand TensorOp Matmul Tutorial

C++ 306 34 Updated Sep 21, 2024

使用 cuda 编写的并行图像模糊化处理程序

Python 3 Updated Oct 31, 2023

Personal blog

HTML 1 Updated Nov 18, 2024
Cuda 108 30 Updated Apr 11, 2024

IPADS 实验室新人培训第二讲:CMake(2021.11.3)

C++ 620 83 Updated Apr 21, 2024
Next