Skip to content
View rhmaaa's full-sized avatar

Block or report rhmaaa

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

AIFoundation 主要是指AI系统遇到大模型,从底层到上层如何系统级地支持大模型训练和推理,全栈的核心技术。

Python 639 88 Updated Jan 16, 2025

code reading for tvm

Python 72 25 Updated Jan 20, 2022

examples for tvm schedule API

Python 99 36 Updated Jun 12, 2023

This is the top-level repository for the Accel-Sim framework.

Python 324 120 Updated Oct 23, 2024

A bunch of coding tutorials for my Youtube videos on Neural Network Quantization.

Jupyter Notebook 14 6 Updated May 21, 2024

Course materials for MIT6.5940: TinyML and Efficient Deep Learning Computing

Jupyter Notebook 23 1 Updated Jan 8, 2025

《Effective Modern C++》- 完成翻译

7,984 1,193 Updated Nov 8, 2024

Achieve a tiny STL in C++11

C++ 11,637 3,277 Updated Oct 27, 2024

Fast SpMM implementation on GPUs for GNN (IPDPS'23)

C++ 6 Updated Dec 31, 2023

study of Ampere' Sparse Matmul

Cuda 16 4 Updated Jan 10, 2021

21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Jupyter Notebook 66,395 34,388 Updated Jan 15, 2025

Code for ACM MobiCom 2024 paper "FlexNN: Efficient and Adaptive DNN Inference on Memory-Constrained Edge Devices"

C 45 3 Updated Oct 13, 2024

PyTorch tutorials.

Jupyter Notebook 8,345 4,097 Updated Jan 14, 2025

健康学习到150岁 - 人体系统调优不完全指南

13,370 975 Updated May 9, 2024

中文的C++ Template的教学指南。与知名书籍C++ Templates不同,该系列教程将C++ Templates作为一门图灵完备的语言来讲授,以求帮助读者对Meta-Programming融会贯通。(正在施工中)

C++ 9,890 1,589 Updated Aug 20, 2024

A small C compiler

C 9,829 892 Updated Oct 30, 2023

heterogeneity-aware-lowering-and-optimization

C++ 254 75 Updated Jan 20, 2024

Solve puzzles. Learn CUDA.

Jupyter Notebook 10,333 889 Updated Sep 1, 2024

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 2,371 136 Updated Jan 16, 2025

BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.

C++ 834 163 Updated Dec 30, 2024

Inference Llama 2 in one file of pure C

C 17,853 2,161 Updated Aug 6, 2024

Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O

C++ 211 18 Updated Jan 15, 2025

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 4,747 549 Updated Oct 22, 2024

Advanced Scalable Systems for X

29 1 Updated Dec 3, 2024

This repository contains tutorials and examples for Triton Inference Server

Python 623 101 Updated Jan 13, 2025

Tile primitives for speedy kernels

Cuda 1,931 95 Updated Jan 14, 2025
Cuda 36 5 Updated Jan 15, 2025

My learning notes/codes for ML SYS.

Python 364 13 Updated Jan 14, 2025

Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training

C++ 1,749 234 Updated Jan 16, 2025

PKU OS course project and notes based on Nachos and XV6

C++ 173 45 Updated Dec 8, 2020
Next