pesionzhao

C++ and Linux study notes

C++ 2,145 344 Updated Feb 14, 2022

Solve puzzles. Learn CUDA.

Jupyter Notebook 10,630 820 Updated Sep 1, 2024

A minimal cuDNN deep learning training code sample using LeNet.

Cuda 264 92 Updated Jul 30, 2023

FlashMLA: Efficient MLA decoding kernels

C++ 11,132 771 Updated Mar 1, 2025

⚡ Dynamically generated stats for your github readmes

JavaScript 71,793 23,965 Updated Mar 5, 2025

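Deep learning systems notes, covering the mathematical foundations of deep learning, detailed explanations of basic neural network components, deep learning training strategies, and detailed explanations of model compression algorithms.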

Python 437 62 Updated Feb 26, 2025

A lightweight llama-like LLM inference framework based on Triton kernels.

Python 94 9 Updated Mar 4, 2025
Python 12 3 Updated Mar 2, 2025

Samples for CUDA developers that demonstrate features in the CUDA Toolkit

C 7,056 1,946 Updated Mar 5, 2025

Lightning fast C++/CUDA neural network framework

C++ 3,908 483 Updated Jan 27, 2025

Convolutional Neural Networks

C 26,051 21,323 Updated May 3, 2024

Hand-written CUDA operators and interview guide

Cuda 179 18 Updated Jan 15, 2025

An easy-to-understand TensorOp Matmul tutorial

C++ 324 35 Updated Sep 21, 2024

Some HPC projects for learning

Cuda 20 3 Updated Aug 28, 2024

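Dedicated to strategies for landing internships, campus-hire, and experienced-hire positions at major tech companies: computer science fundamentals and learning roadmaps for C++, Java, and algorithms, with a focus on programming.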

1,335 79 Updated Feb 18, 2025

Efficient Triton Kernels for LLM Training

Python 4,557 274 Updated Mar 5, 2025

LLM101n: Let's build a Storyteller (Chinese edition)

C++ 126 14 Updated Aug 15, 2024

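《开源大模型食用指南》 (A Guide to Using Open-Source Large Models): a tutorial tailored for Chinese beginners on quickly fine-tuning (full-parameter/LoRA) and deploying Chinese and international open-source large language models (LLM) and multimodal large language models (MLLM) in a Linux environment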

Jupyter Notebook 13,403 1,546 Updated Feb 28, 2025

Awesome-LLM: a curated list of Large Language Models

21,862 1,786 Updated Mar 4, 2025

A 100 Day ML Challenge to learn and implement ML/DL concepts ranging from the basics of Machine Learning to advanced Deep Learning.

Jupyter Notebook 144 62 Updated Aug 23, 2022

HIP: C++ Heterogeneous-Compute Interface for Portability

C++ 3,915 548 Updated Mar 4, 2025

Seismic Reservoir Modeling Python package

Python 95 38 Updated Jul 11, 2024

A self-learning tutorial for CUDA High Performance Programming.

JavaScript 410 46 Updated Dec 17, 2024

📖 A Chinese translation of "C++ Concurrency in Action - SECOND EDITION".

2,175 450 Updated Jan 26, 2021

Material for gpu-mode lectures

Jupyter Notebook 3,895 396 Updated Feb 9, 2025

A good project for campus, autumn, and spring recruitment and for internships! Implement a high-performance deep learning inference library from scratch, step by step, supporting inference for models such as llama2, Unet, Yolov5, and Resnet.

C++ 2,751 310 Updated Oct 26, 2024

LLM training in simple, raw C/CUDA

Cuda 25,916 2,974 Updated Oct 2, 2024

Implement a tiny STL in C++11

C++ 11,777 3,291 Updated Oct 27, 2024

Modern and efficient C++ Thread Pool Library

C++ 1,897 345 Updated Jan 26, 2023

How to Make a Computer Operating System in C++

C 21,683 3,445 Updated Dec 16, 2021