Lists (1)
Sort Name ascending (A-Z)
Stars
15
stars
written in C++
Clear filter
An Open Source Machine Learning Framework for Everyone
Distribute and run LLMs with a single file.
Mirage: Automatically Generating Fast GPU Kernels without Programming in Triton/CUDA
Demonstration of various hardware effects on CUDA GPUs.
校招、秋招、春招、实习好项目,带你从零动手实现支持LLama2/3和Qwen2.5的大模型推理框架。
TiledCUDA is a highly efficient kernel template library designed to elevate CUDA C’s level of abstraction for processing tiles.
Development repository for the Triton-Linalg conversion
3b1b / moderngl
Forked from moderngl/modernglModern OpenGL binding for python
Ultra low overhead NVIDIA GPU telemetry plugin for telegraf with memory temperature readings.