Starred repositories
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
Libtpa (Transport Protocol Acceleration), a DPDK-based userspace TCP stack implementation.
RoCEv2 hardware implementation in Bluespec SystemVerilog
DeepEP: an efficient expert-parallel communication library
High performance container overlay networks on Linux. Enabling RDMA (on both InfiniBand and RoCE) and accelerating TCP to bare metal performance. Freeflow requires zero modification on application …
Notes taken by zweix while learning computer-related knowledge
High performance RDMA-based distributed feature collection component for training GNN models on EXTREMELY large graphs
A toolchain to make building and running eBPF programs easier
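AISystem mainly refers to AI systems, covering full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks
Companion source code and errata list for the book 《深入理解Linux进程与内存》 (In-Depth Understanding of Linux Processes and Memory)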
Contains the source code examples described in the "Intel® 64 and IA-32 Architectures Optimization Reference Manual"
First-Class GPU Resource Management: Device Drivers, Runtimes, and CUDA Compilers for Nouveau.
NVIDIA Linux open GPU kernel module source
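《动手学深度学习》 (Dive into Deep Learning): written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.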
The reference implementation of the Linux FUSE (Filesystem in Userspace) interface
ericqzhao / spdk-pf
Forked from spdk/spdk
Storage Performance Development Kit
oneAPI Collective Communications Library (oneCCL)