Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Stars
Development repository for the Triton language and compiler
Transformer related optimization, including BERT, GPT
Optimized primitives for collective multi-GPU communication
A machine learning compiler for GPUs, CPUs, and ML accelerators
Verilator open-source SystemVerilog simulator and lint system
FlexFlow Serve: Low-Latency, High-Performance LLM Serving
Set of Python bindings to C++ libraries which provides full HW acceleration for video decoding, encoding and GPU-accelerated color space and pixel format conversions
A Fast and Extensible DRAM Simulator, with built-in support for modeling many different DRAM technologies including DDRx, LPDDRx, GDDRx, WIOx, HBMx, and various academic proposals. Described in the…
A fast simulator and a library dedicated to the channel coding.
An integrated cache and memory access time, cycle time, area, leakage, and dynamic power model
The source code of NuevoMatch as described in "A Computational Approach to Packet Classification" (SIGCOMM, 2020)
[IEEE/ACM Trans. Netw. 2019] Provisioning Short-Term Traffic Fluctuations in Elastic Optical Networks