Highlights
- Pro
Lists (4)
Sort Name ascending (A-Z)
Stars
Scalable, fast, and disk-friendly vector search in Postgres, the successor of pgvecto.rs.
Open deep learning compiler stack for Kendryte AI accelerators ✨
This is an online course where you can learn and master the skill of low-level performance analysis and tuning.
The book "Performance Analysis and Tuning on Modern CPU"
Experimental KVM-based type-2 hypervisor in Rust implemented from scratch.
校招、秋招、春招、实习好项目!带你从零实现一个高性能的深度学习推理库,支持大模型 llama2 、Unet、Yolov5、Resnet等模型的推理。Implement a high-performance deep learning inference library step by step
Next-gen compile-time-checked builder generator, named function's arguments, and more!
A tool to generate ergonomic, buffer-based C++ APIs.
Hyperlight is a lightweight Virtual Machine Manager (VMM) designed to be embedded within applications. It enables safe execution of untrusted code within micro virtual machines with very low latenc…
Fast OS-level support for GPU checkpoint and restore
Any model. Any hardware. Zero compromise. Built with @ziglang / @openxla / MLIR / @bazelbuild
A CPU tool for benchmarking the peak of floating points
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
A high-performance, zero-overhead, extensible Python compiler using LLVM
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
A new operating system kernel with Linux binary compatibility written in Rust.
This is the implementation repository of our SOSP'24 paper: Aceso: Achieving Efficient Fault Tolerance in Memory-Disaggregated Key-Value Stores.