Stars
Artifact from "Hardware Compute Partitioning on NVIDIA GPUs". This is a fork of Bakita's repo.
SGLang is a fast serving framework for large language models and vision language models.
📰 Must-read papers on KV Cache Compression (constantly updating 🤗).
A tiny yet powerful LLM inference system tailored for research purposes. vLLM-equivalent performance with only 2k lines of code (2% of vLLM).
Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
GPGPU-Sim provides a detailed simulation model of contemporary NVIDIA GPUs running CUDA and/or OpenCL workloads. It includes support for features such as TensorCores and CUDA Dynamic Parallelism as…
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
Curated collection of papers in machine learning systems
Free and Open Source telemetry overlay application for racing simulation
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
Python tool for converting files and office documents to Markdown.
A highly optimized LLM inference acceleration engine for Llama and its variants.
Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/)
Codebase and steps for artifact evaluation/reproduction for a MICRO 2024 paper
DLRover: An Automatic Distributed Deep Learning System
Free images for EVE-NG and GNS3 containing routers, switches, firewalls and other appliances, including Cisco, Fortigate, Palo Alto, Sophos and more. Master the art of networking and improve your sk…
Penn CIS 5650 (GPU Programming and Architecture) Final Project
User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)
A tool for examining GPU scheduling behavior.
Enable macOS HiDPI and have a native setting.
Automatically switches between the dark and light theme of Windows 10 and Windows 11
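Many container images are hosted overseas, for example on gcr, and downloading them from within China is slow and needs acceleration. This project aims to provide a stable, reliable, and secure container image service that connects to the whole world.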