fxmarty-amd: 16 stars written in C++

Notepad++ official repository

C++ 23,766 4,696 Updated Feb 9, 2025

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

C++ 622 55 Updated Jan 21, 2025
C++ 467 70 Updated Dec 10, 2024

Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators

C++ 340 146 Updated Feb 12, 2025

A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")

C++ 300 55 Updated Feb 12, 2025

A fast communication-overlapping library for tensor parallelism on GPUs.

C++ 292 25 Updated Oct 30, 2024

chipStar is a tool for compiling and running HIP/CUDA on SPIR-V via OpenCL or Level Zero APIs.

C++ 248 35 Updated Feb 6, 2025

Fast Matrix Multiplications for Lookup Table-Quantized LLMs

C++ 227 8 Updated Feb 12, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels

C++ 211 17 Updated Feb 11, 2025

Online compiler for HIP and NVIDIA® CUDA® code to WebGPU

C++ 137 1 Updated Jan 8, 2025

ROCm BLAS marshalling library

C++ 131 81 Updated Feb 11, 2025
C++ 117 54 Updated Feb 11, 2025

AMD SMI

C++ 53 31 Updated Feb 12, 2025

AMD’s C++ library for accelerating tensor primitives

C++ 37 21 Updated Feb 5, 2025