Skip to content
View suhao2's full-sized avatar

Block or report suhao2

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
6 stars written in Cuda
Clear filter

Several optimization methods of half-precision general matrix multiplication (HGEMM) using tensor core with WMMA API and MMA PTX instruction.

Cuda 353 72 Updated Sep 8, 2024

Matrix Multiply-Accumulate with CUDA and WMMA( Tensor Core)

Cuda 124 19 Updated Aug 18, 2020

ECE408 (Applied Parallel Programming) Fall 2022 MP

Cuda 10 5 Updated Mar 24, 2023

19FA ECE408 MP&Project

Cuda 8 3 Updated Jun 22, 2020

UIUC ECE408 Fall 2021 Project

Cuda 4 3 Updated Jun 3, 2022

CUDA WMMA test project

Cuda 3 2 Updated Jun 14, 2018