Stars
6
stars
written in Cuda
Clear filter
Several optimization methods of half-precision general matrix multiplication (HGEMM) using tensor core with WMMA API and MMA PTX instruction.
Matrix Multiply-Accumulate with CUDA and WMMA( Tensor Core)
19FA ECE408 MP&Project