-
Microsoft
Stars
4
stars
written in Cuda
Clear filter
FlashInfer: Kernel Library for LLM Serving
how to optimize some algorithm in cuda.
[ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl