
Highlights

  • Pro

Organizations

@CambioML

Starred repositories

5 stars written in Cuda

LLM training in simple, raw C/CUDA

Cuda · 25,492 stars · 2,929 forks · Updated Oct 2, 2024

📚 200+ Tensor/CUDA Core kernels, ⚡️ flash-attn-mma, ⚡️ hgemm with WMMA, MMA, and CuTe (reaching 98%~100% of cuBLAS/FA2 TFLOPS 🎉).

Cuda · 2,282 stars · 244 forks · Updated Feb 7, 2025

FlashInfer: Kernel Library for LLM Serving

Cuda · 1,986 stars · 202 forks · Updated Feb 12, 2025

CUDA Library Samples

Cuda · 1,768 stars · 364 forks · Updated Jan 28, 2025

Quantized attention that achieves speedups of 2.1-3.1x and 2.7-5.1x over FlashAttention2 and xformers, respectively, without losing end-to-end metrics across various models.

Cuda · 943 stars · 58 forks · Updated Jan 30, 2025