huang-baixin

Follow

HuangBaixin huang-baixin

Follow

[email protected]

4 followers · 11 following

Beijing
23:16 (UTC +08:00)

huang-baixin/README.md

Hi I'm Baixin 👋

📝 I’m currently working on LLM-inference
💻 I’m currently learning AI-Infra

Pinned Loading

llama.cpp llama.cpp Public

Forked from ggml-org/llama.cpp

LLM inference in C/C++

C++
Awesome-LLM-Inference Awesome-LLM-Inference Public

Forked from xlite-dev/Awesome-LLM-Inference

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.
EAGLE EAGLE Public

Forked from SafeAILab/EAGLE

Official Implementation of EAGLE-1 (ICML'24) and EAGLE-2 (EMNLP'24)

Python
cuda_practices cuda_practices Public

Cuda
how-to-optim-algorithm-in-cuda how-to-optim-algorithm-in-cuda Public

Forked from BBuf/how-to-optim-algorithm-in-cuda

how to optimize some algorithm in cuda.

Cuda