TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.
Transformer related optimization, including BERT, GPT
🛠 A lightweight C++ AI toolkit: 100+🎉 models (Stable-Diffusion, FaceFusion, YOLO series, Det, Seg, Matting) with MNN, ORT and TensorRT.
fastllm is a high-performance LLM inference library implemented in C++ with no backend dependencies (it relies only on CUDA, with no dependency on PyTorch). It can run inference on the DeepSeek R1 671B INT4 model on a single 4090, reaching 20+ tps per stream.
Fast implementation of BERT inference directly on NVIDIA (CUDA, CUBLAS) and Intel MKL
A simple Transformer model implemented in C++. Attention Is All You Need.