Skip to content
View Oneal65's full-sized avatar

Block or report Oneal65

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

20 stars written in C++
Clear filter

An Open Source Machine Learning Framework for Everyone

C++ 190,079 74,675 Updated May 23, 2025

LLM inference in C/C++

C++ 80,718 11,873 Updated May 23, 2025

Productive, portable, and performant GPU programming in Python.

C++ 27,128 2,340 Updated May 21, 2025

Cloud-native high-performance edge/middle/service proxy

C++ 25,985 4,940 Updated May 23, 2025

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 10,910 1,234 Updated Apr 1, 2025

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…

C++ 10,537 1,444 Updated May 23, 2025

中文的C++ Template的教学指南。与知名书籍C++ Templates不同,该系列教程将C++ Templates作为一门图灵完备的语言来讲授,以求帮助读者对Meta-Programming融会贯通。(正在施工中)

C++ 10,145 1,599 Updated Aug 20, 2024

CUDA Templates for Linear Algebra Subroutines

C++ 7,575 1,243 Updated May 20, 2025

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 3,312 259 Updated May 22, 2025

HElib is an open-source software library that implements homomorphic encryption. It supports the BGV scheme with bootstrapping and the Approximate Number CKKS scheme. HElib also includes optimizati…

C++ 3,195 767 Updated Aug 1, 2024

校招、秋招、春招、实习好项目!带你从零实现一个高性能的深度学习推理库,支持大模型 llama2 、Unet、Yolov5、Resnet等模型的推理。Implement a high-performance deep learning inference library step by step

C++ 2,935 331 Updated Oct 26, 2024

DeepRec is a high-performance recommendation deep learning framework based on TensorFlow. It is hosted in incubation in LF AI & Data Foundation.

C++ 1,100 362 Updated Jan 21, 2025

A fast communication-overlapping library for tensor/expert parallelism on GPUs.

C++ 941 60 Updated Apr 15, 2025

A scalable pipeline for designing reconfigurable organisms

C++ 764 167 Updated Feb 19, 2020

Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators

C++ 400 187 Updated May 22, 2025

Perplexity GPU Kernels

C++ 307 33 Updated May 21, 2025

SPU (Secure Processing Unit) aims to be a provable, measurable secure computation device, which provides computation ability while keeping your private data protected.

C++ 286 130 Updated May 23, 2025

A tree-based federated learning system (MLSys 2023)

C++ 147 41 Updated Jan 20, 2025

A simple, light-weight C++ library for unstructured mesh generation in 2-D using Delaunay refinement algorithms

C++ 37 13 Updated Nov 21, 2017