Skip to content
View luliyucoordinate's full-sized avatar
🏅
Focusing
🏅
Focusing
  • hangzhou

Organizations

@llcv

Block or report luliyucoordinate

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
73 stars written in C++
Clear filter

LLM inference in C/C++

C++ 72,780 10,485 Updated Feb 2, 2025

MLX: An array framework for Apple silicon

C++ 18,776 1,073 Updated Feb 3, 2025

Development repository for the Triton language and compiler

C++ 14,241 1,763 Updated Feb 3, 2025

Official inference framework for 1-bit LLMs

C++ 12,685 886 Updated Dec 20, 2024

Tensor library for machine learning

C++ 11,707 1,104 Updated Jan 29, 2025

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 9,280 1,089 Updated Feb 2, 2025

High performance server-side application framework

C++ 8,514 1,580 Updated Feb 2, 2025

CUDA Templates for Linear Algebra Subroutines

C++ 6,109 1,053 Updated Feb 2, 2025

An industrial-grade C++ implementation of RAFT consensus algorithm based on brpc, widely used inside Baidu to build highly-available distributed systems.

C++ 4,037 893 Updated Oct 25, 2024

Lightning fast C++/CUDA neural network framework

C++ 3,856 475 Updated Jan 27, 2025

10x faster matrix and vector operations

C++ 2,480 171 Updated Oct 12, 2022

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 2,456 146 Updated Jan 24, 2025

Curve is a sandbox project hosted by the CNCF Foundation. It's cloud-native, high-performance, and easy to operate. Curve is an open-source distributed storage system for block and shared file stor…

C++ 2,344 526 Updated Aug 13, 2024

A lightning fast Finite State machine and REgular expression manipulation library.

C++ 1,834 131 Updated Dec 8, 2024

Simple, light-weight and easy-to-use asynchronous components

C++ 1,812 267 Updated Jan 23, 2025

Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training

C++ 1,750 234 Updated Feb 2, 2025

A collection of modern C++ libraries, include coro_rpc, struct_pack, struct_json, struct_xml, struct_pb, easylog, async_simple

C++ 1,660 254 Updated Jan 27, 2025

SCUDA is a GPU over IP bridge allowing GPUs on remote machines to be attached to CPU-only machines.

C++ 1,602 53 Updated Jan 28, 2025

a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.

C++ 1,504 199 Updated Jun 12, 2023

CUDA Core Compute Libraries

C++ 1,409 185 Updated Feb 3, 2025

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/

C++ 1,249 529 Updated Feb 2, 2025

GPGPU-Sim provides a detailed simulation model of contemporary NVIDIA GPUs running CUDA and/or OpenCL workloads. It includes support for features such as TensorCores and CUDA Dynamic Parallelism as…

C++ 1,210 526 Updated Aug 21, 2024

A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology

C++ 933 147 Updated Dec 16, 2024

Probably the fastest coroutine lib in the world!

C++ 929 129 Updated Jan 22, 2025

A highly optimized LLM inference acceleration engine for Llama and its variants.

C++ 842 100 Updated Jan 24, 2025

Simple, portable, and self-contained stacktrace library for C++11 and newer

C++ 798 82 Updated Feb 3, 2025

A performant and modular runtime for TensorFlow

C++ 759 122 Updated Jan 24, 2025

Mirage: Automatically Generating Fast GPU Kernels without Programming in Triton/CUDA

C++ 724 40 Updated Jan 30, 2025

Fastest RPC in the west

C++ 723 68 Updated Apr 12, 2023

The Legion Parallel Programming System

C++ 705 146 Updated Jan 7, 2025
Next