Lists (12)
Sort Name ascending (A-Z)
Starred repositories
A toolkit for making real world machine learning and data analysis applications in C++
C++ Parallel Computing and Asynchronous Networking Framework
A fast multi-producer, multi-consumer lock-free concurrent queue for C++11
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
A C++ library for interacting with JSON.
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
High-speed Large Language Model Serving for Local Deployment
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
Implementation of popular deep learning networks with TensorRT network definition API
lightweight, standalone C++ inference engine for Google's Gemma models.
Transformer related optimization, including BERT, GPT
A C++ standalone library for machine learning
A Robust and Versatile Monocular Visual-Inertial State Estimator
The Kalibr visual-inertial calibration toolbox
A fast single-producer, single-consumer lock-free queue for C++
Lightning fast C++/CUDA neural network framework
A lightweight library for portable low-level GPU computation using WebGPU.