Stars
A highly optimized LLM inference acceleration engine for Llama and its variants.
Universal LLM Deployment Engine with ML Compilation
Sockets, timers, resolvers, events, reactors, proactors, and thread pools for asynchronous network programming
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
An awesome & curated list of best LLMOps tools for developers
A machine learning compiler for GPUs, CPUs, and ML accelerators
Reasoning in LLMs: Papers and Resources, including Chain-of-Thought, OpenAI o1, and DeepSeek-R1 🍓
This repository contains a hand-curated resources for Prompt Engineering with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM etc
A modern, C++-native, test framework for unit-tests, TDD and BDD - using C++14, C++17 and later (C++11 support is in v2.x branch, and C++03 on the Catch1.x branch)
A distributed, fast open-source graph database featuring horizontal scalability and high availability
A high-performance, zero-overhead, extensible Python compiler with built-in NumPy support
A list of awesome compiler projects and papers for tensor computation and deep learning.
Collections of vector search related libraries, service and research papers
Rust native ready-to-use NLP pipelines and transformer-based models (BERT, DistilBERT, GPT2,...)
zimo-mo / akg
Forked from mindspore-ai/akgAKG (Auto Kernel Generator) is an optimizer for operators in Deep Learning Networks, which provides the ability to automatically fuse ops with specific patterns.
zimo-mo / mindspore
Forked from mindspore-ai/mindsporeMindSpore is a new open source deep learning training/inference framework that could be used for mobile, edge and cloud scenarios.
A curated list of pretrained sentence and word embedding models
A curated list of the latest breakthroughs in AI (in 2021) by release date with a clear video explanation, link to a more in-depth article, and code.
Chinese translation of Bjarne Stroustrup's HOPL4 paper
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Learning embeddings for classification, retrieval and ranking.
A collection of C++ HTTP libraries including an easy to use HTTP server.
Drogon: A C++14/17/20 based HTTP web application framework running on Linux/macOS/Unix/Windows
FlatBuffers: Memory Efficient Serialization Library