Stars
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on a single machine, Hadoop, Spark, Dask, Flink and DataFlow.
A fast multi-producer, multi-consumer lock-free concurrent queue for C++11
Gaming meets modern C++ - a fast and reliable entity component system (ECS) and much more
HIP: C++ Heterogeneous-Compute Interface for Portability
Jupyter kernel for the C++ programming language
A retargetable MLIR-based machine learning compiler and runtime toolkit.
A family of header-only, very fast and memory-friendly hashmap and btree containers.
General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optimized for…
PositionBasedDynamics is a library for the physically-based simulation of rigid bodies, deformable solids and fluids.
Nearest Neighbor Search with Neighborhood Graph and Tree for High-dimensional Data
The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs
A compile-time, header-only, dimensional analysis and unit conversion library built on C++14 with no dependencies.
A library for enabling task-based multi-threading. It allows execution of task graphs with arbitrary dependencies.
Probably the fastest coroutine lib in the world!
VUDA is a header-only library based on Vulkan that provides a CUDA Runtime API interface for writing GPU-accelerated applications.
C++14 asynchronous allocation aware futures (supporting then, exception handling, coroutines and connections)
Run TensorFlow models in C++ without installation and without Bazel
A lightweight high performance tensor algebra framework for modern C++
We make any object thread-safe and std::shared_mutex 10 times faster to achieve the speed of lock-free algorithms on >85% reads
A C++ Algorithmic Differentiation Package: Home Page