- São Paulo, Brazil
💹 GPU computing
An efficient C++17 GPU numerical computing library with Python-like syntax
Several simple examples for popular neural network toolkits calling custom CUDA operators.
Productive, portable, and performant GPU programming in Python.
Python 3.8+ toolbox for submitting jobs to Slurm
A curated list of awesome NVIDIA Isaac Gym frameworks, papers, software, and resources
Read and write TensorFlow TFRecord data from Apache Spark.
NVTabular is a feature engineering and preprocessing library for tabular data, designed to quickly and easily manipulate terabyte-scale datasets used to train deep-learning-based recommender systems.
A memory-efficient DLRM training solution using ColossalAI
A high-performance, zero-overhead, extensible Python compiler with built-in NumPy support
CUDA Implementation of Parallel Matrix Factorization Algorithm for Recommender Systems
RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing …
NVIDIA Merlin is an open source library providing end-to-end GPU-accelerated recommender systems, from feature engineering and preprocessing to training deep learning models and running inference i…
Hummingbird compiles trained ML models into tensor computation for faster inference.