thiagopbueno

Stars

💹 GPU computing

16 repositories

An efficient C++17 GPU numerical computing library with Python-like syntax

C++ 1,262 93 Updated Feb 25, 2025
C++ 515 90 Updated Feb 25, 2025

ArrayFire: a general purpose GPU library.

C++ 4,631 538 Updated Feb 21, 2025

Several simple examples for popular neural network toolkits calling custom CUDA operators.

Python 1,404 195 Updated Apr 29, 2021

Productive, portable, and performant GPU programming in Python.

C++ 26,793 2,329 Updated Feb 25, 2025

Python 3.8+ toolbox for submitting jobs to Slurm

Python 1,381 131 Updated Sep 18, 2024
Python 42 8 Updated Nov 9, 2019

A curated list of awesome NVIDIA Isaac Gym frameworks, papers, software, and resources

833 65 Updated Feb 20, 2025

Read and write TensorFlow TFRecord data from Apache Spark.

Scala 292 56 Updated Apr 22, 2024

NVTabular is a feature engineering and preprocessing library for tabular data, designed to quickly and easily manipulate terabyte-scale datasets used to train deep-learning-based recommender systems.

Python 1,070 145 Updated Sep 3, 2024

A memory efficient DLRM training solution using ColossalAI

Python 103 14 Updated Nov 22, 2022

A high-performance, zero-overhead, extensible Python compiler with built-in NumPy support

Python 15,432 526 Updated Feb 22, 2025

CUDA Implementation of Parallel Matrix Factorization Algorithm for Recommender Systems

Cuda 13 1 Updated Apr 15, 2019

RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing …

Cuda 847 201 Updated Feb 25, 2025

NVIDIA Merlin is an open source library providing end-to-end GPU-accelerated recommender systems, from feature engineering and preprocessing to training deep learning models and running inference i…

Python 804 122 Updated Dec 5, 2024

Hummingbird compiles trained ML models into tensor computation for faster inference.

Python 3,391 279 Updated Jan 21, 2025