- Türkiye
-
00:55
(UTC +03:00)
Highlights
- Pro
Stars
Python code accompanying the course "A deep understanding of deep learning (with Python intro)"
A simple keylogger for Windows, Linux and Mac
Machine Learning Foundations: Linear Algebra, Calculus, Statistics & Computer Science
Official code for our CVPR '22 paper "Dataset Distillation by Matching Training Trajectories"
Kernel Fusion and Runtime Compilation Based on NNVM
This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, s…
A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.
Serve, optimize and scale PyTorch models in production
Optimization Of The Beluga Whale Optimization (BWO) Algorithm - Beluga Balinası Optimizasyonu (BWO) Algoritmasının Optimizasyonu
Optimized Parallel Tiled Approach to perform Matrix Multiplication by taking advantage of the lower latency, higher bandwidth shared memory within GPU thread blocks.
Optimize an example model with Python, CPP, and CUDA extensions and Ring-Allreduce.
These projects are part of exhaustive lessons on parallel computing algorithms and patterns on GPUs using CUDA.
YOLOv3 in PyTorch > ONNX > CoreML > TFLite
Yolo v3 object detection implemented in Tensorflow.
Basic implementation of ResNet 50, 101, 152 in PyTorch
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
Open deep learning compiler stack for cpu, gpu and specialized accelerators
Kubernetes community content
Tensors and Dynamic neural networks in Python with strong GPU acceleration
A resource for learning about Machine learning & Deep Learning
Samples for CUDA Developers which demonstrates features in CUDA Toolkit
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.