This repo contains some CUDA kernels I'm writing for learning purposes, mostly about signal processing. To compile them, just use the makefile with
make all
(make sure nvcc is on your PATH and that you have a working CUDA install).
A CUDA implementation of the 2D Haar wavelet transform. I use it to compute the full-level decomposition of my webcam video feed with OpenCV (in color, with each channel treated separately). Thanks to GPU acceleration, it runs smoothly (see the screenshot).
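As a rough illustration of what a full-level 2D Haar decomposition can look like on the GPU, here is a minimal sketch. It is not the code from this repo: the names `haarLevel2D` and `haarFull2D` are hypothetical, and it assumes a single-channel square float image whose side is a power of two, with orthonormal scaling (factor 1/2 per 2x2 block). Error checking is left out for brevity.

```cuda
#include <cuda_runtime.h>

// One Haar level on the top-left n x n corner of a row-major image.
// Each thread reads one 2x2 block and writes one coefficient per quadrant.
__global__ void haarLevel2D(const float* in, float* out, int width, int n) {
    int x = blockIdx.x * blockDim.x + threadIdx.x;
    int y = blockIdx.y * blockDim.y + threadIdx.y;
    int half = n / 2;
    if (x >= half || y >= half) return;

    float a = in[(2 * y) * width + 2 * x];
    float b = in[(2 * y) * width + 2 * x + 1];
    float c = in[(2 * y + 1) * width + 2 * x];
    float d = in[(2 * y + 1) * width + 2 * x + 1];

    out[y * width + x]                 = 0.5f * (a + b + c + d); // approximation
    out[y * width + x + half]          = 0.5f * (a - b + c - d); // difference across columns
    out[(y + half) * width + x]        = 0.5f * (a + b - c - d); // difference across rows
    out[(y + half) * width + x + half] = 0.5f * (a - b - c + d); // diagonal difference
}

// Hypothetical host helper (assumption: `width` is a power of two).
// Repeats the level on the shrinking approximation quadrant, in place on `image`.
void haarFull2D(float* image, int width) {
    size_t bytes = (size_t)width * width * sizeof(float);
    size_t pitch = (size_t)width * sizeof(float);
    float *cur = nullptr, *next = nullptr;
    cudaMalloc(&cur, bytes);
    cudaMalloc(&next, bytes);
    cudaMemcpy(cur, image, bytes, cudaMemcpyHostToDevice);

    dim3 block(16, 16);
    for (int n = width; n >= 2; n /= 2) {
        int half = n / 2;
        dim3 grid((half + block.x - 1) / block.x, (half + block.y - 1) / block.y);
        haarLevel2D<<<grid, block>>>(cur, next, width, n);
        // Copy only the transformed n x n corner back, so the detail
        // coefficients from earlier levels stay untouched in `cur`.
        cudaMemcpy2D(cur, pitch, next, pitch, n * sizeof(float), n,
                     cudaMemcpyDeviceToDevice);
    }

    cudaMemcpy(image, cur, bytes, cudaMemcpyDeviceToHost);
    cudaFree(cur);
    cudaFree(next);
}
```

For a color frame, one option is to split the channels and call the helper once per channel. Separate input and output buffers are used because writing the quadrants in place would overwrite pixels that other threads still need to read.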
A parallel implementation of 2D convolution with CUDA, benchmarked against a CPU version. This task is highly parallel and the achieved speedup is significant (up to 50x with my settings). Image reading and display are done with OpenCV.
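To show why this task parallelizes so well, here is a minimal sketch of a naive convolution kernel in which every output pixel is computed by its own thread, independently of the others. It is an illustration under assumptions rather than the repo's kernel: the names `conv2DKernel` and `conv2D` are hypothetical, the image is a single-channel float array, the filter is square with an odd side, and pixels outside the image are treated as zero.

```cuda
#include <cuda_runtime.h>

// One thread per output pixel. The filter is flipped, so this is a true
// convolution (not a correlation). Out-of-range pixels count as zero.
__global__ void conv2DKernel(const float* in, const float* k, float* out,
                             int w, int h, int ksize) {
    int x = blockIdx.x * blockDim.x + threadIdx.x;
    int y = blockIdx.y * blockDim.y + threadIdx.y;
    if (x >= w || y >= h) return;

    int r = ksize / 2;
    float acc = 0.0f;
    for (int i = 0; i < ksize; ++i) {
        for (int j = 0; j < ksize; ++j) {
            int sy = y - i + r;
            int sx = x - j + r;
            if (sy >= 0 && sy < h && sx >= 0 && sx < w)
                acc += k[i * ksize + j] * in[sy * w + sx];
        }
    }
    out[y * w + x] = acc;
}

// Hypothetical host helper: copies the data to the GPU, runs the kernel,
// and copies the result back. Error checking is omitted for brevity.
void conv2D(const float* in, const float* k, float* out,
            int w, int h, int ksize) {
    size_t imgBytes = (size_t)w * h * sizeof(float);
    size_t kBytes = (size_t)ksize * ksize * sizeof(float);
    float *dIn = nullptr, *dK = nullptr, *dOut = nullptr;
    cudaMalloc(&dIn, imgBytes);
    cudaMalloc(&dK, kBytes);
    cudaMalloc(&dOut, imgBytes);
    cudaMemcpy(dIn, in, imgBytes, cudaMemcpyHostToDevice);
    cudaMemcpy(dK, k, kBytes, cudaMemcpyHostToDevice);

    dim3 block(16, 16);
    dim3 grid((w + block.x - 1) / block.x, (h + block.y - 1) / block.y);
    conv2DKernel<<<grid, block>>>(dIn, dK, dOut, w, h, ksize);

    cudaMemcpy(out, dOut, imgBytes, cudaMemcpyDeviceToHost);
    cudaFree(dIn);
    cudaFree(dK);
    cudaFree(dOut);
}
```

A common refinement, not shown here, is to keep the filter in constant memory and stage image tiles in shared memory so that neighboring threads do not reread the same pixels from global memory.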
A CUDA implementation of the Discrete Wavelet Transform with Daubechies-4 wavelets. Since this DWT algorithm is recursive, the speedup is smaller: 4x with my settings.
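The recursion can be sketched as follows: each level filters only the approximation produced by the previous level, so the levels have to run one after another and each one has half as much parallel work as the one before. The code below is a minimal 1D sketch under assumptions, not the repo's implementation: the names `d4Level` and `dwtD4Full` are hypothetical, the signal length is a power of two, the boundary is handled periodically, and the recursion stops when fewer than 4 samples remain.

```cuda
#include <cuda_runtime.h>

// Daubechies-4 low-pass filter taps (orthonormal scaling).
__constant__ float H[4] = {0.4829629131f, 0.8365163037f,
                           0.2241438680f, -0.1294095226f};

// One D4 level on the first n samples, with periodic wrap-around.
// Thread i writes approximation i to out[i] and detail i to out[i + n/2].
__global__ void d4Level(const float* in, float* out, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    int half = n / 2;
    if (i >= half) return;

    float s0 = in[(2 * i) % n];
    float s1 = in[(2 * i + 1) % n];
    float s2 = in[(2 * i + 2) % n];
    float s3 = in[(2 * i + 3) % n];

    // Low-pass output, then high-pass with taps (h3, -h2, h1, -h0).
    out[i]        = H[0] * s0 + H[1] * s1 + H[2] * s2 + H[3] * s3;
    out[i + half] = H[3] * s0 - H[2] * s1 + H[1] * s2 - H[0] * s3;
}

// Hypothetical host helper (assumption: n is a power of two, n >= 4).
// Levels run sequentially because each one consumes the previous approximation.
void dwtD4Full(float* signal, int n) {
    size_t bytes = (size_t)n * sizeof(float);
    float *cur = nullptr, *next = nullptr;
    cudaMalloc(&cur, bytes);
    cudaMalloc(&next, bytes);
    cudaMemcpy(cur, signal, bytes, cudaMemcpyHostToDevice);

    const int threads = 256;
    for (int len = n; len >= 4; len /= 2) {
        int half = len / 2;
        int blocks = (half + threads - 1) / threads;
        d4Level<<<blocks, threads>>>(cur, next, len);
        // Copy back only the first len samples; earlier details are kept.
        cudaMemcpy(cur, next, (size_t)len * sizeof(float),
                   cudaMemcpyDeviceToDevice);
    }

    cudaMemcpy(signal, cur, bytes, cudaMemcpyDeviceToHost);
    cudaFree(cur);
    cudaFree(next);
}
```

With this layout the last levels launch only a handful of threads, which is one plausible reason the overall speedup stays modest compared with the convolution.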