NOTE

At the moment, C2C examples require https://github.com/mnicely/cub.

Getting Started

These examples utilize the following toolsets:

cuFFT
cuFFTDx (Requires joining CUDA Math Library Early Access Program) https://developer.nvidia.com/CUDAMathLibraryEA
C++11

Hardware

Volta+

cuFFT_vs_cuFFTDx

This code runs three scenarios

cuFFT using cudaMalloc
cuFFT using cudaMallocManaged
cuFFTDx using cudaMalloc

Objectives

Compare coding styles between cuFFT, using cudaMalloc and cudaMallocManaged
Compare performance between cuFFT, using cudaMalloc and cudaMallocManaged
Compare performance and results between cuFFT and cuFFTDx

Execution

For float

make
./cuFFT_vs_cuFFTDx

For double

export USE_DOUBLE=1
make
./cuFFT_vs_cuFFTDx

To compare results (cuFFT and cuFFTDx are not expected to be exact)

export PRINT=1
make
./cuFFT_vs_cuFFTDx

Output

export PRINT=1
exportUSE_DOUBLE=1
make
./cuFFT_vs_cuFFTDx

FFT Size: 2048 -- Batch: 16384 -- FFT Per Block: 1 -- EPT: 16
cufftExecC2C - FFT/IFFT - Malloc        XX.XX ms
cufftExecC2C - FFT/IFFT - Managed       XX.XX ms

Compare results
All values match!

cufftExecC2C - FFT/IFFT - Dx            XX.XX ms

Compare results
All values match!

Notes

This code utilizes cuFFT Callbacks

https://devblogs.nvidia.com/cuda-pro-tip-use-cufft-callbacks-custom-data-processing/

This code utilizes separate compilation and linking

https://devblogs.nvidia.com/separate-compilation-linking-cuda-device-code/

Name		Name	Last commit message	Last commit date
Latest commit History 47 Commits
c2c_example		c2c_example
c2r_r2c_example		c2r_r2c_example
common		common
r2c_c2r_example		r2c_c2r_example
Makefile		Makefile
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NOTE

Getting Started

Hardware

cuFFT_vs_cuFFTDx

Objectives

Execution

Output

Notes

About

Releases

Packages

Languages

gp1322719830/cufft_examples

Folders and files

Latest commit

History

Repository files navigation

NOTE

Getting Started

Hardware

cuFFT_vs_cuFFTDx

Objectives

Execution

Output

Notes

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages