Skip to content

Discrete Cosine Transform optimized by using NVIDIA Tensorcores 🎶 🔛 📶

Notifications You must be signed in to change notification settings

jnfran92/tensorDCT

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

tensorDCT

Discrete Cosine Transform optimized by using NVIDIA Tensorcore

Content

Script comparing_dct.cpp:

Results:

    Comparing DCT tensorcores , cublas and cufft
    dim_y 3000   dim_x 3000
    cublas_dct
    cublas took[ms]: 18.1094
    cublas_idct
    cublas took[ms]: 18.3419
    fftw_dct
    fftw took[ms]: 507
    cufft_float_fft
    cufft took[ms]: 1.89949
    cufft_double_fft
    cufft took[ms]: 3.19274

About cufft_test folder

On Ubuntu, BLAS and LAPACK can be installed in one command:

	sudo apt-get install liblapack-dev -y ; sudo apt-get install liblapack3 -y ; sudo apt-get install libopenblas-base -y ; sudo apt-get install libopenblas-dev -y ;

Therefore, include library in the Makefile like:

	g++ ... -L/usr/lib -llapack -lblas

About

Discrete Cosine Transform optimized by using NVIDIA Tensorcores 🎶 🔛 📶

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published