I'm trying to figure out how to pass a matrix from MATLAB to a CUDA C++ kernel (compiled to a *.ptx file). I want every thread to process the matrix differently. I definitely don't want to split the matrix up across threads; each thread should get the same, complete matrix.
I've tried something like:
__global__ void radialAverage(int* image[][]) {
...
}
but it didn't work. I am getting the following error:
kernel.cu(8): error: an array may not have elements of this type
1 error detected in the compilation of "C:/Users/ADMINI~1/AppData/Loca/Temp/tmpxft_00000c44_00000000-8_kernel.cpp1.ii".
Can you think of any way to do this? Is it even possible?
Btw: I'm not using any C++ libraries, only the CUDA API.
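
One possible direction, as a minimal sketch rather than a verified solution: C++ only lets the first dimension of an array parameter be left unspecified, which is why `image[][]` is rejected. The sketch below instead takes the matrix as a flat pointer plus its dimensions and indexes it by hand. It assumes the matrix arrives as a flat, column-major buffer of 32-bit integers (column-major is how MATLAB lays out matrices in memory). The helper `ringAverage`, the parameters `rows`, `cols`, `out` and `numRadii`, and the idea that each thread averages one ring of pixels around the center are all hypothetical and only stand in for "each thread processes the same matrix differently".

```cpp
// Let this file also compile with a plain C++ compiler for host-side testing;
// under nvcc these qualifiers are already defined.
#ifndef __CUDACC__
#define __host__
#define __device__
#endif

// Hypothetical per-thread work: average all pixels whose squared distance d2
// from the matrix center satisfies radius^2 <= d2 < (radius + 1)^2.
// `image` is a flat column-major buffer: element (row, col) is at
// image[row + col * rows].
__host__ __device__ inline float ringAverage(const int* image, int rows, int cols,
                                             int radius) {
    const int centerRow = rows / 2;
    const int centerCol = cols / 2;
    long long sum = 0;
    int count = 0;
    for (int col = 0; col < cols; ++col) {
        for (int row = 0; row < rows; ++row) {
            const int dr = row - centerRow;
            const int dc = col - centerCol;
            const int d2 = dr * dr + dc * dc;
            if (d2 >= radius * radius && d2 < (radius + 1) * (radius + 1)) {
                sum += image[row + col * rows];
                ++count;
            }
        }
    }
    // Empty rings yield 0 instead of dividing by zero.
    return count > 0 ? static_cast<float>(sum) / count : 0.0f;
}

#ifdef __CUDACC__
// Every thread receives the same pointer, so every thread sees the whole
// matrix; only the radius it works on differs. `out` must hold at least
// numRadii floats.
__global__ void radialAverage(const int* image, int rows, int cols,
                              float* out, int numRadii) {
    const int radius = blockIdx.x * blockDim.x + threadIdx.x;
    if (radius < numRadii) {
        out[radius] = ringAverage(image, rows, cols, radius);
    }
}
#endif
```

Because all threads only read from `image`, sharing one pointer between them needs no synchronization; each thread writes to its own slot in `out`.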