Cuda program results are always zero in HW, correct in EMU??

Posted by Orion Nebula on Stack Overflow See other posts from Stack Overflow or by Orion Nebula
Published on 2010-05-20T23:29:04Z Indexed on 2010/05/20 23:30 UTC
Read the original article Hit count: 243

Filed under:
|
|
|
|

Hi all!

I am having a weird problem .. I have written a CUDA code which executes correctly in emulation and all results show up.. however, when executed on hardware "G210" .. the results in the result memory are always 0

I am passing two vectors to the kernel, one with random variables the other is initialized to zero, the code copies the first vector to shared memory, does some swapping and other operations and then writes back the results on the second vector (the one with the initial 0's)

I am using double precision, the -arch sm13 flag is used, all memory allocation also use sizeof(double) ..

I have checked if the kernel is invoked, it does .. so no problems here .. the cudaMemCpy has no problems ..

what could be the problem .. :( why would it work in emulation but not on HW

I am quite confused .. any ideas?

© Stack Overflow or respective owner

Related posts about cuda

Related posts about result