Optimize code performance when odd/even threads are doing different things in CUDA
- by Orion Nebula
Hi all!
I have two large vectors, I am trying to do some sort of element multiplication, where an even-numbered element in the first vector is multiplied by the next odd-numbered element in the second vector .... and where the odd-numbered element in the first vector is multiplied by the preceding even-numbered element in the second vector
Ex.
…