A more important question would be, how are you going to move those addresses back to host memory? CUDA has a copy-memory-between-device-and-host function, but I am not so sure about OpenCL - at any rate, calling such a function for each address would be very slow, and I advise you to fill up a block of device memory with address bytes before copying it to the host RAM.
@n0nce
You can't directly collect entropy from OpenCL's device memory. That requires some *SSL call followed by a copy-from-host-to-device call.
Thank you I will check them out.
A more important question would be, how are you going to move those addresses back to host memory? CUDA has a copy-memory-between-device-and-host function, but I am not so sure about OpenCL - at any rate, calling such a function for each address would be very slow, and I advise you to fill up a block of device memory with address bytes before copying it to the host RAM.
@n0nce
You can't directly collect entropy from OpenCL's device memory. That requires some *SSL call followed by a copy-from-host-to-device call.
I am working with Rust but for example sake, on PyOpenCL I have previously used "enqueue_copy" to copy a GPU Buffer back to host Memory, you are correct however that to make the program more streamlined and faster I should really fill this buffer before pulling anything back to the host otherwise I will be wasting precious time.