Im looking for help on how to stream out data from Device to Host asynchronously. Specifically, every n iterations in my GPU kernel, I want to copy a vector from Decive to Host. I found DeviceStream, but that only seems to be intended for calling it from the Host.
This topic was automatically closed 180 days after the last reply. New replies are no longer allowed.