Launching Mojo kernels on a specific CUDA/HIP stream?

This is not possible in Mojo today. I have created a proposal to add this feature here: Proposal: Allow importing CUDA/HIP stream handles in DeviceContext.