Launching Mojo kernels on a specific CUDA/HIP stream?

bwibking · March 10, 2026, 11:33pm

Hi,

I am writing a set of Mojo bindings for a CUDA/HIP library that expects applications to launch their kernels on a given stream, because the library may launch dependent kernels on the same stream.

Is there a way to specify the CUDA/HIP stream on which to launch a Mojo kernel? If not, we can work around this, but it requires doing a device synchronize before/after the Mojo kernel launch, so it’s not performance-optimal.

Thanks,

Ben

bwibking · March 12, 2026, 5:36pm

This is not possible in Mojo today. I have created a proposal to add this feature here: Proposal: Allow importing CUDA/HIP stream handles in DeviceContext.

Topic		Replies	Views
Proposal: Allow importing CUDA/HIP stream handles in DeviceContext Standard Library feature-request	3	112	April 11, 2026
Async Streaming from Device to Host Mojo discussion	0	77	July 2, 2025
Launch_bounds support for GPU code Mojo gpu	4	122	February 18, 2026
CDNA2 / MI250X support? GPU Programming	2	72	March 8, 2026
[Hackathon] Experiment: CUDA Kernel → Mojo in Bitnet Community Showcase modular-hack-weekend	0	169	June 29, 2025

Launching Mojo kernels on a specific CUDA/HIP stream?

Related topics