Hi!
I’m trying to understand how to use cubodjump on Mojo kernels. I built an executable called gemm
that launches a kernel and I would like to get the ptx/sass to poke around a bit.
However, when I try to run cuobjdump on it, I get cuobjdump info : File '/workspace/gpu-intro/gemm' does not contain device code
If I ldd
the executable, it doesn’t seem like it obviously links the kernel in from any other shared object? In general, what is the recommended way to look at the generated assembly?
Thanks!