`prefix_sum` incorrect results with `gpu.warp.prefix_sum` and `gpu.block.prefix_sum`

BradLarson · May 7, 2025, 2:56pm

Huge thanks for the PR to fix this!

Random question: you mentioned over on the GPU MODE Discord that you were running this on V100, did you encounter any issues building Mojo code for that GPU? We only recently were able to lower the floor for GPU support to Turing (sm_75), so I’m surprised that this worked for you on Volta (sm_70). Did you have to hack anything in your Mojo standard library to get that to work for you?

Topic		Replies	Views
Mojo manual gpu basics exercise does not compile GPU Programming 25_3	7	132	April 2, 2025
GPU Programming Manual Community Showcase gpu , docs , modular-content	17	495	March 26, 2025
CUDA_ERROR_ILLEGAL_ADDRESS when running p19 solution of mojo-gpu-puzzles GPU Programming gpu_puzzle	1	25	July 20, 2025
GPU Puzzles P09 Shared memory indexing issue Standard Library gpu	2	77	June 27, 2025
Gpu-puzzles: initialization of shared_a in problem 11 General debugging	3	33	July 10, 2025

`prefix_sum` incorrect results with `gpu.warp.prefix_sum` and `gpu.block.prefix_sum`

Related topics