Having issues with MAX' Matmul on default Google Colab GPU (T4)

TilliFe · June 15, 2025, 8:51am

I am getting an error when trying to build a MAX Graph with a matmul on a T4 GPU (which is the default in Google Colab)

LLVM ERROR: Cannot select: intrinsic %llvm.nvvm.ldmatrix.sync.aligned.m8n8.x4.b16

Other Graph Ops work just fine. The full Notebook which creates the error: Google Colab (before execution: select runtime → T4)

BradLarson · June 15, 2025, 3:51pm

Yes, we currently have Tensor Core support in the kernels for Ampere and newer NVIDIA GPUs, but Turing support was a new community addition and the Tensor Core operations haven’t been modified to extend support back to that architecture. Therefore, graphs that include matrix multiplication operations will fail on that architecture, but other GPU programming capabilities should still work.

Everything needed to extend support to Turing Tensor Cores should be present in the open source gpu module in the Mojo standard library, and we’d greatly appreciate any community contributions that could enable this.

Topic		Replies	Views
MAX Graph Python API built-in ops fail to compile for GPU - what's the correct pattern? MAX discussion , gpu	3	93	January 19, 2026
Learning MAX Graph API Through Working Examples Community Showcase	7	189	January 26, 2026
Modular: Matrix Multiplication on Blackwell: Part 2 - Using Hardware Features to Optimize Matmul Content blog	2	142	September 6, 2025
NVIDIA hardware support in MAX 24.6 MAX discussion , 24_6	12	493	December 18, 2024
Support for Turing Architecture? MAX	7	372	May 4, 2025

Having issues with MAX' Matmul on default Google Colab GPU (T4)

Related topics