MAX Nightly 26.3.0.dev2026031822 Released

Modular · March 18, 2026, 11:43pm

A new nightly version has been released!

See the quickstart guide for installation instructions.

MAX changelog updates:

Fixed slow axis=None reductions (mean, sum, prod, max, min) in
max.experimental.functional. The previous implementation flattened the
tensor before reducing, serializing the work onto a single GPU block.
Reductions now iterate axis-by-axis to preserve parallelism.

Added experimental max.experimental.distributed module with DTensor,
DeviceMesh, and placement types (Replicated, Sharded, Partial) for
expressing how tensors are distributed across multiple devices. Op dispatch
is not yet supported.

max/python/max/benchmark/benchmark_throughput.py has been deprecated and
will be removed in a future MAX release.

Added GPU kernel examples from the Programming Massively Parallel Processors
(PMPP) textbook covering reductions, scans, histograms, sorting, sparse
matrix operations, graph algorithms, convolutions, FlashAttention, and more.
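
To illustrate the axis=None reduction fix above, here is a minimal pure-Python sketch of the two strategies. It does not use MAX or measure performance; the helper names (reduce_last_axis, reduce_all_axis_by_axis, reduce_all_flattened) are made up for this example, and nested lists stand in for tensors. It only shows that reducing one axis at a time gives the same result as flattening first, while each per-axis step consists of many independent row reductions that a GPU could in principle spread across blocks.

```python
import operator
from functools import reduce


def flatten(tensor):
    """Flatten a nested-list tensor into a single 1-D list."""
    if not isinstance(tensor, list):
        return [tensor]
    return [x for sub in tensor for x in flatten(sub)]


def reduce_all_flattened(tensor, op):
    """Flatten first, then run one long sequential reduction."""
    return reduce(op, flatten(tensor))


def reduce_last_axis(tensor, op):
    """Reduce only the innermost axis; every row is an independent job."""
    if not isinstance(tensor[0], list):
        return reduce(op, tensor)
    return [reduce_last_axis(sub, op) for sub in tensor]


def reduce_all_axis_by_axis(tensor, op):
    """Full reduction performed one axis at a time until a scalar remains."""
    while isinstance(tensor, list):
        tensor = reduce_last_axis(tensor, op)
    return tensor


# A 2 x 2 x 2 example tensor with 8 elements.
tensor = [[[1, 2], [3, 4]], [[5, 6], [7, 8]]]

for name, op in [("sum", operator.add), ("prod", operator.mul),
                 ("max", max), ("min", min)]:
    flat = reduce_all_flattened(tensor, op)
    stepwise = reduce_all_axis_by_axis(tensor, op)
    print(name, flat, stepwise)

# For a rectangular tensor, the mean is the total sum divided by the
# element count, so it can reuse the same axis-by-axis sum.
mean = reduce_all_axis_by_axis(tensor, operator.add) / len(flatten(tensor))
print("mean", mean)
```

Both strategies agree for associative operations such as these; the difference the changelog describes is in how the work can be scheduled, not in the result.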
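
For readers new to the placement vocabulary in the distributed item above, the following is a conceptual sketch only. The three classes borrow the names Replicated, Sharded, and Partial from the changelog, but they are hypothetical stand-ins written for this example and are not the max.experimental.distributed API, whose signatures are not shown in this post. The distribute and materialize helpers are likewise assumptions, and plain lists stand in for per-device tensors.

```python
from dataclasses import dataclass


# Hypothetical stand-ins named after the changelog's placement types.
# They are NOT the max.experimental.distributed classes.
@dataclass(frozen=True)
class Replicated:
    """Every device holds a full copy of the data."""


@dataclass(frozen=True)
class Sharded:
    """Each device holds one contiguous slice of the data."""


@dataclass(frozen=True)
class Partial:
    """Each device holds a summand; the element-wise sum is the true value."""


def distribute(values, num_devices, placement):
    """Return one local list per device for a 1-D list of numbers."""
    if isinstance(placement, Replicated):
        return [list(values) for _ in range(num_devices)]
    if isinstance(placement, Sharded):
        if len(values) % num_devices != 0:
            raise ValueError("length must divide evenly across devices")
        step = len(values) // num_devices
        return [list(values[i * step:(i + 1) * step])
                for i in range(num_devices)]
    if isinstance(placement, Partial):
        # One valid decomposition: device 0 holds the data, the rest zeros.
        zeros = [0] * len(values)
        return [list(values)] + [list(zeros) for _ in range(num_devices - 1)]
    raise TypeError(f"unknown placement: {placement!r}")


def materialize(local_values, placement):
    """Recover the logical 1-D data from the per-device lists."""
    if isinstance(placement, Replicated):
        return list(local_values[0])
    if isinstance(placement, Sharded):
        return [x for shard in local_values for x in shard]
    if isinstance(placement, Partial):
        return [sum(column) for column in zip(*local_values)]
    raise TypeError(f"unknown placement: {placement!r}")


data = [1, 2, 3, 4]
for placement in (Replicated(), Sharded(), Partial()):
    local = distribute(data, 2, placement)
    print(placement, local, materialize(local, placement))
```

In this toy model the placement is the only thing that tells you how to interpret the per-device data: the same local lists mean different logical tensors under Replicated and Partial.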

Mojo changelog updates:
