MAX 25.1 is here, featuring MAX Builds, prefix caching, and more!

Modular · February 19, 2025, 12:50am

AI just got faster with MAX 25.1!

Prefix caching and paged attention boost LLM performance, offline batch inference improves latency & load time, and MAX Builds is your go-to hub for GenAI models, recipes, and packages.

Start building smarter today and check out the full release blog post: Modular: MAX 25.1 - Introducing MAX Builds

Don’t miss our upcoming livestream on LinkedIn to learn more about what makes 25.1 awesome: Introducing MAX 25.1 | LinkedIn

Topic		Replies	Views
About the MAX category MAX	0	80	December 4, 2024
MAX Nightly 25.5.0.dev2025072205 Released Nightly	0	34	July 22, 2025
MAX: A new GenAI-native inference solution for 2025 by Chris Lattner, CEO of Modular Community Showcase	1	122	August 5, 2025
Modular: MAX 25.2: Unleash the power of your H200's–without CUDA! Content blog	0	49	March 25, 2025
MAX Model Repository MAX	3	77	August 6, 2025

MAX 25.1 is here, featuring MAX Builds, prefix caching, and more!

Related topics