Gemma 4 is live on Modular Cloud with day zero support and the fastest performance on both NVIDIA and AMD

Gemma 4 is live on Modular Cloud with day zero support and the fastest performance on both NVIDIA and AMD. MAX delivers 15% higher throughput vs. vLLM on B200, and we’re the only inference provider shipping Gemma 4 on a framework we built ourselves. Learn more.