Modular has acquired BentoML! Ask us anything

  1. BentoML services scaling in Mammoth?
  2. BentoML SDK for Mojo? (for example, serve Mojo code, use Python client, but scale it with Ray?)
  3. BentoML and OpenLLM inference with MAX backend?
  4. BentoML and OpenLLM license changes?

Awesome :+1: :smiling_face_with_sunglasses: We're taking MAX engine models to industrial scale and will deploy them on BentoML. I appreciate all your answers to these questions; we've learned a lot about how BentoML works for industrial-scale deployment.

It was more like a tutorial event! :fire::grin:

Thanks, Chris!

As mentioned above, Mammoth won’t go extinct! :wink: Mammoth is a key part of our technology stack for large-scale distributed inference. It already works great with BentoML, and we’ll be tightening the integration. Our vision is a unified Modular Cloud product that makes it simple to scale and deploy your workloads with full control and performance.

We definitely plan to expand support for both Mojo and MAX within BentoML! We’re still determining exactly what that will look like, and we’re excited to share more concrete details as they develop.

The license for BentoML and OpenLLM won’t change. It will remain Apache 2.0.
