A new nightly version has been released!
See the quickstart guide for installation instructions: Quickstart | Modular
MAX changelog updates:
- Enabled overlap scheduling by default for select model architectures, such as
`LlamaForCausalLM_Legacy`. This optimization reduces CPU overhead by
overlapping Python host code with GPU kernel execution. It is currently
incompatible with some features, such as structured outputs and CPU models.
This feature is very experimental! You can forcibly disable it via
`--no-enable-overlap-scheduler --force`.
Mojo changelog updates:
- [KGEN][oss][max][GC][Kernels] Rename TrivialRegisterType to TrivialRegisterPassable.
- [KGEN][oss][max][GC][Kernels] Rename RegisterType to RegisterPassable.
Raw MAX diff: https://github.com/modular/modular/compare/a968ccc177c8d1b2caca8b4f4dd62cecb75cd726...140159fe1966f226083dc0640fee70422b561083
Current Mojo changelog: https://github.com/modular/modular/blob/main/mojo/docs/changelog.md