A new nightly version has been released!
See the quickstart guide for installation instructions: Quickstart | Modular
MAX changelog updates:
- Changes to `max generate`:
  - Added `--top-k`, `--temperature`, and `--seed` flags.
  - Changed `--num-warmups` behavior. Previously, it would run the model
    on the prompt `N` times, generating until reaching a stop condition each
    time. Now, `--num-warmups` runs the model for `N` steps instead, generating
    `N` new tokens as a warmup.
- Added `generate()` to `TextGenerationPipeline` and
  `SpeculativeDecodingPipeline`, a convenience method for getting text
  generations. `generate_async` is available for getting streamed outputs.
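To make the `--num-warmups` change concrete, here is a minimal toy sketch of the difference in how much work a warmup does. It does not use the MAX API: `toy_next_token`, `warmup_old`, `warmup_new`, and `STOP_TOKEN` are hypothetical names invented for illustration, and the "model" is just a countdown that eventually hits a stop token.

```python
# Toy illustration only: none of these names come from MAX.
STOP_TOKEN = 0


def toy_next_token(tokens):
    """Hypothetical single decode step: count down toward STOP_TOKEN."""
    return max(tokens[-1] - 1, STOP_TOKEN)


def warmup_old(prompt, n):
    """Previous behavior: N full generations, each run until a stop condition.

    Returns the total number of decode steps executed.
    """
    steps = 0
    for _ in range(n):
        tokens = list(prompt)
        while tokens[-1] != STOP_TOKEN:
            tokens.append(toy_next_token(tokens))
            steps += 1
    return steps


def warmup_new(prompt, n):
    """New behavior: N decode steps in total, generating N new tokens.

    Returns the total number of decode steps executed.
    """
    tokens = list(prompt)
    for _ in range(n):
        tokens.append(toy_next_token(tokens))
    return len(tokens) - len(prompt)


if __name__ == "__main__":
    prompt = [5]  # this toy prompt needs 5 steps to reach STOP_TOKEN
    print(warmup_old(prompt, 3), warmup_new(prompt, 3))  # 15 3
```

In this sketch the old scheme's cost depends on how long each generation runs, while the new scheme's cost is fixed at `N` steps regardless of the prompt.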
Mojo changelog updates:
Raw MAX diff: https://github.com/modular/modular/compare/2ca4e563565cc24dc0891f78a1f159781b7ea150...e3fc382cd80729bc0ffe641a67bdd125d1ffce8e
Current Mojo changelog: https://github.com/modular/modular/blob/main/mojo/docs/changelog.md