Using @F.functional decorator for forward method

Hi, while defining models with module_v3, I noticed that some forward methods are decorated with @F.functional. (example)

From some comments related to this, it seems that the decorator makes the decorated method atomic, which allows possible fusion of the operations executed inside it.

My questions are:

  1. Am I correct to understand its usage?
  2. If that’s right, does it only have an effect when using eager execution? In other words, does the graph compiler automatically handle those fusions during compilation when using the Graph API?

Hi! Great questions, we have some incoming documentation to help clarify as well :slight_smile: F.functional is probably doing too many things at once right now.

  1. Its core function is internal to the framework. It transforms functions that operate in the graph op space into functions that operate in the Tensor space. I’d hazard that zero external framework users need this.
  2. It marks a function as “atomic” for eager execution; that is, when you’re calling functions eagerly, that whole function will accumulate into a single graph, allowing graph optimizations like automatic fusion. This is occasionally useful to external users, but it’s almost always fine to not have it. We should probably split it into a different function :slight_smile: For instance, it’s really helpful on Linear.forward because this function is particularly performance-sensitive to fusion (hence why PyTorch has a custom kernel just for it, which we don’t need :smiley:)
  3. There’s some boring complexity we need to resolve with dimension algebra: you can’t do it outside a graph context today, so if you need to (say `Dim("x") + 1`), then it will need to be in an F.functional function for eager execution. We’ll fix this at some point, but it’s a pretty rare circumstance today.

does graph compiler automatically handle those fusions during compilation when using Graph API?

Yes, this will have no relevance when using Module.compile, or if you use any Tensor or functional methods with the Graph API!

It doesn’t exactly have no effect – it will still return Tensors instead of TensorValues, and the idea is that Tensors will be a bit nicer to work with even in graph-authoring mode; we’ll tailor the API to be more familiar to model authors.
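A rough mental model for the “still returns Tensors” point — again a toy sketch with invented names, not the real API — is a decorator that lifts a function written over low-level graph values into one that accepts and returns an author-facing Tensor wrapper:

```python
# Toy sketch: a "functional"-style decorator that converts a function
# over raw graph values (TensorValue stand-in) into one over a nicer
# Tensor wrapper, so model authors see Tensors in both modes.

class TensorValue:
    """Stand-in for a low-level graph value."""
    def __init__(self, data):
        self.data = data


class Tensor:
    """Author-facing wrapper around a TensorValue."""
    def __init__(self, value: TensorValue):
        self.value = value

    def __repr__(self):
        return f"Tensor({self.value.data})"


def functional(fn):
    """Lift fn from TensorValue space into Tensor space."""
    def wrapper(*tensors: Tensor) -> Tensor:
        raw = [t.value for t in tensors]  # unwrap into graph op space
        out = fn(*raw)                    # run the graph-level function
        return Tensor(out)                # wrap back for the author
    return wrapper


@functional
def add(a: TensorValue, b: TensorValue) -> TensorValue:
    return TensorValue(a.data + b.data)


x, y = Tensor(TensorValue(1)), Tensor(TensorValue(2))
z = add(x, y)
print(z)  # Tensor(3)
```

This is the “graph op space to Tensor space” transformation described above: the decorated function is written against raw values, but callers only ever touch the wrapper type.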


I agree that splitting it into a different function would be more intuitive for users. Thank you for the detailed explanation!