I have most of that set up, including a harness derived from formal verification tooling (TLA+) that acts as ground truth for externally visible behavior. As for context, I have a dataset that I convert into a RAG db for new models, which contains most of the information I personally reference when trying to write high performance code, as well as all of my notes from undergrad and research. It’s enough information that I think I could reasonably expect a human to learn to write high performance code from it.
It’s possible that I’m mixing multiple areas that are poorly represented in the training data (network drivers, a custom network stack, large-scale distributed systems, very tight integration with relatively new hardware, and taking advantage of hardware capabilities that most OSes don’t actually expose), combined with a few other things that make the problem trickier, and that’s what throws LLMs off. However, even getting LLMs to do something simple like handle endianness properly is like pulling teeth at times, because most examples of the task in the training data are actually incorrect, and I have a feeling “network stack development” doesn’t see as much fine-tuning from closed models as ReactJS does. I’ve even gone to the extent of fine-tuning my own models, which, despite running on consumer hardware, at least subjectively seem to perform much better, although the brute-force approach I use tends to ruin the models’ ability to write JS and Python code.
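To make the endianness complaint concrete, here’s a minimal sketch of the kind of thing that keeps going wrong. The names (`parse_len`, a 4-byte big-endian length field at offset 0) are illustrative, not from my actual stack; the point is that the correct version converts from network byte order explicitly, where the pointer-cast idiom models tend to emit is both endian-wrong on little-endian hosts and an unaligned-read hazard.

```rust
// Sketch: pulling a big-endian (network order) u32 length field out of
// a packet buffer. The field name and offset are made up for the example.
fn parse_len(buf: &[u8]) -> Option<u32> {
    // from_be_bytes does the network-to-host conversion explicitly.
    // The common incorrect pattern in training data is something like
    // `*(buf.as_ptr() as *const u32)`, which reads the field in host
    // (little-endian) order on x86 and is UB if the buffer is unaligned.
    let bytes: [u8; 4] = buf.get(0..4)?.try_into().ok()?;
    Some(u32::from_be_bytes(bytes))
}

fn main() {
    let wire = [0x00, 0x00, 0x00, 0x2a, 0xff]; // length field = 42 on the wire
    assert_eq!(parse_len(&wire), Some(42));
    assert_eq!(parse_len(&[0x00, 0x2a]), None); // too short: no panic, just None
}
```

The `Option` return is a design choice here so a truncated packet fails closed instead of panicking mid-datapath.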
This isn’t a Mojo-specific complaint; I find most models have problems writing Rust and C++ at what I consider “speed of the hardware” too. In particular, they really like not doing null pointer checks, and when they do null-check a buffer they almost never vectorize it.
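Reading that last point as a scan for null/zero bytes in a buffer (my interpretation, not a quote from the original), here’s a hedged sketch of the shape that autovectorizers handle well: a branch-free OR-reduction within fixed-size chunks, rather than the element-at-a-time early-exit loop models usually emit. Whether LLVM actually emits SIMD for it depends on target and optimization level.

```rust
// Sketch: checking a buffer for any zero byte in a form the LLVM
// autovectorizer can typically turn into SIMD compares. Chunking keeps
// early exit at chunk granularity while the inner fold stays branch-free.
fn contains_zero(buf: &[u8]) -> bool {
    buf.chunks(64)
        // (b == 0) as u8 is 1 for a zero byte; OR-reduce across the chunk.
        .any(|chunk| chunk.iter().fold(0u8, |acc, &b| acc | (b == 0) as u8) != 0)
}

fn main() {
    assert!(contains_zero(&[1, 2, 0, 3]));
    assert!(!contains_zero(&[1, 2, 3]));
    assert!(!contains_zero(&[])); // empty buffer: no chunks, no match
}
```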
I probably could fix a lot of these with LLMs, but I can write the correct implementation faster than I can prod the LLM into fixing it.