Law 19 · Scope & Design

Decompose Before You Scale

When it's unreliable, split it — don't supersize it.

The principle

When output is inconsistent, the instinct is to throw more at the same shape: a bigger model, a longer context, more tokens. That rarely fixes a structural problem — it just dilutes attention further. Splitting the task into focused, single-purpose passes almost always beats making one overloaded pass smarter.

Why it happens

When one pass is asked to do many things at once, the model must split a fixed attention budget across every sub-goal, so adding a bigger model or longer prompt often dilutes focus further instead of fixing the structural overload. Decomposing the task into focused single-purpose passes lets each step be prompted, examined, and optimized in isolation, which is why staged approaches consistently beat one heroic pass on multi-step work. Least-to-most prompting showed that solving easier sub-problems first and feeding their results forward generalizes far better than tackling the whole task in one shot, and decomposed prompting generalized this into a modular library of sub-task solvers that each step can call or further break down. The practical move is to analyze per item in a tight pass, then reconcile across items in a separate pass, rather than overloading a single call.

Watch for

A single pass handling many items is inconsistent, and a bigger model or longer prompt makes it blurrier, not sharper.
One call is responsible for several distinct sub-tasks at once.
Errors cluster on the hardest sub-step that is buried inside an overloaded prompt.

In practice

Your invoice extractor is inconsistent across 30-line documents, so you reach for a bigger model and a longer prompt, and it gets blurrier, not sharper, because one overloaded pass is splitting attention across every row. The instinct to supersize masks a structural problem. Split it instead: extract each line item in a focused per-item pass, then run a separate reconciliation pass to total and cross-check. Several stages that each do one thing well beat one heroic pass trying to do everything.

Apply it

Split the work into stages that each do one thing, like extract per item, then reconcile across items.
Solve simpler sub-problems first and feed their results into later steps rather than answering all at once.
Optimize and inspect each focused pass in isolation instead of supersizing one overloaded call.

The takeaway

Break the work into stages that each do one thing well — analyze per-item, then reconcile across items. A focused pass beats a heroic pass trying to do everything at once.

Sources and further reading

Read every law in the digital edition Back to all 50 laws

The principle

Why it happens

Watch for

Apply it

Sources and further reading

Related laws