Law 10 · Reasoning & Planning

More Thinking Can Hurt

Extra reasoning past the answer is wasted — or a wrong turn.

The principle

Reasoning models 'overthink': they pour disproportionate effort into trivial problems for minimal gain, and on harder ones, extended deliberation can talk them out of a correct initial answer. Reasoning depth has a sweet spot, not a monotonic payoff. An agent grinding tokens on a simple lookup burns latency and money; one that keeps re-deriving can reason its way to the wrong conclusion.

Why it happens

Reasoning depth has a sweet spot rather than a monotonic payoff because extended deliberation can revisit and overturn a correct initial answer, and the marginal token stops adding information once the answer is settled. Apple's Illusion of Thinking (2025) made the non-monotonic shape concrete: reasoning models increase their thinking effort with problem complexity up to a threshold, then counterintuitively reduce effort right as accuracy collapses, even with token budget to spare, and on simple problems they often find the right answer early then overthink their way to a worse one. The cost is two-sided: on trivial lookups the extra deliberation is pure latency and money for no gain, and on harder ones re-deriving can talk the model out of a right answer. The fix is to match the reasoning budget to difficulty, cap thinking on easy paths, and stop once a confident answer is in hand instead of letting the model wander.

Watch for

Trivial lookups take seconds and cost multiples because everything is routed through extended reasoning.
The model reaches a correct answer early, keeps deliberating, and lands on a wrong one.
Longer thinking traces show no accuracy gain, or even a drop, on your easy cases.

In practice

You route every query through extended reasoning to be safe, and your 'what is the order status' lookups now take 8 seconds and cost 4x while occasionally talking themselves out of the correct status field. Reasoning has a sweet spot, not a monotonic payoff: trivial lookups get burned latency for nothing, and over-deliberation can overturn a right first answer. Match the thinking budget to difficulty, cap it on easy paths, and stop the moment you have a confident answer instead of letting it wander.

Apply it

Match the reasoning budget to problem difficulty rather than maxing it out everywhere.
Cap or skip extended thinking on simple, low-stakes steps like direct lookups.
Stop once a confident answer is reached instead of letting the model keep re-deriving.

The takeaway

Match reasoning budget to problem difficulty. Cap thinking on easy steps, and stop once you have a confident answer instead of letting the model wander.

Sources and further reading

Read every law in the digital edition Back to all 50 laws

The principle

Why it happens

Watch for

Apply it

Sources and further reading

Related laws