Laws of AI Agents

01

Law of Context Decay

Agents fail at context, not reasoning.

Context & Reliability

02

Compounding Error Law

Reliability multiplies, it doesn't add.

Context & Reliability

03

Position Is Power

Models read the edges; the middle gets lost.

Context & Reliability

04

The Model Optimizes for Looking Done

Agents declare victory early.

Context & Reliability

05

Design for the Worst Case

Plan around the ceiling, not the average.

Context & Reliability

06

Think Before You Touch

Spend reasoning tokens before you spend actions.

Reasoning & Planning

07

Don't Bet on One Chain

Sample many reasoning paths and let them vote.

Reasoning & Planning

08

Branch When the First Step Matters

For decisions you can't take back, explore before you commit.

Reasoning & Planning

09

Stop Tuning, Start Scaling

General methods plus compute beat your clever scaffolding.

Reasoning & Planning

10

More Thinking Can Hurt

Extra reasoning past the answer is wasted — or a wrong turn.

Reasoning & Planning

11

Retrieval Is the Ceiling

Your answer can only be as good as what you retrieved.

Retrieval & Memory

12

Grounding Is Not a Guarantee

Retrieval reduces hallucination; it does not eliminate it.

Retrieval & Memory

13

Relevant Beats Plenty

Near-misses poison context worse than random noise.

Retrieval & Memory

14

Keyword Still Carries Weight

Pure semantic search quietly loses to a 40-year-old baseline.

Retrieval & Memory

15

Memory Is a System, Not a Window

Give the agent a hierarchy, not just a bigger prompt.

Retrieval & Memory

16

Narrow Beats General

Three sharp tools beat thirty dull ones.

Scope & Design

17

Determinism at the Edges

Model in the middle, code at the boundaries.

Scope & Design

18

Observability Precedes Autonomy

You can't grant autonomy you can't trace.

Scope & Design

19

Decompose Before You Scale

When it's unreliable, split it — don't supersize it.

Scope & Design

20

The Cheapest Fix First

Reach for the prompt before the platform.

Scope & Design

21

The Tool Description Is the Prompt

An agent is only as capable as its tools are legible.

Instruction & Output

22

Show, Don't Tell

When prose fails, stop writing prose.

Instruction & Output

23

Confidence Is Not Calibrated

A model's certainty is not evidence.

Instruction & Output

24

Surface Ambiguity, Don't Resolve It

When the data is unclear, don't guess confidently.

Instruction & Output

25

Averages Lie

97% overall can hide a 60% segment.

Instruction & Output

26

Vibes Don't Scale

Eyeballing outputs feels like progress until you can't tell if a change helped.

Evaluation & Measurement

27

Look at Your Data

The highest-ROI activity in AI is the one teams skip first.

Evaluation & Measurement

28

The Judge Is Biased

An LLM grader reacts to length and position, not just substance.

Evaluation & Measurement

29

Goodhart's Trap

When your eval becomes the goal, it stops measuring what you cared about.

Evaluation & Measurement

30

Regress or Repeat

Every fixed bug is a future regression unless it becomes a test.

Evaluation & Measurement

31

The Lethal Trifecta

Private data, untrusted content, and an exfiltration path — pick at most two.

Safety & Security

32

Tokens Don't Wear Badges

The model can't tell your instructions from the attacker's — they're all just tokens.

Safety & Security

33

The Confused Deputy

An agent with your privileges will wield them on an attacker's behalf.

Safety & Security

34

Quarantine Untrusted Tokens

Let the privileged planner orchestrate, but never let it read the poison.

Safety & Security

35

Sandbox the Blast Radius

Assume the agent gets compromised — then contain what it can reach.

Safety & Security

36

Don't Build an Agent When a Workflow Will Do

Agents buy flexibility with latency, cost, and unpredictability.

Architecture & Operations

37

Cascade Before You Escalate

Try the cheap model first; only the hard cases deserve the expensive one.

Architecture & Operations

38

The Multi-Agent Tax

Every extra agent multiplies your token bill — make sure the task can pay it.

Architecture & Operations

39

Your Architecture Mirrors Your Org Chart

Ship a system shaped like your teams — so design the teams first.

Architecture & Operations

40

Retries Demand Idempotency

If an action can run twice, a retry will eventually run it twice.

Architecture & Operations

41

Trip the Breaker

Stop calling the thing that's already failing.

Architecture & Operations

42

The Ironies of Automation

The more you automate, the harder the leftover human job becomes.

Humans & Autonomy

43

Automation Bias

People will trust the machine over their own eyes.

Humans & Autonomy

44

Match the Level to the Stakes

Full autonomy is a setting, not a default.

Humans & Autonomy

45

Mind the Mode

Most automation surprises start with 'what mode is it in?'

Humans & Autonomy

46

The Handoff Is the Hard Part

In multi-agent systems, failures live in the seams.

Trust & Coordination

47

Trust Is Calibrated, Not Granted

Autonomy is earned in proportion to track record.

Trust & Coordination

48

The Escape Hatch Law

No clean exit means a fabricated one.

Trust & Coordination

49

Don't Let the Author Be the Judge

The thing that made it shouldn't grade it.

Trust & Coordination

50

Preserve Provenance

Don't lose where a fact came from.

Trust & Coordination

Every law, in full — with a diagram for each.

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle