Planning and Reflection: How AI Breaks Down and Revises Work

AI, De-Mystified · Article 8

Most AI assistants answer one prompt at a time. When a task is larger than a single answer, the system needs to know what to do first, next, and when something goes wrong. That is where planning and reflection come in.

Point C1 Planning breaks a goal into ordered steps before action; reflection checks results against the goal and decides whether to revise the plan.

Plain English Meaning

Planning is answering “What should I do, and in what order?” before doing it. A plan could be a simple list or a branching set of options: if the first search fails, try the second query.

Reflection is answering “Did that work?” after doing something. It compares the result to the goal and decides whether to continue, back up, or change the plan.

Together they make an agent less like a calculator and more like a worker who writes a to-do list, does the first item, checks the result, and adjusts the rest.

Existing Concept It Resembles

Planning and reflection are not new. They appear in older fields and everyday life:

Project management breaks work into tasks with dependencies and milestones, then reviews progress.
The scientific method designs an experiment, runs it, examines the outcome, and revises the hypothesis.
Means-ends analysis in classical AI chooses actions that reduce the distance to a goal.
After-action reviews ask what happened and how to do better next time.

Point C2 Planning and reflection are rooted in project management, scientific method, and classical AI search, not only in recent language models.

What Is Actually New?

What changed is that a language model can write, read, and rewrite its own plan in natural language. Older planners often needed formal rules and handcrafted domain descriptions. A modern agent can be told to plan a trip, search for hotels, notice a conflict, and revise the itinerary.

Reflection has also changed. Instead of an external evaluator for every step, the model can critique its own draft and generate a revised version.

The flexibility is real, but so is the risk. A plan written in prose can look convincing without being correct, and a self-critique can be confidently wrong.

From Reflection to Harnesses

A newer pattern goes beyond asking the model to critique its own output. It asks the model to design a harness — a structured routine of checks, tools, and guardrails — for a recurring weakness it notices in itself. The loop has three stages:

Mine a weakness: run the model on examples and collect failures.
Propose a harness: write a small test, tool call, or verification rule that would catch that failure.
Validate the harness: check that the proposed routine actually helps on held-out examples and does not hurt overall performance.

This turns reflection into a durable engineering artifact rather than a one-time self-critique. The harness can be reused across similar tasks, reviewed by humans, and improved when new failures appear.

Point C5 A self-harness pattern can turn one-off reflection into reusable verification routines that are proposed by the model and validated against held-out examples.

How It Works In Practice

Here are three common patterns.

1. Plan-then-execute:

generate plan → do step 1 → do step 2 → do step 3 → return result

The agent writes a plan up front and follows it.

2. Plan-revise loop:

plan → act → observe → reflect → revise plan → act again

The agent starts with a plan, takes one step, checks what happened, and updates the plan. Common in coding and research agents.

3. Chain-of-thought reasoning:

question → reason step by step → answer

Here the “plan” is reasoning produced alongside the answer. Reflection may appear as a final check: “Review the reasoning above and correct any mistakes before giving the final answer.”

Point C3 In practice, AI planning and reflection appear as upfront plans, iterative plan-revise loops, and step-by-step reasoning with final verification.

Where It Helps

Planning helps when a task is too large to fit in one shot. Breaking a report into outline, research, draft, and review makes each step manageable. A coding agent that runs tests after each edit catches bugs early.

Both also help humans collaborate. A visible plan gives a human something to approve or stop, and a reflection step creates a checkpoint.

Where It Fails

Planning without enough information: The agent commits to a plan before discovering that a key assumption is wrong. It might outline a feature based on an API that does not exist.

Reflection without external grounding: The agent checks its own work using the same model that produced the work. It can confirm its own mistakes because the model’s errors are often consistent across attempts.

Over-planning: The agent spends so long refining the plan that it never starts the work, or the plan becomes longer than the work itself.

Reflection theater: The agent produces a critique that sounds thorough but does not actually lead to a better result.

Point C4 Reflection in AI agents is most reliable when paired with external checks such as tests, retrieved sources, or human review; self-critique alone can confirm rather than catch errors.

Academic Connections

Task decomposition studies how complex problems can be split into subtasks.
Metacognition in psychology is thinking about one’s own thinking; in agents, it maps to self-evaluation.
Self-refinement refers to systems that generate output, critique it, and generate an improved version.
Search in classical AI explores action sequences to reach a goal.

The practical challenge remains: make a plan, act, check the result, and decide what to do next.

Practical Checklist

When you use or build a planning-and-reflection agent, ask:

What is the plan before the first action? Is it visible?
What does the agent observe after each step? Is it reliable?
How does the agent decide whether to continue, revise, or stop?
What external checks exist? Tests? Sources? Human review?
What happens when the plan is wrong? Can it backtrack?
Is reflection producing real improvements, or just plausible commentary?

The De-Hype Check

Old name for this idea: task planning, project plans, after-action reviews, means-ends analysis, self-critique.
What is genuinely new: language models can generate, read, and revise plans in natural language without hand-coded domain rules.
What gets exaggerated: “Agents can plan anything and fix their own mistakes.” Plans are only as good as the information behind them, and self-reflection can amplify the same blind spots that caused the original error.
Who benefits from the hype: Vendors selling fully autonomous agents and consultants promising hands-off automation. The truth is more modest: planning and reflection extend what models can do, but they still need clear goals, reliable observations, and human oversight.

Open Questions

When should an agent plan everything up front versus plan one step at a time?
How can an agent recognize that its own reflection is unreliable?
What is the cheapest way to add external grounding to a reflection step?
How do we prevent planning from becoming a form of procrastination that delays real work?
When should a human approve the plan versus approve the final result?

Article guide Important points and sources 5 points Show guide Hide guide

C001 core · high Planning breaks a goal into ordered steps before action; reflection checks results against the goal and decides whether to revise the plan.
C002 landscape · high Planning and reflection are rooted in project management, scientific method, and classical AI search, not only in recent language models.
C003 design · medium-high In practice, AI planning and reflection appear as upfront plans, iterative plan-revise loops, and step-by-step reasoning with final verification.
C004 risk · medium Reflection in AI agents is most reliable when paired with external checks such as tests, retrieved sources, or human review; self-critique alone can confirm rather than catch errors.
C005 design · medium A self-harness pattern can turn one-off reflection into reusable verification routines that are proposed by the model and validated against held-out examples.

Sources Sources used 5 sources Show sources Hide sources

Look closer

Sources and notes

Open details Close details

These notes collect the sources, counterpoints, and review status behind the article's important points. Read the essay first; open this when you want to check something.

Confidence reflects how strongly the sources support the point (low / medium / high). Status describes the point's role (e.g., core, argument, landscape). Sources link to supporting material; counterpoints note boundary conditions or conflicting findings.

C001 high core

Planning breaks a goal into ordered steps before action; reflection checks results against the goal and decides whether to revise the plan.

Sources (1): “ReAct generates a high-level plan of reasoning and action steps, then uses observations to decide the next action.”
ReAct: Synergizing Reasoning and Acting in Language Models direct
Counterpoints (1): Some agent designs interleave planning and action so tightly that the distinction between plan and reflection becomes a design choice rather than a fixed property.

C002 high landscape

Planning and reflection are rooted in project management, scientific method, and classical AI search, not only in recent language models.

Sources (1): “Step-by-step reasoning in large language models resembles the human practice of breaking a problem into intermediate stages before producing a final answer.”
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models background
Counterpoints (1): Earlier planning systems often required formal symbolic descriptions, whereas modern agents can plan from natural-language prompts with ambiguous constraints.

C003 medium-high design

In practice, AI planning and reflection appear as upfront plans, iterative plan-revise loops, and step-by-step reasoning with final verification.

Sources (2): “Self-Refine iteratively refines outputs using feedback generated by the model itself, forming a generate-feedback-refine loop.”
Self-Refine: Iterative Refinement with Self-Feedback direct

“Reflexion equips agents with a reflective text memory of past failures to improve decision-making in subsequent trials.”
Reflexion: Language Agents with Verbal Reinforcement Learning direct
Counterpoints (1): Many simple tasks are better solved by a single well-crafted prompt than by adding planning and reflection overhead.

C004 medium risk

Reflection in AI agents is most reliable when paired with external checks such as tests, retrieved sources, or human review; self-critique alone can confirm rather than catch errors.

Sources (1): “Self-refinement improves performance more on tasks where the model can reliably detect its own errors than on tasks where the model shares the same blind spot across generation and critique.”
Self-Refine: Iterative Refinement with Self-Feedback indirect
Counterpoints (1): Some studies show that self-consistency checks and majority voting can catch errors even without external tools, especially on well-defined reasoning tasks.

C005 medium design

A self-harness pattern can turn one-off reflection into reusable verification routines that are proposed by the model and validated against held-out examples.

Sources (1): “Self-Harness mines weaknesses in an LLM, proposes harnesses to mitigate them, and validates each harness on held-out data to ensure it improves reliability without degrading overall performance.”
Self-Harness: Mining Weaknesses to Generate Test-Time Harnesses direct
Counterpoints (1): If the validation data share the model's blind spots, a harness can appear effective while reinforcing the same errors.

Review recordHow this was madeShow detailsHide details

Created 2026-06-29 by human. Policy: policy:default v1.0.0.

✓ Approved hash matches current article

Agent runs

draftingkimi2026-06-29in:00000000…out:d3dce937…
reviewkimi2026-06-29in:00000000…out:d3dce937…

Reviews

agentapproved2026-06-29
Scope: claims, tone, privacy, scope
contentHash: d3dce937b302fd9f…
Sibling-agent review against article-proposal-ideation eval-card. Privacy scan passed. No proprietary or personal content detected.
humanapproved2026-06-29
Scope: thesis, examples, tone, safety
contentHash: d3dce937b302fd9f…
Human author approved the draft for publication.

Machine-readable files

The same points, sources, and relationships are also available as structured files for agents and tools. The JSON follows the publication record schema.

JSON file Brief (Markdown)