
Automation Workflow

Overview

This document describes how the Autonomous Engineer system divides responsibilities between the User and the AI-driven automation. The goal is to eliminate routine development work — Users invest effort only at the start and end; the AI handles everything in between.

User touchpoints: initial context preparation + final PR review.

AI-automated: branch creation, spec generation, review loops, approvals, implementation, commits, and PR creation.


Full Workflow Diagram

(Diagram not reproduced in this text version; the Workflow Phase Reference below lists every phase in order.)

User Responsibilities

Before automation starts

| Step | Action |
| ---- | ------ |
| 1 | Organize prerequisite information in docs/ |
| 2 | Run /kiro:spec-init "description" to create the spec directory |
| 3 | Edit the Project Description (Input) section in requirements.md with sufficient context for the AI |

The quality of the generated spec depends directly on how well step 3 is filled in. The AI cannot infer intent that isn't written down.

Future idea: Add a pre-design validation step that checks whether requirements.md contains sufficient information before proceeding to /kiro:spec-design.

After automation completes

| Step | Action |
| ---- | ------ |
| 4 | Review the pull request created by the automation |

All intermediate phases (requirements, design, tasks, implementation) are approved by the AI without User intervention. The PR is the single User review gate.


Branch Naming

Before any spec work begins, the automation creates a dedicated feature branch. The default pattern is:

```text
feature/spec-{spec-name}
```

The branch naming rule is configurable. The automation must:

  1. Detect the current branch
  2. Refuse to proceed if already on main or master
  3. Check if the target feature branch already exists
  4. If the branch exists, interactively confirm with the User before continuing on it
  5. Otherwise create and check out the feature branch
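
The five steps above can be sketched as follows. This is an illustrative Python sketch, not the actual automation code: the function names (`branch_name`, `ensure_feature_branch`) are hypothetical, and the interactive confirmation is simplified to a plain prompt.

```python
import subprocess

def branch_name(spec_name: str, pattern: str = "feature/spec-{spec-name}") -> str:
    """Render the configurable branch-naming pattern for a given spec name."""
    return pattern.replace("{spec-name}", spec_name)

def ensure_feature_branch(spec_name: str) -> str:
    """Detect the current branch, guard against main/master, and switch/create."""
    current = subprocess.run(                                 # step 1: detect
        ["git", "rev-parse", "--abbrev-ref", "HEAD"],
        capture_output=True, text=True, check=True,
    ).stdout.strip()
    if current in ("main", "master"):                         # step 2: refuse
        raise RuntimeError(f"refusing to proceed on {current}")
    target = branch_name(spec_name)
    exists = subprocess.run(                                  # step 3: exists?
        ["git", "show-ref", "--verify", "--quiet", f"refs/heads/{target}"],
    ).returncode == 0
    if exists:                                                # step 4: confirm
        if input(f"Branch {target} exists - continue on it? [y/N] ").lower() != "y":
            raise RuntimeError("aborted by user")
        subprocess.run(["git", "switch", target], check=True)
    else:                                                     # step 5: create
        subprocess.run(["git", "switch", "-c", target], check=True)
    return target
```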

Review Loop Pattern

All three spec phases (requirements, design, tasks) use the same review-and-fix loop: generate, review, fix, approve, capture learnings, then /clear.

Key rules:

  • The AI always resolves to a concrete action — it never just reports problems without fixing them
  • When multiple fix options exist, the AI selects the most appropriate one given the system architecture and technology
  • The loop has a configurable maximum iteration count (suggested default: 2)
  • After the loop, the AI writes approved: true to spec.json for the corresponding phase
  • Before /clear, the AI captures accumulated learnings to persistent resources (see Knowledge Capture)
  • /clear is executed after each phase approval to prevent context from carrying over into the next phase
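
The loop can be summarized in a few lines of Python. This is a hedged sketch of the control flow only; `review` and `fix` stand in for the AI review and repair steps, and `spec` stands in for the parsed spec.json.

```python
from typing import Callable

def review_loop(phase: str, review: Callable[[], list], fix: Callable[[list], None],
                spec: dict, max_iterations: int = 2) -> dict:
    """Run the review-and-fix loop for one phase, then record approval."""
    for _ in range(max_iterations):
        issues = review()
        if not issues:
            break
        fix(issues)   # always resolve to a concrete action, never just report
    # After the loop, write approved: true for this phase
    spec.setdefault("approvals", {})[phase] = {"generated": True, "approved": True}
    return spec
```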

Validate Gap (Optional)

After requirements are approved and before design begins, /kiro:validate-gap can be run to analyze the gap between the new feature requirements and the existing codebase:

  • Identifies reusable components already present in the codebase
  • Detects missing functionality that must be newly implemented
  • Maps integration points where the new feature connects to existing modules
  • Flags areas requiring new implementation so the design phase starts with full context

Typical position in the flow:

```text
spec-requirements → validate-gap (optional) → spec-design → spec-tasks → spec-impl
```

This step is most valuable when working in an existing codebase. It prevents the design from duplicating existing work or missing integration constraints that are only visible from the current code.


Implementation Loop

After all spec artifacts are approved, committed, and context is cleared, the implementation loop begins. For each task group:

  1. /kiro:spec-impl {task-group} — agent implements the specified tasks
  2. AI review & fix — automated review against design doc and requirements; issues are fixed inline
  3. Commit — changes are committed with a descriptive message
  4. Knowledge capture — AI persists accumulated insights before context reset (see Knowledge Capture)
  5. /clear — context is cleared to prevent cross-task pollution
  6. Repeat until all tasks are complete
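
The six-step loop above can be expressed as a small driver. This is an illustrative sketch with all actions injected as callables (`impl`, `review_fix`, `commit`, `capture`, `clear` are hypothetical names, not real APIs):

```python
def run_implementation(task_groups, impl, review_fix, commit, capture, clear):
    """Drive the per-task-group implementation loop; repeats until all groups are done."""
    for group in task_groups:
        impl(group)        # 1. /kiro:spec-impl {task-group}
        review_fix(group)  # 2. AI review & fix against design doc and requirements
        commit(group)      # 3. commit with a descriptive message
        capture(group)     # 4. persist insights before context reset
        clear()            # 5. /clear to prevent cross-task pollution
```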

Task Batching: (P) Marker

Tasks in tasks.md marked with (P) are safe to batch into a single spec-impl call:

```text
/kiro:spec-impl tool-system 3.1,3.2,3.3
```

See cc-sdd Parallel Task Analysis for the full rules on when a task qualifies for (P).
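
A batching pass over tasks.md might look like the sketch below. The checklist format (`- [ ] 3.1 Description (P)`) is an assumption about the file layout, and `batch_parallel_tasks` is a hypothetical helper; consecutive (P)-marked tasks are grouped into one batch, everything else gets its own call.

```python
def batch_parallel_tasks(lines):
    """Group task IDs so consecutive (P)-marked tasks share one spec-impl call."""
    batches, current = [], []
    for line in lines:
        line = line.strip()
        if not line.startswith("- ["):          # skip non-task lines
            continue
        task_id = line.split("]", 1)[1].split()[0]   # e.g. "3.1"
        if "(P)" in line:
            current.append(task_id)             # extend the running batch
        else:
            if current:
                batches.append(current)
                current = []
            batches.append([task_id])           # sequential task: its own batch
    if current:
        batches.append(current)
    return batches
```

Each resulting batch then maps to a single call such as `/kiro:spec-impl tool-system 3.1,3.2`.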


Knowledge Capture Before Context Reset

Before every /clear, the AI must persist any accumulated insights to prevent knowledge loss across context resets. This is a required step — not optional.

What to capture:

  • Search queries or investigation paths that took multiple attempts to resolve
  • Ambiguous requirements or design decisions where the reasoning matters for future phases
  • Reusable patterns, conventions, or gotchas discovered during the phase
  • Architectural tradeoffs that influenced implementation choices

Where to write:

| Resource | Path | Use for |
| -------- | ---- | ------- |
| Steering docs | .kiro/steering/ | Project-specific patterns, tech stack insights, architectural decisions |
| Rules | .claude/rules/ | Workflow rules, code conventions, recurring process fixes |
| Skills | .claude/commands/ | Reusable prompt patterns that emerged during the phase |

Key constraint: only capture insights that are generalizable across future phases or sessions — not task-specific state that won't recur.

This mechanism ensures each new context window inherits the accumulated intelligence of all prior phases, counteracting the knowledge loss that /clear would otherwise cause.


Approval Mechanism

Phase approvals are written to spec.json by the automation — no manual edits required:

```json
{
  "approvals": {
    "requirements": { "generated": true, "approved": true },
    "design":       { "generated": true, "approved": true },
    "tasks":        { "generated": true, "approved": true }
  },
  "ready_for_implementation": true
}
```

The ready_for_implementation flag is set to true once all three phases are approved, enabling the implementation loop to begin.
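
In code, the derivation of that flag is straightforward. The sketch below assumes `spec` is the parsed spec.json; `approve_phase` is an illustrative helper, not the actual implementation:

```python
PHASES = ("requirements", "design", "tasks")

def approve_phase(spec: dict, phase: str) -> dict:
    """Mark one phase approved and derive the ready_for_implementation flag."""
    approvals = spec.setdefault("approvals", {})
    approvals[phase] = {"generated": True, "approved": True}
    # True only once all three phases carry approved: true
    spec["ready_for_implementation"] = all(
        approvals.get(p, {}).get("approved", False) for p in PHASES
    )
    return spec
```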


Workflow Phase Reference

Each phase in the workflow has a defined executor, execution behavior, and purpose. The table below maps every phase from the canonical flow to its role in the pipeline.

| Phase | Executor | Execution stops? | Description |
| ----- | -------- | ---------------- | ----------- |
| SPEC_INIT | LLM (slash command) | No | Creates the spec directory structure and initial spec.json. Generates the scaffold that subsequent phases populate. |
| HUMAN_INTERACTION | Human | Yes — process halts here | The workflow pauses so the user can review the generated scaffold and write the Project Description section in requirements.md. This is the only point where human input is required before the automated pipeline runs. Re-running the command after editing resumes automatically. |
| VALIDATE_PREREQUISITES | LLM (prompt) | No | Checks that all inputs and prerequisites for the spec phase are in place before proceeding (e.g. steering docs loaded, spec directory valid). |
| SPEC_REQUIREMENTS | LLM (slash command) | No | Generates a comprehensive requirements.md from the Project Description and project context. |
| VALIDATE_REQUIREMENTS | LLM (prompt) | No | Reviews the generated requirements for completeness, consistency, and alignment with steering. Issues are fixed inline; spec.json is updated to requirements.approved = true on success. |
| REFLECT_ON_EXISTING_INFORMATION | LLM (prompt) | No | Surveys the existing codebase and steering to identify relevant patterns, conventions, and constraints that should inform the next phase. Output is fed forward as context. |
| VALIDATE_GAP | LLM (slash command, optional) | No | Analyzes the gap between the requirements and the current codebase: identifies reusable components, missing functionality, and integration points. Can be skipped for greenfield work. |
| CLEAR_CONTEXT | System (/clear) | No | Resets the LLM context window. Required between phases to prevent token accumulation and reasoning degradation. Accumulated learnings are captured to steering/rules before each clear. |
| SPEC_DESIGN | LLM (slash command) | No | Produces design.md — the technical design including architecture, data models, API contracts, and component interactions. |
| VALIDATE_DESIGN | LLM (slash command, optional) | No | Reviews the design for technical correctness, alignment with requirements, and adherence to project conventions. Issues are fixed inline; spec.json updated to design.approved = true. |
| SPEC_TASKS | LLM (slash command) | No | Breaks the design down into discrete, ordered implementation tasks in tasks.md. Tasks may be marked (P) to indicate they can be batched in a single spec-impl call. |
| VALIDATE_TASK | LLM (prompt) | No | Reviews the task list for completeness, correct ordering, and traceability to design decisions. spec.json updated to tasks.approved = true. |
| SPEC_IMPL | LLM (slash command) | No | Implements the specified task group following TDD: write tests first, then implement to pass. |
| VALIDATE_IMPL | LLM (prompt) | No | Reviews the implementation against requirements, design, and tasks. Fixes issues inline. |
| COMMIT | System (git) | No | Commits the implementation with a descriptive message. |
| PULL_REQUEST | System (git) | Yes — final human review | Creates a pull request from the feature branch to main. This is the second and final human review point — the user reviews the PR. |

Execution stop points

The workflow contains exactly two points where execution stops and waits for a human:

  1. HUMAN_INTERACTION — stops after SPEC_INIT so the user can fill in the Project Description before the automated pipeline starts.
  2. PULL_REQUEST — stops after all automation is complete so the user can review the output.

All other phases run automatically without requiring any human input or approval.
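
The phase sequence and its two stop points can be encoded as plain data. This sketch simplifies the real flow (it omits the repeated CLEAR_CONTEXT steps between phases), and `next_phase` is a hypothetical helper:

```python
from typing import Optional

PHASES = [
    "SPEC_INIT", "HUMAN_INTERACTION", "VALIDATE_PREREQUISITES",
    "SPEC_REQUIREMENTS", "VALIDATE_REQUIREMENTS",
    "REFLECT_ON_EXISTING_INFORMATION", "VALIDATE_GAP", "CLEAR_CONTEXT",
    "SPEC_DESIGN", "VALIDATE_DESIGN", "SPEC_TASKS", "VALIDATE_TASK",
    "SPEC_IMPL", "VALIDATE_IMPL", "COMMIT", "PULL_REQUEST",
]

# Exactly two phases wait for a human.
STOP_POINTS = {"HUMAN_INTERACTION", "PULL_REQUEST"}

def next_phase(current: str) -> Optional[str]:
    """Return the phase after `current`, or None at the end of the pipeline."""
    i = PHASES.index(current)
    return PHASES[i + 1] if i + 1 < len(PHASES) else None
```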


Workflow State & Resume

The orchestrator persists workflow state to disk after each phase. This enables crash recovery and controlled resumption without restarting from scratch.

State file location: .aes/state/<spec-name>.json

Behavior on re-run

| Condition | Behavior |
| --------- | -------- |
| No state file | Start from the beginning (SPEC_INIT) |
| State file exists | Resume from the recorded phase |

The state is restored automatically on every run — no flags or manual steps are required.

HUMAN_INTERACTION

HUMAN_INTERACTION is the first pause point in the workflow. It follows SPEC_INIT and is designed to stop execution so the user can review the initial spec output and provide input (e.g. editing requirements.md) before the automated pipeline runs.

Behavior:

  1. First run — SPEC_INIT executes, then the workflow pauses at HUMAN_INTERACTION and saves state.
  2. User action — inspect the generated artifacts, make any edits needed.
  3. Re-run — the saved state is restored; the workflow automatically advances past HUMAN_INTERACTION and continues with the remaining phases.

No manual edits to spec.json are needed to resume. Simply re-running the command is sufficient.

All subsequent approval gates (requirements, design, tasks) do require explicit approval in spec.json before the workflow will advance.
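
The pause-and-resume behavior can be sketched as a small driver loop. This is an illustrative simplification: `execute` stands in for running one phase, `pipeline` is an ordered phase list, and the state dict holds only the last completed (or paused-at) phase.

```python
def run(state: dict, pipeline: list, execute) -> dict:
    """Execute phases in order; pause at HUMAN_INTERACTION, resume past it on re-run."""
    i = pipeline.index(state.get("phase", pipeline[0]))
    if pipeline[i] == "HUMAN_INTERACTION":
        i += 1                       # re-run: advance past the pause automatically
    for phase in pipeline[i:]:
        if phase == "HUMAN_INTERACTION":
            state["phase"] = phase   # pause: save state and wait for the user
            return state
        execute(phase)
        state["phase"] = phase
    return state
```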
