Context Engineering Architecture

Overview

Context Engineering is the process of constructing the information provided to the language model at each reasoning step.

For an AI Dev Agent, the quality of context directly determines the quality of decisions the agent can make.

Unlike simple chat systems, an autonomous engineering agent must reason about:

large codebases
specifications
previous work
architectural decisions
repository state
task progress

Because modern language models have limited context windows, the system must carefully select what information is included.

Context Engineering is therefore one of the most critical components of the system.

Goals

The Context Engineering system is designed to achieve several goals.

Relevance

Only the most relevant information should be included in the context.

Irrelevant information increases token usage and decreases model performance.

Determinism

Context construction should follow predictable rules.

The agent should not rely entirely on the model to discover relevant information.

Scalability

The system must support large repositories and long-running development tasks.

Context must remain efficient even for large projects.

Explainability

It should be possible to understand why a particular piece of information was included in the context.

Context Layers

The agent constructs context using multiple layers of information.

+--------------------------------+
| System Instructions            |
+--------------------------------+
| Task Description               |
+--------------------------------+
| Active Specification           |
+--------------------------------+
| Relevant Code Context          |
+--------------------------------+
| Repository State               |
+--------------------------------+
| Memory Retrieval               |
+--------------------------------+
| Tool Results                   |
+--------------------------------+

Each layer provides a different type of information.

System Instructions

System instructions define the behavior and capabilities of the agent.

Examples include:

agent role
tool usage rules
coding standards
safety constraints

These instructions are typically static and always included in the context.

Example:

You are an autonomous software engineer.

You can interact with the repository using tools.

Always prefer modifying existing code rather than creating duplicate functionality.

Task Description

The task description defines the current objective.

Examples:

implement a feature
fix a bug
refactor a module
write tests

Example:

Task: Implement caching for the user profile service.

This information anchors the agent's reasoning.

Active Specification

The agent often works from a specification.

Examples:

design documents
feature specifications
API contracts

Example sources:

docs/specs/*
docs/architecture/*

Relevant sections of these documents are injected into the context.

This ensures that the implementation follows the intended design.

Relevant Code Context

Code context is one of the most important parts of the context.

However, including the entire repository is impossible.

The system must therefore select relevant code.

Methods include:

Symbol-Based Retrieval

When the agent is working on a symbol, the system retrieves:

function definition
class definition
related interfaces
imports

Example:

UserService.ts
UserRepository.ts
CacheClient.ts

Dependency Graph Retrieval

The system may retrieve code that depends on the current module.

This helps the agent understand side effects.

File Proximity

Nearby files are often relevant.

Example:

/services/user/*

These heuristics allow the agent to see the most relevant code.

Repository State

The agent must be aware of the repository state.

Important information includes:

git status
modified files
current branch
pending changes

Example:

Modified files:

* src/user/UserService.ts
* src/user/cache.ts

This prevents the agent from overwriting its own changes.

Memory Retrieval

The agent may retrieve knowledge from long-term memory.

Examples:

previous implementation decisions
debugging notes
architecture discussions
common solutions

Memory retrieval may use:

vector search
keyword search
metadata filters

Relevant memories are injected into the context.

Tool Results

The output of tool calls is also included in the context.

Examples:

file contents
command outputs
test results
error messages

Example:

Test failure:

Expected status 200
Received status 500

This allows the agent to observe the results of its actions.

Context Construction Pipeline

Context is constructed using a pipeline.

Task
│
▼
Context Planner
│
▼
Code Retrieval
│
▼
Memory Retrieval
│
▼
Context Assembly
│
▼
Token Budget Optimization
│
▼
LLM Input

Each step contributes information to the final context.

Context Planner

The Context Planner determines what information should be retrieved.

Inputs include:

task description
current step
previous tool results

The planner decides:

which files to load
which memories to retrieve
which specs to include

This reduces unnecessary data retrieval.

Token Budget Management

Language models have limited context windows.

The system must allocate tokens carefully.

Example budget:

Context Layer	Token Budget
system instructions	1000
task description	500
specification	2000
code context	4000
memory	1500
tool results	2000

The budget may vary depending on the model.

Context Compression

When context exceeds the token budget, compression techniques may be applied.

Examples include:

Summarization

Large documents may be summarized before inclusion.

Code Extraction

Only relevant functions or classes may be included.

Memory Filtering

Lower-priority memories may be excluded.

These techniques help maintain a manageable context size.

Iterative Context Expansion

Sometimes the agent may need additional information.

The system supports iterative expansion.

Example workflow:

Agent inspects function
↓
Agent needs dependency
↓
Agent retrieves additional file
↓
Context updated

This avoids loading too much information initially.

Context Caching

Some context components do not change frequently.

Examples:

system instructions
architecture documents
coding standards

These may be cached to reduce overhead.

Observability

Context construction should be observable.

Logs may include:

retrieved files
retrieved memories
token usage
compression operations

This helps developers understand agent behavior.

Failure Modes

Common context-related failures include:

Missing Code Context

The agent may modify code incorrectly if important dependencies are missing.

Excessive Context

Too much context may reduce reasoning quality.

Irrelevant Memory

Incorrect memory retrieval may confuse the agent.

The context engineering system must mitigate these risks.

Future Improvements

Possible improvements include:

learned context selection
adaptive token budgeting
dynamic code summarization
structural code embeddings

These techniques may further improve agent reasoning.

Summary

Context Engineering is a core component of the AI Dev Agent architecture.

It determines what information the language model receives at each reasoning step.

The system constructs context from multiple layers:

system instructions
task description
specifications
relevant code
repository state
memory
tool results

By carefully selecting and managing context, the system enables the agent to reason effectively about complex software systems.

Context Engineering Architecture ​

Overview ​

Goals ​

Relevance ​

Determinism ​

Scalability ​

Explainability ​

Context Layers ​

System Instructions ​

Task Description ​

Active Specification ​

Relevant Code Context ​

Symbol-Based Retrieval ​

Dependency Graph Retrieval ​

File Proximity ​

Repository State ​

Memory Retrieval ​

Tool Results ​

Context Construction Pipeline ​

Context Planner ​

Token Budget Management ​

Context Compression ​

Summarization ​

Code Extraction ​

Memory Filtering ​

Iterative Context Expansion ​

Context Caching ​

Observability ​

Failure Modes ​

Missing Code Context ​

Excessive Context ​

Irrelevant Memory ​

Future Improvements ​

Summary ​

Context Engineering Architecture

Overview

Goals

Relevance

Determinism

Scalability

Explainability

Context Layers

System Instructions

Task Description

Active Specification

Relevant Code Context

Symbol-Based Retrieval

Dependency Graph Retrieval

File Proximity

Repository State

Memory Retrieval

Tool Results

Context Construction Pipeline

Context Planner

Token Budget Management

Context Compression

Summarization

Code Extraction

Memory Filtering

Iterative Context Expansion

Context Caching

Observability

Failure Modes

Missing Code Context

Excessive Context

Irrelevant Memory

Future Improvements

Summary