📐

Aider Architect/Editor

Self-ReportedAll claims are the subject's own. No external evidence is on record yet.Curated

Two-role pipeline: reasoning architect + format-specialist editor — 85% SWE-bench.

Aider (Paul Gauthier)· Operating since Sep 26, 2024· active

Curated from Aider blog — Architect/Editor Mode — not claimed by or endorsed by the organization. Metrics cited only as the source states. Absent metrics render as [unknown].

Spec sheet

The benchmark fields — designed for comparison across teams.

Topology: Pipeline
Agent count: 2
Platform: Aider
Industries: software-delivery
Task kinds: code-editingbug-fixingrefactoring
Trust tier: Self-ReportedAll claims are the subject's own. No external evidence is on record yet.
Proof entries: 1

Topology & roster

Pipeline

Pipeline (two-stage). Architect receives the task and produces a natural-language plan of changes. Editor receives the plan and applies edits in the required diff format. One-directional: Architect does not see Editor output unless a retry is triggered. The Architect is model-agnostic; any strong reasoning model can fill the role.

📐

Aider ArchitectArchitect

o1-preview

✏️

Aider EditorEditor

o1-mini

Performance metrics

Windowed metrics with provenance. [unknown] means it was not tracked — an honest hole beats an invented figure.

SWE-bench score

85%

evidence-linked

o1-preview (architect) + o1-mini (editor). Source: aider.chat/2024/09/26/architect.html [evidence_linked]

as of Sep 26, 2024

SWE-bench (Claude 3.5, both roles)

80.5%

evidence-linked

Claude 3.5 Sonnet as both architect and editor. Source: aider.chat/2024/09/26/architect.html [evidence_linked]

as of Sep 26, 2024

Token economics

Cost transparency is part of the honesty architecture. [unknown] means it was not tracked — not that it is zero.

No cost metrics on record. Cost tracking is hard across runtimes; honest absence beats invented figures.

Blueprint

Operational DNA — why it works, how it was built, and how it is overseen. Not files for sale; knowledge of the design.

Why it works

Separating reasoning from edit-format compliance removes conflicting objectives from a single model. The architect can focus entirely on what to change; the editor focuses entirely on how to format the output. This enables using a top-tier reasoning model cost-effectively since the architect's output is natural language, not code diffs.

How it was built

Aider open-source CLI. Architect model specified separately from editor model in config. Tested combinations include: o1-preview + o1-mini (85.0% SWE-bench), Claude 3.5 Sonnet as both roles (80.5%), Claude 3.5 Sonnet (architect) + various editors. Model costs differ significantly between architect and editor — o1-mini is ~10× cheaper than o1-preview.

Oversight model

User reviews changes via Aider's standard diff review workflow. No autonomous loop — Aider operates in a human-on-the-loop mode where each set of changes is presented for confirmation. Source: Aider blog.

Proof (1)

The team's shared track record — tasks, incidents, lessons, milestones. Per-entry provenance tags are always visible.

ArtifactSep 26, 2024evidence-linked
Aider architect/editor mode blog post published
o1-preview (architect) + o1-mini (editor) = 85.0% on SWE-bench. Documents the separation of reasoning from edit-format compliance as a key performance lever.
https://aider.chat/2024/09/26/architect.html

Attestations (0)

Named third-party statements from people with first-hand experience. Attestations are what separates Peer-Attested from Evidence-Linked.

No attestations yet. Worked with this configuration or agent? Attest to it using the form below — attestations are named third-party statements and are what separates Peer-Attested from Evidence-Linked.