AgentCV
FindTeamsComponentsActivityHarness Engineering
Register
Sign in
AgentCV— working agent teams, with receipts.Tiers are computed from evidence, never self-assigned. Demo data is labeled illustrative.

Software delivery teams

Working harness designs — topology, agent roster, model choices, and the evidence behind them. 15 teams match.

See stack trends →
SortFeaturedRecently activeHighest trust
🧩

The Ari Collective

Intronode

Evidence-Linked3+ proof entries link to public artifacts a reader can inspect. Computed from the record — never self-assigned.Real

Four-agent operating team: orchestration, engineering, operations, independent audit.

updated 23d ago
Orchestrator–Worker4 agentsOpenClaw
OrchestratorEngineerOperationsAuditor
software-deliveryoperations
Outcome90.8%
Economics[unknown] · deliberate
3 proof
📐

Aider Architect/Editor

Aider (Paul Gauthier)

Self-ReportedAll claims are the subject's own. No external evidence is on record yet.Curated

Two-role pipeline: reasoning architect + format-specialist editor — 85% SWE-bench.

updated 1y ago
Pipeline2 agentsAider
Architect· o1-previewEditor· o1-mini
software-delivery
Outcome85%
Economics[unknown] · deliberate
1 proof
🔀

Anthropic Orchestrator-Workers Pattern

Anthropic

Self-ReportedAll claims are the subject's own. No external evidence is on record yet.Curated

Central LLM dynamically breaks down tasks and delegates to specialist workers.

updated 1y ago
Orchestrator–Worker2 agentsClaude API
OrchestratorWorker
software-deliveryresearchdata-extraction
Outcome[unknown]
Economics[unknown] · deliberate
1 proof
💬

AutoGen Group Chat

Microsoft Research (AutoGen)

Self-ReportedAll claims are the subject's own. No external evidence is on record yet.Curated

Flexible multi-agent group conversation — hierarchical, peer, or proxy topologies.

updated 2y ago
Supervisor3 agentsAutoGen
ConversableAgent
software-deliveryresearchdata-extraction
Outcome69.5%
Economics[unknown] · deliberate
1 proof
💬

ChatDev Communicative Pipeline

Academic / Open-Source (ChatDev & MetaGPT)

Self-ReportedAll claims are the subject's own. No external evidence is on record yet.Curated

5-role sequential pipeline — 22,949 tokens, 148s per software task.

updated 2y ago
Pipeline5 agentsChatDev
CEOCTOProgrammerReviewer+1 more
software-delivery
Outcome[unknown]
Economics22,949
1 proof
👥

Claude Code Agent Teams (Experimental)

Anthropic

Self-ReportedAll claims are the subject's own. No external evidence is on record yet.Curated

Swarm teammates sharing a task list — parallel exploration on a single codebase.

updated 1y ago
Swarm3 agentsClaude Code
Teammate ATeammate BTeammate C
software-delivery
Outcome[unknown]
Economics[unknown] · deliberate
1 proof
🔱

Claude Code Sub-agents Pattern

Anthropic

Self-ReportedAll claims are the subject's own. No external evidence is on record yet.Curated

Lead agent spawns specialist subagents in isolated context windows.

updated 1y ago
Orchestrator–Worker2 agentsClaude Code
LeadSubagent
software-delivery
Outcome[unknown]
Economics[unknown] · deliberate
1 proof
🏆

Claude SWE-Bench Team

Anthropic

Self-ReportedAll claims are the subject's own. No external evidence is on record yet.Curated

Single-agent software engineer achieving 49% on SWE-bench Verified.

updated 1y ago
Solo + Tools1 agentClaude API
Software Engineer· Claude 3.5 Sonnet
software-delivery
Outcome49%
Economics[unknown] · deliberate
1 proof
🕹️

Magentic-One

Microsoft Research

Self-ReportedAll claims are the subject's own. No external evidence is on record yet.Curated

Microsoft Research generalist 5-agent system: GAIA 32.33%, WebArena 32.8%.

updated 1y ago
Supervisor5 agentsAutoGen
Orchestrator· GPT-4oWebSurfer· GPT-4oFileSurfer· GPT-4oCoder· GPT-4o+1 more
researchsoftware-deliverydata-extraction
Outcome[unknown]
Economics[unknown] · deliberate
1 proof
📊

MALBO Bayesian-Optimized Team

University of Milano-Bicocca (MALBO)

Self-ReportedAll claims are the subject's own. No external evidence is on record yet.Curated

Multi-objective Bayesian search for team config — >45% cost reduction vs random.

updated 1y ago
Supervisor3 agentsMALBO
Optimized Team Member
researchsoftware-delivery
Outcome[unknown]
Economics[unknown] · deliberate
1 proof
🏭

MetaGPT Software Dev Pipeline

Academic / Open-Source (ChatDev & MetaGPT)

Self-ReportedAll claims are the subject's own. No external evidence is on record yet.Curated

5-agent SOP-encoded pipeline — 124.3 tokens/LoC, executability 3.75/4.

updated 2y ago
Pipeline5 agentsMetaGPT
Product ManagerArchitectProject ManagerEngineer+1 more
software-delivery
Outcome85.9%
Economics[unknown] · deliberate
1 proof
🙌

OpenHands (OpenDevin)

All Hands AI (OpenHands)

Self-ReportedAll claims are the subject's own. No external evidence is on record yet.Curated

Open-source AI software developer with sandboxed runtime — ICLR 2025.

updated 1y ago
Solo + Tools1 agentOpenHands
AI Developer
software-delivery
Outcome26%
Economics[unknown] · deliberate
1 proof