Spec: Velocity & Quality Roadmap (The "Engine First" Strategy)¶
Version: 2.0 — April 2026 Status: DRAFT Owner: Orchestrator Context: This roadmap pivots Agent Baton away from operational oversight and toward maximizing software development velocity and output quality. It focuses on the technical accelerators that turn the system into a compounding intelligence asset.
Wave 1: High-Throughput Parallelism¶
Goal: Enable multiple agents to work concurrently without friction or collisions.
1.1 Automated Git Worktree Isolation¶
- Problem: Sequential execution is the primary bottleneck. Parallel execution in a single tree causes index locks and file collisions.
- Requirement: Implement
WorktreeManagerto spin up isolated git worktrees for every parallel branch in aMachinePlan. - Velocity Impact: Enables 3-10× throughput by allowing true concurrent development across backend, frontend, and tests.
1.2 Declarative baton.yaml Workflow¶
- Problem: Manual CLI configuration is overhead.
- Requirement: A project-level config to define auto-routing, default quality gates, and agent preferences.
- Velocity Impact: Reduces task setup time from minutes to seconds.
Wave 2: Compounding Institutional Memory¶
Goal: Formalize the processes for Knowledge Graph and Context Profile creation so the system gets smarter with every commit.
2.1 Formalized Knowledge Graph Synthesis¶
- Problem: Beads are currently flat records. Relationships must be inferred manually.
- Process Requirement: Implement a
BeadSynthesizerthat runs post-phase. It must:- Infer Edges: Use semantic similarity and file-overlap to automatically link beads (e.g., "Feature A" extends "Architecture B").
- Cluster Discovery: Group related discoveries into "Knowledge Nodes."
- Conflict Detection: Automatically flag contradictory beads (e.g., two agents making different assumptions about an API).
- Quality Impact: Prevents regression and ensures all agents work from a unified, evolving understanding of the codebase.
2.2 Formalized Agent Context Profile Harvesting¶
- Problem: Agents start every task as "strangers" to the project.
- Process Requirement: Implement a
ContextHarvesterthat executes after every successfultask_completedevent.- Expertise Extraction: Summarize the specific patterns and files the agent mastered during the task.
- Strategy Recording: Note which implementation strategies worked (passed gates) vs. which failed.
- Persistent Storage: Store in an
agent_contexttable, indexed by(agent_name, domain).
- Velocity Impact: Eliminates "cold start" token waste. Agents arrive at the next task already knowing the project's conventions and pitfalls.
Wave 3: Precision & Verification¶
Goal: Tighten the loop between planning, execution, and quality.
3.1 Expected Outcome (Demo Statements)¶
- Problem: Passing tests != accomplishing the goal.
- Requirement: Require an
expected_outcomefor every plan step. This becomes the primary prompt for thecode-reviewerandtest-engineeragents. - Quality Impact: Shifts the focus from "no errors" to "behavioral correctness."
3.2 Automated Handoff Synthesis¶
- Problem: Context is lost during handoffs between agents.
- Requirement: Auto-generate a "Handoff Bead" that summarizes the
git diff, active discoveries, and remaining blockers from the previous agent. - Velocity Impact: Reduces the "re-orientation" phase for the next agent in the sequence.
Wave 4: Integration & Unblocking¶
Goal: Connect to external systems to automate the "last mile" of delivery.
4.1 CI-Driven Quality Gates¶
- Problem: Local gates can't always replicate production environments.
- Requirement: Direct integration with GitHub Actions/GitLab CI. The orchestrator dispatches the agent and then waits for the CI check to pass.
- Quality Impact: Guarantees that agent-produced code meets the exact same standards as human PRs.
Removed/Deferred (Governance & Overhead)¶
The following items from the previous roadmap have been removed to focus on velocity: - PMO Dashboard: Deferred (secondary to engine performance). - Cost Accounting: Removed (operational overhead). - Incident Response Tooling: Removed (replaced by higher-quality verification). - Chargeback Systems: Removed.