Files
Triple-C/container/mission-control/.claude/skills/agentic-workflow/SKILL.md
Josh Knapp 7840bddbb4 Sync bundled mission-control to upstream 15fbc94
Pulls in 15 upstream commits since the April 3 bundling snapshot
(msieurthenardier/mission-control). Notable changes:

- agentic-workflow rewritten as the "fast" variant: per-leg design and
  implement, single review and commit across the whole flight
- New Skill-Project Boundary section: skills no longer read or write
  project-owned artifacts by literal heading
- routine-maintenance scoped to post-mission only; adds state-machine
  reachability and cache freshness audits
- Test metrics capture threaded through debrief, maintenance, and flight
- Crew prompts no longer carry skill-required instructions; SKILL.md is
  the protocol
- Worktree git strategy removed; standardized on {target-project}
- Jira artifact template removed upstream

Local URL correction in init-project/README.md preserved
(anthropics/flight-control -> msieurthenardier/mission-control).

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-05-18 08:10:32 -07:00

10 KiB

name, description
name description
agentic-workflow Active orchestrator for multi-agent flight execution. Drives leg design per leg, then batches implementation across all autonomous legs, with a single code review and commit at the end of the flight.

Agentic Workflow

Orchestrate multi-agent flight execution. You drive the full leg cycle — designing legs, spawning Developer and Reviewer agents, and managing git workflow — for a target project's flight. Leg design is reviewed per leg, but code review and commit are deferred until after the last autonomous leg completes. This eliminates per-leg review/commit overhead while keeping the same leg design and implementation structure.

Prerequisites

  • Project must be initialized with /init-project (.flightops/ARTIFACTS.md must exist)
  • A mission must exist and be active
  • A flight must exist and be ready or in-flight

Invocation

/agentic-workflow flight {number} for {project-slug} mission {number}

Example: /agentic-workflow flight 03 for epipen mission 04

Phase 1: Context Loading

  1. Read projects.md to find the target project's path
  2. Read {target-project}/.flightops/ARTIFACTS.md for artifact locations
  3. Read {target-project}/.flightops/agent-crews/leg-execution.md for project crew definitions, interaction protocol, and prompts (fall back to defaults at .claude/skills/init-project/defaults/agent-crews/leg-execution.md)
    • Validate structure: The phase file MUST contain ## Crew, ## Interaction Protocol, and ## Prompts sections. Each prompt subsection MUST have a fenced code block.
    • If the file exists but is malformed: STOP. Tell the user: "Phase file leg-execution.md is missing required sections. Either fix it manually or re-run /init-project to reset to defaults." Do NOT improvise missing prompts — halt and get the file fixed.
  4. Read the mission artifact — outcomes, success criteria, constraints
  5. Read the flight artifact — objective, design decisions, leg list
  6. Read the flight log — ground truth from prior execution
  7. Count total legs from the flight spec — track progress throughout
  8. Determine starting point — which leg is next based on flight log and leg statuses

Mark flight as in-flight: After loading the flight artifact, if the flight status is ready, update it to in-flight before proceeding. If already in-flight, leave it as-is.

If resuming a flight already in progress, verify state consistency:

  • Flight log entries must match leg statuses
  • If discrepancies exist, remediate before proceeding

Phase 2: Leg Cycle

Repeat for each leg in the flight.

Mid-execution scope changes: if the work in this flight stops serving its original purpose (operator pivots, prior assumptions invalidated), don't rewrite the mission/flight artifacts in place. Preserve the original framing as commentary, record the pivot decision in the flight-log Flight Director Notes with rationale, and treat the new framing as the live spec going forward. If the pivot supersedes content in an upstream artifact (maintenance report, prior debrief), annotate at the artifact header rather than rewriting the body — inspection records are snapshots, not living plans.

2a: Leg Design

  1. Design the leg using the /leg skill (if the Skill tool is unavailable, read .claude/skills/leg/SKILL.md and follow the workflow directly)
    • Read the flight spec, flight log, and relevant source code
    • Create the leg artifact with acceptance criteria
  2. Spawn a Developer agent for design review (Task tool, subagent_type: "general-purpose")
    • Working directory: {target-project}
    • Provide the "Review Leg Design" prompt from the leg-execution phase file's Prompts section
    • The Developer reads the leg artifact and cross-references against actual codebase state
    • The Developer provides a structured assessment: approve, approve with changes, or needs rework
  3. Incorporate feedback — update the leg artifact to address any issues raised
    • High-severity issues: must fix before proceeding
    • Medium-severity issues: fix unless there's a clear reason not to
    • Low-severity issues and suggestions: apply at discretion
  4. Re-review if substantive changes were made — spawn another Developer for a second pass
    • Skip if only minor/cosmetic fixes were applied
    • If the second review raises new high-severity issues, fix and re-review once more
    • Max 2 design review cycles — if issues persist after 2 rounds, escalate to human
  5. Update leg status to ready
  6. Signal [HANDOFF:review-needed] when the leg design is finalized

2b: Leg Implementation

NEVER implement code directly. Spawn a Developer agent via the Task tool.

Interactive/HAT legs: If the leg is a HAT (human acceptance test), alignment, or other interactive leg (identified by slug like hat-*, alignment-*, or explicit marking in the flight spec), do NOT spawn agents to execute it autonomously. The human performs verification — the Flight Director guides them through it:

  1. Design the leg normally (2a), but keep it lightweight — the acceptance criteria are verification steps, not implementation tasks
  2. Skip the autonomous implementation cycle (no Developer/Reviewer agents)
  3. Guide the human through verification steps one at a time — present a single step, wait for the human to perform it and report results, then proceed to the next step
  4. Fix issues inline — if the human reports a failure, diagnose and fix it (spawning a Developer agent if code changes are needed), then re-verify that step before moving on
  5. Commit when all steps pass — update artifacts and commit

Standard (autonomous) legs: Spawn a Developer agent — but do NOT review or commit after each leg.

  1. Spawn a Developer agent (Task tool, subagent_type: "general-purpose")
    • Working directory: {target-project}
    • Provide the "Implement" prompt from the leg-execution phase file's Prompts section
    • The Developer updates leg status to in-flight, implements to acceptance criteria
    • When done, the Developer updates leg status to landed and updates flight log — do NOT let it commit or signal [HANDOFF:review-needed]

2c: Leg Transition

After the Developer completes a leg:

  1. Increment legs_completed
  2. If more autonomous legs remain → return to 2a
  3. If this was the last autonomous leg → proceed to Phase 2d

2d: Flight Review and Commit

After all autonomous legs are implemented (all uncommitted):

  1. Spawn a Reviewer agent (Task tool, subagent_type: "general-purpose")
    • Working directory: {target-project}
    • Provide the "Review" prompt from the leg-execution phase file's Prompts section
    • The Reviewer evaluates ALL uncommitted changes against acceptance criteria and code quality
    • The Reviewer signals [HANDOFF:confirmed] or lists issues with severity
  2. If issues found, spawn a new Developer agent to fix them
    • Provide the "Fix Review Issues" prompt from the leg-execution phase file with the Reviewer's feedback
    • Loop review/fix until the Reviewer confirms
  3. Commit after review passes — include all code changes, updated flight log, and all leg statuses updated to completed
  4. Manage PR: Open a draft PR with the leg checklist in the body (see PR Body Format below), all legs checked off

Phase 3: Flight Completion

  1. Verify all legs show completed status
  2. Verify flight log has entries for all legs
  3. Verify documentation — check that CLAUDE.md, README, and other project docs reflect any new commands, endpoints, configuration, or APIs introduced during the flight. If not, spawn a Developer agent to update them.
  4. Update flight status to landed
  5. Check off flight in mission artifact
  6. Signal [COMPLETE:flight]

The flight debrief is a separate step run via /flight-debrief after the flight lands. The debrief transitions the flight to completed.

Architecture

The Flight Director (you) orchestrates according to this skill. Project crew composition, roles, models, and prompts are defined in {target-project}/.flightops/agent-crews/leg-execution.md.

Separation is mandatory. Project crew agents run in the target project and load its CLAUDE.md and conventions. The Reviewer has no knowledge of the Developer's reasoning — only the resulting changes. This provides objective review.

Model selection: Follow the model preferences in the phase file. MC may use Opus for complex planning. Never use Opus for the Reviewer.

Handoff Signals

Signals are part of the Flight Control methodology and are NOT configurable per-project. All crew agents must use these exact signals:

Signal Emitted By Meaning
[HANDOFF:review-needed] Developer Code/artifact ready for review
[HANDOFF:confirmed] Reviewer Review passed
[BLOCKED:reason] Any crew agent Cannot proceed, needs resolution
[COMPLETE:leg] Developer Leg finished and committed
[COMPLETE:flight] Flight Director Flight landed

Flight Director Decision Log

Log orchestration decisions in the flight log under ### Flight Director Notes — phase file loaded, agents spawned, review-cycle calls, escalations, signal interpretations. Anyone reading the log should understand not just what the crew did but why MC made the choices it did.

Git Workflow

All agents work in the target project root on a feature branch created at flight start.

Branch naming: flight/{number}-{slug}

Flight start: git checkout -b flight/{number}-{slug}

Commit message format:

flight/{number}: {description}

Mission: {mission-number}

PR lifecycle:

Event Action
All legs complete Open draft PR with all legs checked off
Flight landed Mark PR ready for review

PR body format:

## {Flight Title}

{Flight objective — one paragraph}

**Mission**: {Mission Title}

## Legs

- [x] `{leg-slug}` — {brief description}
- [x] `{leg-slug}` — {brief description}

Error Handling

Situation Action
Developer agent fails mid-leg Spawn new Developer with context of what failed
Design review loops > 2 times Escalate to human with unresolved design issues
Code review loops > 3 times Escalate to human
Leg marked aborted Escalate to human with abort details
Artifact discrepancy Remediate before proceeding
Off the rails Roll back to last leg commit, escalate
Agent hangs on tests Kill the agent, spawn new Developer to isolate and fix hanging tests