| Agent | Description | Install |
|---|---|---|
| Orchestrator | Coordinates full dev lifecycle, delegates to subagents. Never implements directly. | |
| Researcher | Investigates technologies, maps codebases, Context7-first source verification. | |
| Planner | Creates roadmaps and executable plans. Plans are prompts — WHAT not HOW. | |
| Coder | Writes production code with per-task atomic commits. 9 mandatory coding principles. | |
| Designer | UI/UX with anti-AI-slop aesthetics. Usability > accessibility > aesthetics. | |
| Verifier | Goal-backward verification. Task completion ≠ goal achievement. | |
| Debugger | Scientific debugging with hypothesis testing and persistent debug files. |
| Instruction | Applies To | Install |
|---|---|---|
| Admin Docs | {doc,docs}/admin/**/*.md |
|
| Technical Docs | {doc,docs}/technical/**/*.md |
|
| User Docs | {doc,docs}/user/**/*.md |
| Skill | License | Install |
|---|---|---|
| Frontend Design | Apache 2.0 |
Model: Claude Sonnet 4.6 · Tools: read/readFile, agent, memory
Coordinates the full development lifecycle by delegating to subagents. Never implements directly. Routes requests to the shortest path — a bug report goes straight to the Debugger, a UI-only task goes to the Designer, and a full project goes through the 10-step execution model.
Key capabilities:
- 10-step execution model: Research → Plan → Execute → Verify → Debug → Iterate
- Request routing table — picks the shortest path for each request type
- Parallel execution with
/fleetmode - Debugger mode selection (
find_root_cause_onlyvsfind_and_fix) - File conflict prevention strategies for parallel work
.planning/artifact management
Model: GPT-5.4 · Tools: vscode, execute, read, context7/*, edit, search, web, memory
Investigates technologies, maps codebases, and researches implementation approaches. Context7-first, source-verified. Training data is treated as a hypothesis — everything is verified against live sources.
4 Modes:
| Mode | Output |
|---|---|
| project | SUMMARY.md, STACK.md, FEATURES.md, ARCHITECTURE.md, PITFALLS.md |
| phase | RESEARCH.md for specific phase implementation |
| codebase | STACK.md, INTEGRATIONS.md, ARCHITECTURE.md, CONVENTIONS.md, etc. |
| synthesize | Consolidated SUMMARY.md across all research |
Source hierarchy: Context7 (HIGH) → Official docs (HIGH) → Web search (MEDIUM) → Training data (LOW)
Model: GPT-5.4 · Tools: vscode, execute, read, context7/*, edit, search, web, memory, todo
Creates roadmaps, implementation plans, validates plans, creates gap-closure plans, and revises based on feedback. Plans are prompts — every plan must be executable by a single agent in a single session.
5 Modes:
| Mode | Output |
|---|---|
| roadmap | ROADMAP.md, STATE.md, REQUIREMENTS.md |
| plan | PLAN.md per task group with dependency graph |
| validate | 6-dimension verification (coverage, completeness, dependencies, links, scope, derivation) |
| gaps | Gap-closure PLAN.md files from verification failures |
| revise | Targeted plan updates from validation issues |
Philosophy: WHAT not HOW. Goal-backward. Keep plans under 50% context utilization (2–3 tasks per plan). Fleet wave assignments in dependency graphs for parallel execution.
Model: Claude Opus 4.6 · Tools: vscode, execute, read, context7/*, github/*, edit, search, web, memory, todo
Writes production code following 9 mandatory coding principles. Executes plans atomically with per-task commits. Always uses #context7 to look up documentation before coding.
9 Mandatory Principles: Structure, Architecture, Functions, Naming & Comments, Logging & Errors, Regenerability, Platform Use, Modifications, Quality
Execution model:
- Load project state → Load plan → Execute tasks
- Run
ide-get_diagnosticson modified files after implementation - Run verification commands
- Per-task conventional commits (never
git add .) - Handle deviations with priority rules (ask about architecture, auto-fix bugs/blockers)
- TDD support: RED → GREEN → REFACTOR when detected
Model: Gemini 3 Pro Preview · Tools: vscode, execute, read, context7/*, edit, search, web, memory, todo
Handles all UI/UX design tasks. Prioritizes usability over accessibility over aesthetics. Uses the frontend-design skill for production-grade design quality. Pushes back on technical constraints that harm UX.
Workflow: Understand intent → Research (Context7 + @file mentions) → Design with full implementation → Verify (accessibility, responsiveness, ide-get_diagnostics)
Principles: Less is more · Consistency · Feedback · Hierarchy · Whitespace · Purposeful motion
Model: Claude Sonnet 4.6 · Tools: vscode, execute, read, edit, search, memory
Goal-backward verification of phase outcomes and cross-phase integration. Task completion ≠ goal achievement. Does NOT trust SUMMARY.md claims — verifies everything independently.
3 Modes:
| Mode | Output |
|---|---|
| phase | VERIFICATION.md — 10-step process with observable truths, 3-level artifact checks, key link verification |
| integration | INTEGRATION.md — cross-phase wiring, API coverage, auth protection, end-to-end flows |
| re-verify | Updated VERIFICATION.md after gap closure |
3-Level Artifact Verification:
- Existence — Does the file exist?
- Substance — Is it real code, not a stub? (line count, TODO scan, LSP diagnostics)
- Wired — Is it actually imported and used?
Model: Claude Opus 4.6 · Tools: vscode, execute, read, edit, search, web, memory, context7/*
Scientific debugging with hypothesis testing, persistent debug files, and structured investigation. Never guesses — every conclusion must have evidence.
2 Modes:
| Mode | Description |
|---|---|
| find_and_fix | Find root cause AND implement fix (default) |
| find_root_cause_only | Document root cause without fixing |
Cognitive bias guards: Confirmation, Anchoring, Availability, Sunk Cost — each with specific antidotes.
Techniques: LSP Diagnostics (always first for crashes/type errors) · Binary Search · Rubber Duck · Minimal Reproduction · Working Backwards · Differential Debugging · Observability First · Comment Out Everything · Git Bisect
Debug file protocol: Immutable symptoms, overwrite current focus, append-only evidence log.
Three documentation instruction sets that automatically apply based on file path patterns.
For operators and support engineers. Lead with prerequisites and blast radius. Rollback section before the procedure. Decision-tree troubleshooting. Post-action verification mandatory. Destructive operations require safety gates.
For engineers maintaining the system. State invariants before implementation. Design rationale non-negotiable. Catalog known failure modes. Performance with proof. Link code examples to actual source. Make trade-offs explicit.
For first-time users. Lead with outcome, not feature. Quick start in 3–5 steps. Progressive complexity (quickstart → concepts → reference → advanced). Troubleshoot by symptom. "You" voice throughout.
Production-grade frontend interfaces that avoid generic "AI slop" aesthetics. The Designer agent references this skill automatically for all UI/UX work.
Design thinking before coding:
- Purpose — What problem does this interface solve?
- Tone — Bold aesthetic direction (brutally minimal, maximalist, retro-futuristic, editorial, etc.)
- Differentiation — What makes this unforgettable?
Aesthetics guidelines:
- Typography — Distinctive, characterful fonts. Never generic (Arial, Inter, Roboto).
- Color — Cohesive palette with sharp accents. Dominant colors over timid distribution.
- Motion — High-impact moments: staggered page load reveals, scroll-triggering, surprising hover states.
- Spatial — Unexpected layouts, asymmetry, overlap, grid-breaking elements.
- Backgrounds — Atmosphere and depth: gradient meshes, noise textures, geometric patterns, grain overlays.
Licensed under Apache 2.0 (LICENSE).
User: "Build a recipe sharing app"
│
▼
Orchestrator
├─1─► Researcher (project mode) → domain research
├─2─► Researcher (synthesize) → consolidated findings
├─3─► Planner (roadmap mode) → phase breakdown
│
│ For each phase:
├─4─► Researcher (phase mode) → implementation research
├─5─► Planner (plan mode) → task-level plans
├─6─► Planner (validate mode) → 6-dimension plan check
├─7─► Coder + Designer (/fleet) → parallel implementation
├─8─► Verifier (phase mode) → goal-backward verification
│ └── gaps? → Planner (gaps) → Coder → Verifier (max 3 cycles)
│
│ After all phases:
├─9─► Verifier (integration) → cross-phase wiring check
└─10─► Report + /share → final report + session export
Not every request needs the full lifecycle:
| Request | Route |
|---|---|
| New project / greenfield | Full flow (Steps 1–10) |
| New feature on existing codebase | Steps 3–10 |
| Bug report — "why is this failing?" | Debugger (find_root_cause_only) |
| Bug report — "fix this" | Debugger (find_and_fix) |
| Quick code change (single file) | Coder directly |
| UI/UX only | Designer directly |
| Verify existing work | Verifier directly |
Parallel when: tasks touch different files, different domains, no data dependencies. Sequential when: Task B needs Task A output, same file modified, design before implementation.
The Orchestrator uses /fleet mode + explicit file scoping to run Coder and Designer concurrently on independent tasks.
Enable fleet mode before launching parallel subagent calls. Coder + Designer run concurrently on independent tasks rather than one at a time.
Use @path/to/file in delegation prompts to anchor agents to specific files. The agent receives file contents directly in context rather than spending turns searching.
After the final report, export the session to a GitHub gist — a permanent, linkable record of architectural decisions, research, and verification results.
For long-running full flows (Steps 1–10), enable Autopilot (Shift+Tab to cycle modes) to run without approval at each step. Checkpoints still pause for genuine human decisions.
.planning/
├── REQUIREMENTS.md # Requirements with REQ-IDs
├── ROADMAP.md # Phase breakdown with success criteria
├── STATE.md # Project state tracking
├── INTEGRATION.md # Cross-phase verification
├── research/ # Research outputs
│ ├── SUMMARY.md # Consolidated research
│ ├── STACK.md # Technology choices
│ ├── FEATURES.md # Feature analysis
│ ├── ARCHITECTURE.md # Architecture patterns
│ └── PITFALLS.md # Known pitfalls
├── codebase/ # Codebase analysis
│ ├── STACK.md
│ ├── CONVENTIONS.md
│ ├── ARCHITECTURE.md
│ └── ...
├── phases/
│ ├── 1/
│ │ ├── RESEARCH.md # Phase research
│ │ ├── PLAN.md # Task plans
│ │ ├── SUMMARY.md # Execution summary
│ │ └── VERIFICATION.md # Phase verification
│ └── N/
└── debug/ # Debug session files
└── BUG-[timestamp].md
- VS Code with GitHub Copilot extension (Chat enabled)
- Agent mode enabled in Copilot settings
- Models available: Claude Sonnet 4.6, Claude Opus 4.6, GPT-5.4, Gemini 3 Pro Preview
- Git initialized in the workspace
- Context7 first — All agents verify library/framework docs via Context7 before using training data. This prevents stale-knowledge bugs.
- Plans are prompts — Each plan is consumed by exactly one agent in one session. If a plan needs a meeting to understand, it's too complex.
- Never tell agents HOW — Describe WHAT needs to be done (outcomes), not implementation steps.
- 3-level artifact verification — Existence ≠ Substance ≠ Wired. The Verifier checks all three levels.
- LSP diagnostics integration — Coder, Designer, Verifier, and Debugger all use
ide-get_diagnosticsfor fast type/lint error detection before full builds. - Fleet mode for parallel work — Use
/fleetwhen Coder and Designer work on independent files simultaneously. Always scope agents to specific files to prevent conflicts. @mentions save context — Reference files with@path/to/filein prompts instead of describing them. Saves agent turns and prevents miscommunication.- Debug files are persistent — Every debug session creates
.planning/debug/BUG-[timestamp].mdwith immutable symptoms and append-only evidence. - Gap-closure loop — After verification, the Planner creates targeted fix plans (max 3 cycles) before escalating to the user.
- Resumability — Read
STATE.mdto determine current position and resume from the correct step.
When tasks in a phase touch different files:
- Enable
/fleetmode - Call Coder and Designer with explicit file scoping
- Wait for all to complete before verification
- Strategy 1: Explicit file assignment — "Create
src/contexts/ThemeContext.tsx. Do NOT touch any other files." - Strategy 2: Sequential sub-phases when files overlap
- Strategy 3: Component boundaries — assign agents to distinct component subtrees
When test frameworks are configured or user mentions TDD, plans auto-structure as RED → GREEN → REFACTOR:
- Write failing test → commit
- Minimum code to pass → commit
- Refactor without behavior change → commit
- Read
.planning/STATE.md - Check current phase and status
- Resume from the correct step based on what artifacts exist
- Research before assumption — Training data is stale. Verify everything.
- Plans are prompts — If an agent can't execute it in one session, split it.
- WHAT not HOW — Describe outcomes. Agents decide implementation.
- Goal-backward verification — Start from the desired end state and work backwards.
- Task completion ≠ Goal achievement — Files existing doesn't mean they work.
- Scientific debugging — Hypothesize, test, eliminate. Never guess.
- Atomic commits — One task, one commit. Never
git add . - Immutable symptoms — Original bug reports are ground truth. Never edit.
- Context7 first — Live documentation over training data, always.
- Anti-enterprise — Solo developer workflow. If it needs a meeting, it's too complex.
Repository: https://github.com/japperJ/JP-agent-flow-fleet
The new version is here using Fleet multiple agents when possibly https://gist.github.com/japperJ/c0df8aa1d320c69deeea513d4aacc3ac