AndrewAltimit/README.md

## README.md

      
    Raw
  

              README.md
            
          
Board-Centric Agent Workflow

This document describes the end-to-end workflow for AI-assisted issue resolution, from initial issue creation through merged PR.

PDF Version: Build from Agentic_Workflow_Handout.tex using pdflatex for a printable 1-page handout.

Overview

All agent work flows through the GitHub Projects v2 board as a deliberate triage step. This ensures:

Human oversight on what gets worked on
Clear visibility into work queue
Prevents agents from working on inappropriate issues
Single source of truth for agent work status

Key Principle


Humans decide WHAT to work on and WHEN to merge.
Agents handle HOW with automated quality loops.

Main Workflow Diagram

┌─────────────────────────────────────────────────────────────────────────────────┐
│                        AI-ASSISTED DEVELOPMENT WORKFLOW                         │
│                  Human oversight with automated agent feedback loops            │
├─────────────────────────────────────────────────────────────────────────────────┤
│                                                                                 │
│═════════════════════════════════════════════════════════════════════════════════│
│                                FORWARD FLOW                                     │
│═════════════════════════════════════════════════════════════════════════════════│
│                      (human+agent)                                              │
│  ┌────────────┐     ┌────────────┐     ┌────────────┐     ┌────────────┐        │
│  │   Issue    │────►│  Add to    │────►│  Approve   │────►│   Agent    │        │
│  │ Refinement │     │   Board    │     │   Work     │     │ Implements │        │
│  │  (Agent)   │     │ (Human OR  │     │  (Human)   │     │  (Agent)   │        │
│  │            │     │   Agent)   │     │            │     │            │        │
│  └────────────┘     └────────────┘     └────────────┘     └────────────┘        │
│       │                  │                  │                   │               │
│       │                  │               GATE 1                 │               │
│       │                  │             (Approval)               │               │
│       ▼                  ▼                  ▼                   ▼               │
│   [Backlog]           [Todo]            [Todo]           [In Progress]          │
│                                                                 │               │
│═════════════════════════════════════════════════════════════════│═══════════════│
│                                                                 ▼               │
│   ┌────────────┐     ┌────────────┐     ┌────────────┐     ┌────────────┐       │
│   │   Merge    │◄────│   Review   │◄────│     CI     │◄────│  Create    │       │
│   │    PR      │     │   Code     │     │  Feedback  │     │    PR      │       │
│   │  (Human)   │     │  (Human)   │     │   Loop     │     │  (Agent)   │       │
│   └────────────┘     └────────────┘     └────────────┘     └────────────┘       │
│        │                  │                  ▲                   │              │
│     GATE 3             GATE 2                │                   │              │
│    (Merge)            (Review)    ┌──────────┴──────────┐        ▼              │
│        ▼                  │       │                     │   ┌────────────┐      │
│     [Done]                │       │    Auto-Fix Loop    │   │ AI Council │      │
│                           │       │   (Format/Lint)     │   │  Reviews   │      │
│                           │       │  Max 5 iterations   │   └─────┬──────┘      │
│                           │       └──────────┬──────────┘         │             │
│                           │                  ▲                    │             │
│                           │                  │                    ▼             │
│                           │                  └────────────────────┘             │
│                           │                   (review response)                 │
│                           │                                                     │
│                           └─────────────────────────────────────────────┐       │
│                                    (changes requested)                  │       │
│                                                                         ▼       │
│                                                                  ┌────────────┐ │
│                                                                  │ AI Council │ │
│                                                                  │  Re-review │ │
│                                                                  └────────────┘ │
│                                                                                 │
└─────────────────────────────────────────────────────────────────────────────────┘

Legend: ■ Human Gate (3 total)   □ Agent Work   ◇ Review/Feedback   ○ Kanban State

Key: Humans decide WHAT to work on and WHEN to merge. Agents handle HOW.
     Both humans AND agents can add issues to the board.

Stage 1: Issue Refinement

Workflow: backlog-refinement.yml
Schedule: Manual trigger (can be scheduled)
AI agents periodically review open issues and add insights:

Architecture perspective: Design considerations, patterns to use
Security perspective: Potential vulnerabilities, auth requirements
Implementation perspective: Suggested approach, complexity estimate
Maintenance perspective: Testing needs, documentation requirements

How It Works


Workflow queries issues older than N days (default: 3)
Multiple AI agents analyze each issue
Agents post comments with unique insights
14-day cooldown prevents same agent re-commenting

Configuration

# .github/workflows/backlog-refinement.yml inputs
agents: 'claude,gemini'      # Which agents to use
max-issues: 5                # Issues per run
max-comments: 2              # Comments per issue
min-age-days: 3              # Minimum issue age
dry-run: true                # Preview without posting
enable-issue-management: false  # Allow agents to close/update issues
Issue Management (Optional)

When enable-issue-management: true, agents can:

Close duplicate or resolved issues
Update unclear titles
Add/remove labels
Link related PRs

Maintainer comments (from agent_admins) get highest priority in decisions.

Stage 2: Add to Board (Manual Triage)

Action: Manual
Who: Repository maintainer
After refinement, maintainers decide which issues are ready for agent work:

Review issue and AI insights
Add issue to the GitHub Projects v2 board
Set status to "Todo"
Optionally assign an agent via the "Agent" field

Board Configuration

The board is configured in ai-agents-board.yml:
project:
  owner: AndrewAltimit
  number: 1  # Your project number

columns:
  todo: "Todo"
  in_progress: "In Progress"
  blocked: "Blocked"
  done: "Done"

fields:
  agent: "Agent"           # Which agent should work on it
  priority: "Priority"     # P0, P1, P2, P3
  complexity: "Complexity" # Low, Medium, High
What Makes an Issue "Ready"

An issue is considered ready for agent work when:

Status is "Todo"
Not blocked by other issues
Not currently claimed by another agent
Has [Approved] trigger from an agent_admin


Stage 3: Approval

Action: Comment with trigger keyword
Who: Users in agent_admins list (.agents.yaml)
To authorize an agent to work on an issue:
[Approved][Claude]

Or let the board decide which agent:
[Approved]

Trigger Keywords


Keyword
Action


[Approved]
Implement the issue


[Review]
Review and provide feedback


[Summarize]
Summarize the discussion


[Debug]
Debug the issue


[Close]
Close the issue


Agent Selection Priority


Explicit in trigger: [Approved][Claude] uses Claude
Board field: Agent field on the project board item
Config priority: First available from agent_priorities.issue_creation

Who Can Approve

Only users in .agents.yaml security.agent_admins:
security:
  agent_admins:
    - AndrewAltimit  # Only humans who can trigger agent actions

Stage 4: PR Creation

Workflow: scheduled-agent-work.yml → board-agent-worker.yml
Schedule: Every 12 hours (6 AM / 6 PM UTC)
How It Works


Workflow queries board for ready work
Agent claims the issue (prevents conflicts)
Agent creates feature branch: fix-issue-{number}-{short-id}
Agent implements the fix using its tools
Agent creates PR linked to issue
Claim is released

Claim Management

Claims prevent multiple agents from working on the same issue:

Claims expire after 2 hours (configurable)
Stale claims are automatically cleaned up
Agents can renew claims for long-running work

PR Format

The agent creates a PR with:
## Summary
- Brief description of changes

## Test plan
- [ ] Unit tests added/updated
- [ ] Manual testing performed

Fixes #123

---
Generated by Claude via board-agent-worker

Stage 5: AI + Human Feedback Loop

Once the PR is created, multiple feedback mechanisms engage in a tight automated loop.
Agent Council

Multiple AI agents provide diverse review perspectives:
┌────────────────────────────────────────────────────────────────┐
│                       AGENT COUNCIL                            │
├────────────────────────────────────────────────────────────────┤
│                                                                │
│   ┌─────────────┐     ┌─────────────┐     ┌─────────────┐      │
│   │   Gemini    │     │   Codex     │     │   Claude    │      │
│   │  (Review)   │     │  (Review)   │     │   (Fix)     │      │
│   └──────┬──────┘     └──────┬──────┘     └──────┬──────┘      │
│          │                   │                   │             │
│          │    Security &     │   Code Quality    │             │
│          │   Architecture    │    & Patterns     │             │
│          │                   │                   │             │
│          └───────────────────┴───────────────────┘             │
│                              │                                 │
│                              ▼                                 │
│                    ┌─────────────────┐                         │
│                    │  Consolidated   │                         │
│                    │    Feedback     │                         │
│                    └─────────────────┘                         │
│                                                                │
└────────────────────────────────────────────────────────────────┘

Note: Reviews are NOT weighted equally.
Maintainer feedback > AI suggestions > Community feedback

5a. Gemini Automated Review

Workflow: pr-validation.yml
Trigger: PR opened/synchronized
Gemini automatically reviews every PR for:

Security vulnerabilities
Code quality issues
Adherence to project standards
Container configuration problems

Review appears as PR comment with actionable feedback.
5b. Codex Automated Review

Workflow: pr-validation.yml
Trigger: PR opened/synchronized
Codex provides complementary review focusing on:

Code patterns and best practices
Performance considerations
API design feedback

5c. Inline Agent Review Response (NEW)

Workflow: pr-validation.yml → agent-review-response job
Trigger: Immediately after Gemini/Codex reviews complete
┌─────────────────────────────────────────────────────────────────┐
│                  TIGHT FEEDBACK LOOP 1                          │
│                  (Review Response)                              │
├─────────────────────────────────────────────────────────────────┤
│                                                                 │
│  ┌─────────────┐     ┌─────────────┐     ┌─────────────┐        │
│  │   Gemini    │────►│   Cleanup   │────►│   Codex     │        │
│  │   Review    │     │   Working   │     │   Review    │        │
│  │             │     │    Dir      │     │             │        │
│  └─────────────┘     └─────────────┘     └──────┬──────┘        │
│                                                 │               │
│                                                 ▼               │
│                                          ┌─────────────┐        │
│                                          │   Cleanup   │        │
│                                          │   Working   │        │
│                                          │    Dir      │        │
│                                          └──────┬──────┘        │
│                                                 │               │
│                                                 ▼               │
│  ┌─────────────┐     ┌─────────────┐     ┌─────────────┐        │
│  │   Push      │◄────│   Claude    │◄────│  Download   │        │
│  │   Changes   │     │  Auto-Fix   │     │  Artifacts  │        │
│  └──────┬──────┘     └─────────────┘     └─────────────┘        │
│         │                                                       │
│         │         (Re-triggers pipeline)                        │
│         └──────────────────────────────────────────────────►    │
│                                                                 │
│  Reviews always run on all commits.                             │
│  Agent evaluates if there are new/unaddressed issues to fix.    │
│  Safety: Max 5 iterations per PR                                │
│                                                                 │
└─────────────────────────────────────────────────────────────────┘

How It Works:

Gemini completes review, working directory cleaned
Codex downloads Gemini artifact and completes review, working directory cleaned
Agent-review-response downloads review artifacts
Claude parses review artifacts for actionable feedback
Autoformat runs first (black, isort)
Claude fixes remaining lint/format issues
Changes committed by agent (detected via author name)
Push retriggers pipeline; reviews run again but agent evaluates if new issues exist

5d. CI Failure Handler (NEW)

Workflow: pr-validation.yml → agent-failure-handler job
Trigger: When format-check, basic-lint, or full-lint fails
┌─────────────────────────────────────────────────────────────────┐
│                  TIGHT FEEDBACK LOOP 2                          │
│                  (CI Failure Handler)                           │
├─────────────────────────────────────────────────────────────────┤
│                                                                 │
│  ┌─────────────┐     ┌─────────────┐     ┌─────────────┐        │
│  │  Format/    │────►│  Autoformat │────►│   Claude    │        │
│  │  Lint Fail  │     │   (black,   │     │  Fixes      │        │
│  │             │     │   isort)    │     │  Remaining  │        │
│  └─────────────┘     └─────────────┘     └──────┬──────┘        │
│                                                 │               │
│                                                 ▼               │
│                                          ┌─────────────┐        │
│                                          │    Push     │        │
│                                          │  Changes    │        │
│       ◄──────────────────────────────────┴─────────────┘        │
│       │         (Re-triggers pipeline)                          │
│       ▼                                                         │
│  ┌─────────────┐                                                │
│  │   CI Runs   │  Fixed issues should pass                      │
│  │   Again     │                                                │
│  └─────────────┘                                                │
│                                                                 │
│  Handles: formatting, linting, type errors, unused imports      │
│  Does NOT handle: test failures, build errors                   │
│                                                                 │
└─────────────────────────────────────────────────────────────────┘

Scope (Safe Categories):

Formatting issues (ruff format)
Linting errors (ruff)
Type errors (mypy)
Unused imports

Out of Scope (Requires Human):

Test failures (may indicate logic bugs)
Build failures (may require architecture decisions)

5e. Human Comments

Humans can:

Add comments with suggestions
Request specific changes
Ask clarifying questions

The PR Review Monitor can respond to human feedback if it contains clear action items.
Loop Prevention

Multiple safety mechanisms prevent infinite loops:


Mechanism
Description


Always Review
Reviews run on all commits; agent decides whether to act on feedback


Clean Slate
Working directory cleaned between reviewers to ensure consistency


Iteration Counter
Max 5 iterations via workflow artifacts


Human Reset
Counter resets when human pushes


Smart Agent
Agent checks if issues are new/unaddressed before committing


Feedback Priority


Security issues: Auto-fixed immediately
Maintainer feedback: Highest priority (from agent_admins)
AI review feedback: Standard priority
Community feedback: Considered but requires maintainer validation

Auto-Fix Confidence

The agent uses a confidence system:


Confidence
Action


High (>80%)
Auto-fix without asking


Medium (50-80%)
Fix but flag for review


Low (<50%)
Ask maintainer for guidance


High-confidence issues:

Formatting/linting errors
Type errors with clear fixes
Security issues with standard solutions

Low-confidence issues:

Architectural changes
API modifications
Behavioral changes


Stage 6: Merge

Action: Manual
Who: Repository maintainer
Merge Requirements

A PR is ready to merge when:

CI Passes: All required checks green
Reviews Approved: No outstanding "changes_requested"
Human Approval: Maintainer approves the PR
No Conflicts: Branch is up to date with main

Auto-Undraft

The PR Review Monitor automatically marks PRs as "Ready for Review" when:

All CI checks pass
No pending review changes
Agent work is complete

Merge Process


Maintainer reviews final changes
Approves PR via GitHub UI
Merges (squash recommended for clean history)
Issue automatically closes via "Fixes #N" reference


Workflow Files Reference


Workflow
Purpose
Trigger


backlog-refinement.yml
AI insights on issues
Manual/scheduled


scheduled-agent-work.yml
Process board work
Every 12 hours


board-agent-worker.yml
Reusable agent executor
Called by above


pr-validation.yml
Gemini PR review
PR events


pr-review-monitor.yml
Respond to feedback
Hourly + review events


Configuration Files


File
Purpose


.agents.yaml
Agent config, security lists


ai-agents-board.yml
Board column/field mapping


CLAUDE.md
Instructions for Claude


docs/agents/project-context.md
Context for Gemini reviews


Security Model

Separation of Concerns

# .agents.yaml
security:
  # Can trigger [Approved][Agent] - humans only
  agent_admins:
    - AndrewAltimit

  # Comments trusted for context - includes bots
  trusted_sources:
    - AndrewAltimit
    - github-actions[bot]
    - dependabot[bot]
Safety Guarantees


Board gating: Issues must be triaged before agent work
Approval required: Only agent_admins can trigger agents
Human merge: Final merge always requires human approval
CI validation: All changes must pass automated checks
Audit trail: All agent actions logged in issue/PR comments


Example: Full Lifecycle

# Day 1: Issue Created
# User opens issue #42: "Add rate limiting to API"

# Day 4: Refinement
# backlog-refinement.yml runs
# Claude comments: "Consider token bucket algorithm, add Redis for distributed limiting"
# Gemini comments: "Security: Ensure rate limit headers are included in responses"

# Day 5: Triage
# Maintainer reviews insights, adds issue to board
# Sets Agent field to "Claude"

# Day 5: Approval
# Maintainer comments: [Approved][Claude]

# Day 5 (6 PM UTC): Implementation
# scheduled-agent-work.yml runs
# Claude claims issue, creates branch fix-issue-42-abc123
# Implements rate limiting with tests
# Creates PR #43

# Day 5: Review
# Gemini auto-reviews PR
# Requests change: "Add rate limit bypass for health checks"

# Day 5: Feedback Loop
# pr-review-monitor.yml detects feedback
# Claude implements health check bypass
# Commits and comments completion

# Day 6: Merge
# Maintainer reviews final changes
# Approves and merges PR
# Issue #42 automatically closes

Troubleshooting

Issue Not Being Picked Up


Check issue is on the board with "Todo" status
Verify [Approved] comment exists from agent_admin
Check for blocking issues
Review scheduled-agent-work.yml logs

Agent Not Responding to Feedback


Check pr-review-monitor.yml workflow runs
Verify review state is "changes_requested"
Check agent availability (Claude auth, API keys)
Review feedback format (must be actionable)

PR Stuck in Draft


Check all CI checks pass
Verify no "changes_requested" reviews
Review agent comments for errors
Manually undraft if needed
Keyword	Action
`[Approved]`	Implement the issue
`[Review]`	Review and provide feedback
`[Summarize]`	Summarize the discussion
`[Debug]`	Debug the issue
`[Close]`	Close the issue
Mechanism	Description
Always Review	Reviews run on all commits; agent decides whether to act on feedback
Clean Slate	Working directory cleaned between reviewers to ensure consistency
Iteration Counter	Max 5 iterations via workflow artifacts
Human Reset	Counter resets when human pushes
Smart Agent	Agent checks if issues are new/unaddressed before committing
Confidence	Action
High (>80%)	Auto-fix without asking
Medium (50-80%)	Fix but flag for review
Low (<50%)	Ask maintainer for guidance
Workflow	Purpose	Trigger
`backlog-refinement.yml`	AI insights on issues	Manual/scheduled
`scheduled-agent-work.yml`	Process board work	Every 12 hours
`board-agent-worker.yml`	Reusable agent executor	Called by above
`pr-validation.yml`	Gemini PR review	PR events
`pr-review-monitor.yml`	Respond to feedback	Hourly + review events
File	Purpose
`.agents.yaml`	Agent config, security lists
`ai-agents-board.yml`	Board column/field mapping
`CLAUDE.md`	Instructions for Claude
`docs/agents/project-context.md`	Context for Gemini reviews