bedwards/a.md

## a.md

      
Claude Agent SDK Lessons Learned (War Stories)

Real-world lessons from building with the Claude Agent SDK:

Process Discipline


Manager vs Worker Separation - The conversational Claude should NEVER write implementation code. Create issues, let worker agents implement. This prevents scope creep and ensures all work is tracked.


Don't Remove Functionality - When adding features, expand existing code rather than replacing it. Easy to accidentally break things during "simplification."


Maintain Broad Perspective - Easy to get tunnel vision on one feature. Keep the whole system in mind.


Technical Lessons


Git Worktrees for Isolation - Workers operate in separate worktrees so failures don't affect the main branch. Essential for autonomous operation.


Frequent Commits - Workers should commit after each significant change. Enables recovery and visibility.


Local Validation First - Run lint/typecheck/tests locally before pushing. Faster feedback than CI.


Handle API Rate Limits - The GitHub API enforces rate limits, so workers need retry logic with backoff.
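
A minimal sketch of the backoff logic mentioned in the rate-limit item, assuming a hypothetical `RateLimitError` that the worker's GitHub client wrapper raises when it is throttled. The function name, parameters, and defaults are illustrative and not part of any SDK. It uses exponential backoff with full jitter so several workers do not retry in lockstep.

```python
import random
import time
from typing import Callable, TypeVar

T = TypeVar("T")


class RateLimitError(Exception):
    """Hypothetical error raised by a GitHub client wrapper when throttled."""


def call_with_backoff(
    fn: Callable[[], T],
    max_attempts: int = 5,
    base_delay: float = 1.0,
    max_delay: float = 60.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call fn, retrying on RateLimitError with exponential backoff and jitter."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(max_attempts):
        try:
            return fn()
        except RateLimitError:
            if attempt == max_attempts - 1:
                raise  # out of attempts, surface the error to the caller
            # The cap doubles each attempt (1s, 2s, 4s, ...) up to max_delay.
            cap = min(max_delay, base_delay * 2 ** attempt)
            # Full jitter: wait a random amount between 0 and the cap.
            sleep(random.uniform(0, cap))
    raise AssertionError("unreachable")
```

The `sleep` parameter exists only so the delay can be replaced in tests. GitHub's REST API also reports `retry-after` and `x-ratelimit-reset` response headers; a real worker would probably prefer those values over a computed delay when they are present.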


Claude Agent SDK Specifics


Tool Results Are Rich - Tool results can include system reminders and context. Parse carefully.


Context Windows Fill Up - Long conversations get compacted. Maintain external state (todo lists, issue descriptions) rather than relying on conversation memory.


Parallel Tool Calls - Use parallel tool calls when operations are independent. Significant speedup.


Subagent Spawning - The Task tool spawns subagents for complex work. Each subagent starts with a fresh context.
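
To illustrate the point about keeping state outside the conversation, here is a small sketch that persists a todo list to a JSON file, so progress survives compaction or a fresh subagent context. The file layout (`{"todos": [...]}`) and the function names are assumptions made for this example, not an SDK feature.

```python
import json
import os
import tempfile
from pathlib import Path


def save_state(path: Path, state: dict) -> None:
    """Write state as JSON via a temp file so a crash never leaves a partial file."""
    path.parent.mkdir(parents=True, exist_ok=True)
    fd, tmp_name = tempfile.mkstemp(dir=path.parent, suffix=".tmp")
    with os.fdopen(fd, "w", encoding="utf-8") as f:
        json.dump(state, f, indent=2)
    # os.replace swaps the finished file into place in one step.
    os.replace(tmp_name, path)


def load_state(path: Path) -> dict:
    """Read saved state, or start with an empty todo list if none exists yet."""
    if not path.exists():
        return {"todos": []}
    return json.loads(path.read_text(encoding="utf-8"))


def mark_done(state: dict, title: str) -> None:
    """Flag every todo with the given title as done (in place)."""
    for todo in state["todos"]:
        if todo["title"] == title:
            todo["done"] = True
```

A worker would call `load_state` at the start of each step and `save_state` after each change, treating the file (or the issue description) as the record of truth rather than its own memory of the conversation.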


What Works Well


GitHub Issues as source of truth for requirements
Worker agent handling full PR lifecycle
Claude GitHub integration for automated review
Structured logging for debugging autonomous workers

What's Tricky


Error recovery when worker hits unexpected states
Balancing autonomy with user oversight
Managing context across long conversations
Testing async/autonomous behaviors


## b.md

      
Analysis: Why We're Struggling

Parallel work on conflicting files


Launched 5 workers simultaneously on issues that ALL modified main.ts
Result: 4 PRs with merge conflicts, had to close and redo
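
One way to catch this before launching workers is to compare the files each issue is expected to touch and only run issues in parallel when their file sets do not overlap. The sketch below is a simple greedy grouping. It assumes each issue has already been annotated with a set of file paths, which is a hypothetical input format, and the issue numbers used with it are placeholders.

```python
from typing import Dict, List, Set


def plan_batches(issue_files: Dict[int, Set[str]]) -> List[List[int]]:
    """Group issues into batches in which no two issues share a file.

    Issues within one batch can run in parallel; batches run one after another.
    """
    batches: List[List[int]] = []
    batch_files: List[Set[str]] = []  # files already claimed by each batch
    for issue, files in sorted(issue_files.items()):
        for i, used in enumerate(batch_files):
            if used.isdisjoint(files):
                # No overlap with anything in this batch, so it can join.
                batches[i].append(issue)
                used.update(files)
                break
        else:
            # Conflicts with every existing batch: start a new one.
            batches.append([issue])
            batch_files.append(set(files))
    return batches
```

With five issues that all modify `main.ts`, this returns five single-issue batches, which means strictly sequential work. The grouping is greedy and not guaranteed to produce the fewest possible batches.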


Missing deployment automation


Worker code merged but never deployed to Cloudflare
User discovered broken app in production


Incomplete requirements capture


Original issue #40 was broad ("AI-generated branches")
Didn't specify: home page, URL routing, multi-story, saved games
Discovered requirements incrementally during implementation


No end-to-end verification


Workers ran tests, CI passed, but nobody tested the actual deployed app
Broken user experience shipped

Recommendations:

Sequential work on shared files - Don't parallelize issues that touch the same core files
Deployment in CI - Just added via PR #56
Upfront requirements - Spend more time specifying before creating issues
Smoke tests - Add automated E2E tests or manual verification step
Smaller, focused issues - Break down by file/component, not by feature
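
As a rough sketch of the smoke-test recommendation, the script below requests a handful of pages on the deployed app and reports any that do not answer with HTTP 200. The base URL and paths are placeholders (the real routes are not listed here), and a status check is only a first line of defense, not a full E2E test.

```python
import urllib.error
import urllib.request
from typing import Callable, Iterable, List, Tuple


def fetch_status(url: str, timeout: float = 10.0) -> int:
    """Return the HTTP status for url, or 0 if the server could not be reached."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status
    except urllib.error.HTTPError as err:
        return err.code  # 4xx/5xx responses arrive as exceptions
    except OSError:
        return 0  # DNS failure, refused connection, timeout, ...


def smoke_test(
    base_url: str,
    paths: Iterable[str],
    fetch: Callable[[str], int] = fetch_status,
) -> List[Tuple[str, int]]:
    """Return (path, status) for every path that did not answer 200."""
    failures: List[Tuple[str, int]] = []
    for path in paths:
        status = fetch(base_url.rstrip("/") + path)
        if status != 200:
            failures.append((path, status))
    return failures
```

Run after the deploy step, an empty result from something like `smoke_test("https://example.com", ["/"])` means every checked page loaded; any entry in the list should fail the pipeline. The `fetch` parameter is there so the check can be exercised without network access.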

Should I create issues to address these process improvements?