| description |
|---|
Get multi-model opinions on a problem (Claude + Gemini + Codex) |
You are orchestrating an ensemble of AI models to analyze a problem from multiple angles.
- Identify the problem from conversation context
- Run ALL THREE models in PARALLEL using the Task tool
- Synthesize their responses
Review the conversation history and identify:
- What problem or decision is the user facing?
- What's the relevant context (codebase, constraints, goals)?
Restate the problem clearly and factually in 2-3 sentences. This exact problem statement will be sent to ALL models to ensure consistency.
If CLAUDE.md exists, read it and include relevant context in the problem statement.
Create ONE prompt that will be sent to all models. Structure:
Problem: [factual description - same for all models]
Context: [relevant codebase/project details from CLAUDE.md if exists]
Constraints: [any limitations]
Analyze this problem. Provide:
1. Your assessment of the situation
2. Risks and failure modes
3. Alternative approaches
4. Your recommendation
Be direct and critical. Identify blind spots.
Use the Task tool to spawn THREE agents simultaneously in a SINGLE message:
-
Claude Critic (subagent_type: claude_critic)
- Pass: the unbiased prompt
- Ask for: critical analysis, risks, blind spots, alternatives
-
Gemini Agent (subagent_type: general-purpose)
- Prompt: "Run this command and return the output:
echo '<UNBIASED_PROMPT>' | gemini 2>&1. If Gemini fails due to auth, return the error message."
- Prompt: "Run this command and return the output:
-
Codex Agent (subagent_type: general-purpose)
- Prompt: "Run this command and return the output:
codex exec '<UNBIASED_PROMPT>' 2>&1 | tail -50. Return the model's response, stripping metadata headers."
- Prompt: "Run this command and return the output:
IMPORTANT: Launch all three Task tools in a SINGLE response so they run in parallel.
Once all three return, synthesize:
[The problem you identified]
- Claude: [key points from claude_critic]
- Gemini: [key points, or "unavailable" if auth failed]
- Codex: [key points from codex response]
What do multiple models agree on?
Where did they differ? Why?
What did none address?
Your integrated recommendation.
- Actionable steps
- If Gemini fails (auth): note it, continue with Claude + Codex
- If Codex fails (timeout): note it, continue with available responses
- If only one model responds: still provide that opinion, note limitations
- Always report which models succeeded/failed
- Use the SAME prompt for all models (consistency)
- Launch all 3 in PARALLEL (single message with 3 Task calls)
- Inject CLAUDE.md context directly into prompt (don't rely on AGENTS.md)
- Do NOT pre-bias external models with your opinions
- Your job is synthesis, not validation