1.6 KiB
1.6 KiB
Performance Optimization
Model Selection Strategy
Haiku 4.5 (90% of Sonnet capability, 3x cost savings):
- Lightweight agents with frequent invocation
- Pair programming and code generation
- Worker agents in multi-agent systems
Sonnet 4.6 (Best coding model):
- Main development work
- Orchestrating multi-agent workflows
- Complex coding tasks
Opus 4.5 (Deepest reasoning):
- Complex architectural decisions
- Maximum reasoning requirements
- Research and analysis tasks
Context Window Management
Avoid last 20% of context window for:
- Large-scale refactoring
- Feature implementation spanning multiple files
- Debugging complex interactions
Lower context sensitivity tasks:
- Single-file edits
- Independent utility creation
- Documentation updates
- Simple bug fixes
Extended Thinking + Plan Mode
Extended thinking is enabled by default, reserving up to 31,999 tokens for internal reasoning.
Control extended thinking via:
- Toggle: Option+T (macOS) / Alt+T (Windows/Linux)
- Config: Set
alwaysThinkingEnabledin~/.claude/settings.json - Budget cap:
export MAX_THINKING_TOKENS=10000 - Verbose mode: Ctrl+O to see thinking output
For complex tasks requiring deep reasoning:
- Ensure extended thinking is enabled (on by default)
- Enable Plan Mode for structured approach
- Use multiple critique rounds for thorough analysis
- Use split role sub-agents for diverse perspectives
Build Troubleshooting
If build fails:
- Use build-error-resolver agent
- Analyze error messages
- Fix incrementally
- Verify after each fix