56 lines
1.6 KiB
Markdown
56 lines
1.6 KiB
Markdown
|
|
# Performance Optimization
|
||
|
|
|
||
|
|
## Model Selection Strategy
|
||
|
|
|
||
|
|
**Haiku 4.5** (90% of Sonnet capability, 3x cost savings):
|
||
|
|
- Lightweight agents with frequent invocation
|
||
|
|
- Pair programming and code generation
|
||
|
|
- Worker agents in multi-agent systems
|
||
|
|
|
||
|
|
**Sonnet 4.6** (Best coding model):
|
||
|
|
- Main development work
|
||
|
|
- Orchestrating multi-agent workflows
|
||
|
|
- Complex coding tasks
|
||
|
|
|
||
|
|
**Opus 4.5** (Deepest reasoning):
|
||
|
|
- Complex architectural decisions
|
||
|
|
- Maximum reasoning requirements
|
||
|
|
- Research and analysis tasks
|
||
|
|
|
||
|
|
## Context Window Management
|
||
|
|
|
||
|
|
Avoid last 20% of context window for:
|
||
|
|
- Large-scale refactoring
|
||
|
|
- Feature implementation spanning multiple files
|
||
|
|
- Debugging complex interactions
|
||
|
|
|
||
|
|
Lower context sensitivity tasks:
|
||
|
|
- Single-file edits
|
||
|
|
- Independent utility creation
|
||
|
|
- Documentation updates
|
||
|
|
- Simple bug fixes
|
||
|
|
|
||
|
|
## Extended Thinking + Plan Mode
|
||
|
|
|
||
|
|
Extended thinking is enabled by default, reserving up to 31,999 tokens for internal reasoning.
|
||
|
|
|
||
|
|
Control extended thinking via:
|
||
|
|
- **Toggle**: Option+T (macOS) / Alt+T (Windows/Linux)
|
||
|
|
- **Config**: Set `alwaysThinkingEnabled` in `~/.claude/settings.json`
|
||
|
|
- **Budget cap**: `export MAX_THINKING_TOKENS=10000`
|
||
|
|
- **Verbose mode**: Ctrl+O to see thinking output
|
||
|
|
|
||
|
|
For complex tasks requiring deep reasoning:
|
||
|
|
1. Ensure extended thinking is enabled (on by default)
|
||
|
|
2. Enable **Plan Mode** for structured approach
|
||
|
|
3. Use multiple critique rounds for thorough analysis
|
||
|
|
4. Use split role sub-agents for diverse perspectives
|
||
|
|
|
||
|
|
## Build Troubleshooting
|
||
|
|
|
||
|
|
If build fails:
|
||
|
|
1. Use **build-error-resolver** agent
|
||
|
|
2. Analyze error messages
|
||
|
|
3. Fix incrementally
|
||
|
|
4. Verify after each fix
|