claude-code/.claude/rules/common/performance.md

# Performance Optimization

## Model Selection Strategy

**Haiku 4.5** (90% of Sonnet capability, 3x cost savings):
- Lightweight agents with frequent invocation
- Pair programming and code generation
- Worker agents in multi-agent systems

**Sonnet 4.6** (Best coding model):
- Main development work
- Orchestrating multi-agent workflows
- Complex coding tasks

**Opus 4.5** (Deepest reasoning):
- Complex architectural decisions
- Maximum reasoning requirements
- Research and analysis tasks

## Context Window Management

Avoid last 20% of context window for:
- Large-scale refactoring
- Feature implementation spanning multiple files
- Debugging complex interactions

Lower context sensitivity tasks:
- Single-file edits
- Independent utility creation
- Documentation updates
- Simple bug fixes

## Extended Thinking + Plan Mode

Extended thinking is enabled by default, reserving up to 31,999 tokens for internal reasoning.

Control extended thinking via:
- **Toggle**: Option+T (macOS) / Alt+T (Windows/Linux)
- **Config**: Set `alwaysThinkingEnabled` in `~/.claude/settings.json`
- **Budget cap**: `export MAX_THINKING_TOKENS=10000`
- **Verbose mode**: Ctrl+O to see thinking output

For complex tasks requiring deep reasoning:
1. Ensure extended thinking is enabled (on by default)
2. Enable **Plan Mode** for structured approach
3. Use multiple critique rounds for thorough analysis
4. Use split role sub-agents for diverse perspectives

## Build Troubleshooting

If build fails:
1. Use **build-error-resolver** agent
2. Analyze error messages
3. Fix incrementally
4. Verify after each fix
chinese 2026-02-27 13:44:09 +00:00			`# Performance Optimization`

			`## Model Selection Strategy`

			`Haiku 4.5 (90% of Sonnet capability, 3x cost savings):`
			`- Lightweight agents with frequent invocation`
			`- Pair programming and code generation`
			`- Worker agents in multi-agent systems`

			`Sonnet 4.6 (Best coding model):`
			`- Main development work`
			`- Orchestrating multi-agent workflows`
			`- Complex coding tasks`

			`Opus 4.5 (Deepest reasoning):`
			`- Complex architectural decisions`
			`- Maximum reasoning requirements`
			`- Research and analysis tasks`

			`## Context Window Management`

			`Avoid last 20% of context window for:`
			`- Large-scale refactoring`
			`- Feature implementation spanning multiple files`
			`- Debugging complex interactions`

			`Lower context sensitivity tasks:`
			`- Single-file edits`
			`- Independent utility creation`
			`- Documentation updates`
			`- Simple bug fixes`

			`## Extended Thinking + Plan Mode`

			`Extended thinking is enabled by default, reserving up to 31,999 tokens for internal reasoning.`

			`Control extended thinking via:`
			`- Toggle: Option+T (macOS) / Alt+T (Windows/Linux)`
			- Config: Set `alwaysThinkingEnabled` in `~/.claude/settings.json`
			- Budget cap: `export MAX_THINKING_TOKENS=10000`
			`- Verbose mode: Ctrl+O to see thinking output`

			`For complex tasks requiring deep reasoning:`
			`1. Ensure extended thinking is enabled (on by default)`
			`2. Enable Plan Mode for structured approach`
			`3. Use multiple critique rounds for thorough analysis`
			`4. Use split role sub-agents for diverse perspectives`

			`## Build Troubleshooting`

			`If build fails:`
			`1. Use build-error-resolver agent`
			`2. Analyze error messages`
			`3. Fix incrementally`
			`4. Verify after each fix`