What to Do When Token Consumption Is Very High?
If a single window is consuming more than $5 worth of tokens per hour, the cause is usually an excessively long conversation context, frequent reading of large files, or suboptimal usage patterns.
Recommended Reading: For detailed tips on saving tokens, see How to Save Tokens
Common Causes
- Too many conversation turns - Going a long time without resetting the conversation lets the context grow larger with every turn
- Frequently reading large files - Each request includes a large amount of file content
- Using a high-consumption model - For example, the Opus model consumes several times more tokens than Sonnet
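To see why long conversations get expensive, the rough sketch below adds up the input tokens sent over a session. It is a simplified model, not a description of how billing actually works: it assumes every request resends the whole conversation history, that each turn adds a fixed number of tokens, and that a reset empties the context completely. The function name, the 2,000 tokens per turn figure, and the reset interval are all illustrative assumptions.

```python
def cumulative_input_tokens(turns, tokens_per_turn=2_000, reset_every=None):
    """Total input tokens sent over `turns` requests (simplified model).

    Assumptions (illustrative only): the full history is resent with each
    request, every turn adds `tokens_per_turn` tokens, and a reset drops
    the context back to zero.
    """
    total = 0
    context = 0
    for turn in range(1, turns + 1):
        context += tokens_per_turn   # history grows by one turn
        total += context             # the whole context goes out with the request
        if reset_every and turn % reset_every == 0:
            context = 0              # simulated conversation reset
    return total


no_reset = cumulative_input_tokens(50)
with_reset = cumulative_input_tokens(50, reset_every=10)
print(f"50 turns, never reset:       {no_reset:,} input tokens")
print(f"50 turns, reset every 10:    {with_reset:,} input tokens")
```

Under these assumptions, 50 turns without a reset send 2,550,000 input tokens, while resetting every 10 turns sends 550,000. The total grows roughly with the square of the number of turns, which is why a long-running conversation can dominate consumption even when each individual message is short.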
Quick Solutions
- Use the /clear command regularly to clear conversation history
- Use the /compact command to compress the current context
- Choose your model wisely - Sonnet is sufficient for everyday tasks
- Avoid reading too many files at once