Token Efficiency Patterns
Concrete strategies for Claude Code to use fewer tokens while maintaining quality.
Tool Usage
- Parallel tool calls: Always batch independent reads, globs, and greps in a single message
- Glob before Read: Find the right file first instead of reading multiple wrong files
- Grep with head_limit: Use
head_limitto avoid reading huge search results - Avoid redundant reads: If you just wrote or edited a file, don’t re-read it to verify
- Use memory files: Check
MEMORY.mdbefore re-exploring the codebase for info you’ve seen before
Response Style
- Be concise: Short answers save tokens. Bullet points over paragraphs
- Don’t repeat the question: Jump straight to the answer
- Don’t explain what you’re about to do: Just do it
- Skip confirmation of obvious actions: “Done.” is enough
- Don’t list files you read: The user can see tool calls
Skills & Automation
- Use skills for recurring tasks: A skill call is cheaper than re-deriving a workflow
- Create skills proactively: If a task pattern repeats 3+ times, propose a skill
- Cache knowledge in vault: Write findings to
20_knowledge/so future sessions don’t re-research
Codebase Navigation
- Use memory for key paths: Don’t re-explore project structure every session
- Targeted searches: Use specific glob patterns instead of broad
**/*searches - Read with offset/limit: For large files, read only the relevant section
- Use Explore agent for deep research: Keeps main context clean
Anti-Patterns to Avoid
- Reading a file, then reading it again in the same session
- Using Bash for
cat,grep,findwhen dedicated tools exist (wastes tokens on tool overhead) - Explaining every step before doing it
- Asking for confirmation on low-risk, reversible actions
- Re-reading CLAUDE.md or memory files mid-session (they’re already in context)
- Creating sub-agents for simple, single-query tasks