Benchmark Shows CLI Tool Reduces Claude Code Token Costs by 32% Through Structural Navigation

A developer has open-sourced a CLI tool called Scope that provides Claude Code agents with structural code navigation capabilities, similar to IDE features like "find references" and "go to definition." The tool was built in Rust using tree-sitter and SQLite.
What the Tool Does
The tool gives agents commands that answer requests such as:
- "show me a 180-token summary of this 6,000-token class"
- "search by what code does, not what it's named"
It currently supports TypeScript and C#, with the goal of helping agents navigate code more efficiently than their default grep-based approach.
Benchmark Methodology
The developer executed 54 automated runs on Sonnet 4.6 against a 181-file C# codebase, covering:
- 6 task categories
- 3 conditions: baseline, tool available, architecture preloaded into CLAUDE.md
- 3 repetitions each
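The run count follows directly from the design: 6 categories × 3 conditions × 3 repetitions = 54. A minimal sketch of that matrix is below; the category labels are placeholders and the condition identifiers are shorthand invented here, not names taken from the benchmark runner.

```python
from itertools import product

# Placeholder labels: the article gives only the counts and describes the conditions.
CATEGORIES = [f"category_{i}" for i in range(1, 7)]  # 6 task categories
CONDITIONS = ["baseline", "tool_available", "architecture_preloaded"]
REPETITIONS = range(1, 4)  # 3 repetitions each


def run_matrix():
    """Return every (category, condition, repetition) combination."""
    return list(product(CATEGORIES, CONDITIONS, REPETITIONS))


runs = run_matrix()
print(len(runs))  # 54
```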
Every run was captured as full NDJSON so that token usage could be decomposed into fresh input, cache creation, cache reads, and output. The benchmark runner and telemetry capture are included in the repository.
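As a rough illustration of that decomposition, the sketch below sums four usage buckets across an NDJSON capture. The event layout is an assumption: it supposes each event may carry a `message.usage` object with the field names used in Anthropic API responses (`input_tokens`, `cache_creation_input_tokens`, `cache_read_input_tokens`, `output_tokens`). The schema the benchmark runner actually records is not described here, and a real capture may need de-duplication of repeated events.

```python
import json

# Assumed mapping from API-style usage fields to the four buckets named above.
USAGE_FIELDS = {
    "input_tokens": "fresh_input",
    "cache_creation_input_tokens": "cache_creation",
    "cache_read_input_tokens": "cache_read",
    "output_tokens": "output",
}


def decompose_tokens(ndjson_text):
    """Sum token usage per bucket across all events in an NDJSON capture."""
    totals = {bucket: 0 for bucket in USAGE_FIELDS.values()}
    for line in ndjson_text.splitlines():
        line = line.strip()
        if not line:
            continue  # tolerate blank lines between records
        event = json.loads(line)
        # Assumption: usage lives under event["message"]["usage"] when present.
        message = event.get("message")
        usage = message.get("usage") if isinstance(message, dict) else None
        if not isinstance(usage, dict):
            continue  # events without usage contribute nothing
        for field, bucket in USAGE_FIELDS.items():
            totals[bucket] += usage.get(field) or 0
    return totals
```

Called on the text of one run's capture file, `decompose_tokens` returns a dictionary with one total per bucket.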
Key Findings
Contrary to expectations, agents with the tool read more files than baseline agents (6.8 to 9.7 on average), yet they made 67% more code edits per session and finished in fewer turns.
The savings came from shorter conversations. Approximately 90% of token cost lives in cache accumulation, so fewer turns mean less cached context to write and re-read.
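A toy model can show why turn count matters so much. It assumes, as a simplification, that every turn re-reads the entire conversation prefix from cache and then appends a fixed amount of new content. The function name and all numbers are hypothetical and are not taken from the benchmark.

```python
def cumulative_cache_reads(turns, base_context, tokens_per_turn):
    """Total cache-read tokens if every turn re-reads all prior context."""
    total = 0
    context = base_context
    for _ in range(turns):
        total += context            # the whole prefix is read from cache this turn
        context += tokens_per_turn  # this turn's new content joins the prefix
    return total


# Hypothetical sizes: 1,000-token starting context, 100 new tokens per turn.
print(cumulative_cache_reads(10, 1_000, 100))  # 14500
print(cumulative_cache_reads(20, 1_000, 100))  # 39000
```

Under these assumptions, doubling the number of turns more than doubles the cache-read total, because each additional turn re-reads a longer prefix than the one before it.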
Overall results:
- 32% lower cost per task
- 2x navigation efficiency (measured as navigation actions per edit)
- Navigation-to-edit ratio improved from 25:1 (baseline) to 13:1 (with tool) and 12:1 (with architecture preloaded)
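The "2x" figure can be checked against the reported ratios. The sketch below assumes navigation efficiency is simply navigation actions divided by edits; how the benchmark classifies an action as navigation is not specified here, and the function names are illustrative.

```python
def nav_to_edit_ratio(nav_actions, edits):
    """Navigation actions spent per code edit (lower is better)."""
    if edits <= 0:
        raise ValueError("edits must be positive")
    return nav_actions / edits


def efficiency_gain(baseline_ratio, new_ratio):
    """How many times fewer navigation actions per edit than baseline."""
    return baseline_ratio / new_ratio


print(round(efficiency_gain(25, 13), 2))  # 1.92 (tool available)
print(round(efficiency_gain(25, 12), 2))  # 2.08 (architecture preloaded)
```

Both conditions land close to a factor of two, which matches the rounded headline number.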
Results varied by task type:
- Bug fixes: -62% cost
- New features: -49% cost
- Cross-cutting changes: -46% cost
- Discovery and refactoring tasks: no advantage (baseline agents already navigate these well)
Important Caveats
The developer notes several limitations:
- With n=6 paired observations, p-values do not reach 0.05 (the direction is consistent, but the sample is too small for statistical significance)
- Benchmarked on C# only so far (TypeScript support exists but hasn't been benchmarked yet)
- Cost calculation uses current Sonnet 4.6 API rates: fresh input $3/M, cache write $3.75/M, cache read $0.30/M, output $15/M
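With the rates quoted above, the dollar cost of a run is a weighted sum over the four token buckets. The sketch below is one way to write that calculation; the bucket names and function names are chosen here for illustration, and the token counts in the example are hypothetical.

```python
# Dollars per million tokens, as quoted in the caveats above.
RATES_PER_MILLION = {
    "fresh_input": 3.00,
    "cache_creation": 3.75,
    "cache_read": 0.30,
    "output": 15.00,
}


def cost_breakdown(tokens):
    """Dollar cost per bucket for a dict of token counts."""
    return {
        bucket: tokens.get(bucket, 0) * rate / 1_000_000
        for bucket, rate in RATES_PER_MILLION.items()
    }


def run_cost(tokens):
    """Total dollar cost of one run."""
    return sum(cost_breakdown(tokens).values())


# Hypothetical run with one million tokens in every bucket.
example = {bucket: 1_000_000 for bucket in RATES_PER_MILLION}
print(round(run_cost(example), 2))  # 22.05
```

Because cache reads are priced at a tenth of fresh input, a large share of cost in cache buckets implies a much larger share of raw token volume there.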
The tool is open source and available at github.com/rynhardt-potgieter/scope for developers who want to experiment with improving agent token efficiency.
📖 Read the full source: r/ClaudeAI