IUM: MCP Symbol Indexer Cuts AI Agent Token Usage by 15.9x vs grep

IUM is a two-pass AST walker that builds an SQLite database of every symbol event — definitions, call sites, mutations — then exposes the index over the Model Context Protocol (MCP) as tools for Claude Code, Cursor, or any MCP client. Instead of grep dumping thousands of lines into context, agents get exact file:line coordinates, call graph tracing, and semantic search.
Why this exists
Claude Code was burning context windows by making repeated grep calls across unfamiliar codebases — reading dozens of files to find a single function. IUM replaces that with a prebuilt index queried via MCP.
Benchmark: token cost
Tested against the DataFusion codebase (1,538 files, Rust). Four equivalent queries were run with grep versus IUM:
- Grep: 82,645 tokens
- IUM: 5,190 tokens
- Ratio: 15.9x fewer tokens
Caveat from the author: grep wins on raw execution speed. IUM wins on token cost at volume. For one-off scripts, use grep. For AI agents making thousands of calls, the math favors IUM.
Technical details
- Two-pass AST walk → SQLite matrix of symbol events
- Exposes MCP tools for trace, search, and coordinate lookup
- Everything runs locally — your code never leaves your machine
- Supported languages: Rust, Python, TypeScript, CSS, HTML
Usage
Install from copecode.dev. Works with Claude Code, Cursor, or any MCP client out of the box.
📖 Read the full source: r/ClaudeAI
👀 See Also

Benchmark Results: When to Use Claude Opus with Codex vs. Pure Opus for Code Generation
A controlled benchmark tested the 'Plan with Opus, Execute with Codex' approach across three real coding tasks. Results show a cost crossover at approximately 600 lines of code, with specific recommendations based on project size.

Open Source Dashboard Reveals Actual Claude Code Compute Costs
A developer reverse-engineered Claude Code's rate limit formula to build a local dashboard that shows real-time usage percentage, actual dollar costs, burn rate, peak hours, and which skills/hooks are firing. The tool revealed a $100/month plan consumed $13,286 in equivalent API compute in one month.

Krasis LLM Runtime Shows 8.9x Prefill and 4.7x Decode Speed Improvements Over Llama.cpp
Krasis LLM runtime now runs both prefill and decode entirely on GPU with different optimization strategies, achieving 8.9x faster prefill and 4.7x faster decode than llama.cpp on Qwen3.5-122B with a single 5090 GPU.

Vibeyard: Open-Source Dashboard That Launches Claude Sessions from PRs, Issues, and Kanban Cards
Vibeyard is an open-source (MIT) home screen with draggable widgets for PRs, issues, kanban, and Claude sessions. Click any card to spawn a pre-scoped Claude Code session for review, fix planning, or resumption.