Analyzing AI Coding Tools: Dissecting 3,177 API Calls

The recent analysis conducted on four AI coding tools—Claude Code Opus 4.6, Claude Code Sonnet 4.5, Codex GPT-5.3, and Gemini 2.5 Pro—highlights substantial differences in managing API call context windows. Using the Context Lens tracer, the study intercepted 3,177 API calls to evaluate the tools’ efficiency and strategy in handling the context window when tasked with bug fixes in an Express.js environment.
Each coding tool tackled a specific bug—an incorrectly reordered null check in res.send(). Opus, Sonnet, Codex, and Gemini were tasked with identifying and fixing the bug, followed by running the test suite to verify the fix. They all succeeded, albeit with varying approaches and resources.
Claude Code Opus 4.6 consistently used around 23K to 27K tokens, mainly consisting of tool definitions (69% of the context). This indicates a reliance on re-sending these definitions due to the architecture, causing significant caching overhead. Codex (GPT-5.3) presented a wider range from 29.3K to 47.2K tokens, mostly tool results (72%), providing more variability depending on test command specificity. Sonnet, with similar variance, mixed definitions and results more evenly.
Gemini stands out due to its disproportionate use of tokens, peaking at 350.5K, utilizing almost exclusively tool results (96%), exploiting its large 1M context window. Despite a lower cost per token, Gemini’s inconsistent and expansive usage pattern without convergence across runs indicates a unique, albeit less efficient strategy.
These findings illustrate considerable disparities in how AI coding tools manage context windows, impacting both performance and cost efficiency. Developers should weigh token usage strategies when choosing the appropriate tool for their needs, particularly for tasks involving iterative changes or extensive project histories.
📖 Read the full source: HN LLM Tools
👀 See Also

Eqho: Local Voice-to-Text App for Claude Code Sessions
Eqho is a free, open-source voice-to-text app that uses OpenAI's Whisper model locally to type spoken input into any focused application. Currently Windows-only with command-line setup required.

Atuin v18.13 adds AI shell commands, faster search daemon, and PTY proxy
Atuin v18.13 introduces three major features: an AI-powered English-to-Bash helper called atuin ai, a faster search daemon with in-memory indexing, and a PTY proxy called hex that enables popup rendering without clearing terminal output.

Gemma 4 E2B Tested as Multi-Agent Coordinator in TypeScript Framework
A developer tested Gemma 4 E2B as a coordinator in a multi-agent setup using the open-multi-agent TypeScript framework. The model successfully decomposed tasks into JSON, assigned agents, called tools like bash and file operations, and synthesized results.

TOON MCP server reduces tool result tokens by 30-60% in OpenClaw
An MCP server that compresses structured JSON tool results into the TOON format can cut token usage by 30-60% for tabular data like database queries and API responses, helping delay context window compaction in OpenClaw sessions.