API routing cuts OpenClaw Max usage cost by 85%: $200/mo to $30/mo

One OpenClaw Max user ran the numbers and found that only about 15% of their daily token usage actually required a frontier model like Opus. The rest (file reads, git status, project context scanning, test generation, scaffolding, formatting, renaming, simple refactors) could be handled by Sonnet or even cheaper models.
Token usage breakdown
- ~40%: file reads, git status, project context scanning (no need for Opus)
- ~25%: test generation, scaffolding, boilerplate (Sonnet handles these equally well)
- ~20%: formatting, renaming, simple refactors (any model works)
- ~15%: actual hard reasoning, cross-file architecture (the only part that needs Opus)
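As a quick check on the breakdown above, the shares can be tallied in a few lines of Python. This is only a sketch: the category labels and the `needs_frontier` flag are illustrative names, not something taken from the original post.

```python
# Token shares from the breakdown above (percent of daily usage).
# Category labels and the needs_frontier flag are illustrative assumptions.
BREAKDOWN = [
    ("context gathering", 40, False),  # file reads, git status, context scanning
    ("boilerplate", 25, False),        # test generation, scaffolding
    ("mechanical edits", 20, False),   # formatting, renaming, simple refactors
    ("hard reasoning", 15, True),      # cross-file architecture
]


def routable_share(breakdown):
    """Return the percent of tokens that do not need a frontier model."""
    return sum(share for _, share, needs_frontier in breakdown if not needs_frontier)


total = sum(share for _, share, _ in BREAKDOWN)
print(f"total: {total}%, routable to cheaper models: {routable_share(BREAKDOWN)}%")
```

The four shares sum to 100%, and the three routine categories together account for the 85% of usage that the user moved off Opus.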
The user switched from the $200/month Max subscription to the API and set up routing rules: Sonnet for routine tasks and Opus only for cross-file reasoning. The monthly bill dropped to approximately $30, an 85% reduction, with no perceived change in output quality because the hard tasks still go to Opus.
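The post does not show the user's actual routing rules, so the following is a hypothetical sketch of what such a rule table might look like. The task-type keys, the `"sonnet"` and `"opus"` labels (placeholders, not real API model identifiers), and the choice to default unknown tasks to the cheaper model are all assumptions.

```python
# Hypothetical routing table: task types map to a model tier.
# Keys and model labels are placeholders, not real API identifiers.
ROUTES = {
    "file_read": "sonnet",
    "git_status": "sonnet",
    "test_generation": "sonnet",
    "formatting": "sonnet",
    "simple_refactor": "sonnet",
    "cross_file_reasoning": "opus",
}

# Assumption: unknown task types fall back to the cheaper model.
DEFAULT_MODEL = "sonnet"


def pick_model(task_type: str) -> str:
    """Return the model tier for a task type, falling back to the default."""
    return ROUTES.get(task_type, DEFAULT_MODEL)


def percent_saved(before: float, after: float) -> float:
    """Percent reduction when a monthly bill goes from `before` to `after`."""
    return (before - after) * 100 / before


print(pick_model("git_status"))            # routine task goes to the cheap tier
print(pick_model("cross_file_reasoning"))  # hard task goes to the frontier tier
print(f"{percent_saved(200, 30):.0f}% saved")
```

Defaulting to the cheaper tier maximizes savings; a more cautious setup could instead default to Opus so that unclassified tasks never lose quality.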
The user notes that the subscription model deliberately obscures this inefficiency: no token breakdown, no per-task cost visibility, just a quota that mysteriously shrinks.
For teams or individuals paying for premium plans, routing via API can yield significant savings without sacrificing performance on the tasks that actually need top-tier models.
📖 Read the full source: r/openclaw