OpenClaw token usage investigation reveals configuration issues

Token usage investigation reveals configuration problems
A developer reported burning through their OpenAI Codex weekly subscription in about 1.5 days while using OpenClaw for daily AI news updates. They used Claude Code to perform a deep review of their setup and identified several configuration issues causing excessive token consumption.
Key findings from the investigation
The investigation revealed multiple configuration problems:
- Telegram requireMention setting: All group chats had
requireMention: false, meaning every message triggered the agent. Setting this totruemakes bots only fire on @mentions. - Web fetch defaults: The
readabilitysetting was off by default, causing fetches to return raw CSS/JS content even on failed requests. 21 out of 21web_fetchcalls had failed but still dumped page shells into context. - Model inheritance: Per-agent model overrides don't inherit from defaults. Changing
agents.defaults.model.primarytogpt-5.4-minididn't cascade to four agents with hardcoded overrides togpt-5.4. - Orphan session files: Found 41
.reset.*/.deleted.*transcript files (~56MB) that nothing references anymore. - No guardrails on web research: A web-research agent (responsible for 78% of token spend) had no fetch budget, no stopping rules, and no "check local memory before searching" pattern.
Configuration fixes
The investigation recommended these specific configuration changes:
openclaw config set tools.web.fetch.readability true
openclaw config set tools.web.fetch.maxChars 12000
openclaw config set tools.web.fetch.timeoutSeconds 15
openclaw config set tools.web.fetch.cacheTtlMinutes 30For session cleanup:
openclaw sessions cleanup --all-agents --enforce --fix-missingQuick checklist
- Check
requireMentionon all group chats - Enable
tools.web.fetch.readabilityand setmaxChars/timeoutSeconds - Audit per-agent model overrides — defaults don't cascade
- Run
openclaw sessions cleanup --all-agents --dry-run --fix-missing - Add fetch budgets and stopping rules to research-heavy agents
The developer noted that Claude Code was particularly useful for forensic analysis: it grep'd through 2 days of gateway logs, counted 510 gpt-5.4 references vs 23 gpt-5.4-mini in active sessions, found 198 CSS variable references in failed fetch results, and identified 56% redundancy in startup files.
📖 Read the full source: r/openclaw
👀 See Also

Don't Assume Expensive Models Are Better: Case Study Shows 13x Cost Savings by Testing
User replaced GPT-5.4 with Gemini 3.1 Flash Lite on a classification task, achieving identical 85% accuracy at 1/13th the cost after running evals on 21 models.

Claude Code Token Waste Fix: Disable Attribution Header for Better Cache Hits
Setting CLAUDE_CODE_ATTRIBUTION_HEADER=false in your shell configuration can improve Claude Code's cross-session prompt cache hit rate from 48% to 99.98%, reducing system prompt processing costs by 7x per session.

Vague Prompts Are the Real Problem, Not the Model — 50-Run Test Shows Prompt Quality Trumps Model Choice
A Reddit user ran the same ten prompts through ChatGPT 4, Claude Sonnet, and Gemini 1.5 Pro five times each (150 outputs total) and found that all three models produced similarly usable or similarly generic results — the deciding factor was prompt specificity, not the model.

Claude's /btw Command Enables Parallel Communication During Tasks
Claude AI now supports a /btw command that lets users communicate with the AI while it's actively working on a task, allowing questions, additional instructions, or clarifications without interrupting the current workflow.