Slash Agent Start-Up Tokens by 60%: Clean Up Your Bot's Workspace

✍️ OpenClawRadar📅 Published: May 13, 2026🔗 Source

A developer on r/openclaw shared a practical method to drastically reduce start-up token consumption for LLM-based coding agents. The approach: run an LLM over all markdown files in the workspace root to identify and eliminate bloat and duplication.

Key Actions

Reviewed every markdown file at the workspace root for redundant or overly verbose content (e.g., change logs, duplicated memories, user info).
Structured the remaining files similarly to a memory system for consistency.
Used a CLI tool (codex) instead of going through the agent to keep the process objective.
Created a TOOLS file with quick notes and a separate tools/ folder with per-tool details that the agent can pull on demand.
Added new files like voice to maintain a consistent tone across different models.

Results

Start-up tokens dropped from 80k to 31k—a 61% reduction. The workspace became leaner and the agent more responsive without losing essential context.

Why This Matters

High start-up tokens translate to slower response times and higher costs. Regularly auditing workspace files with an LLM—outside the agent loop—prevents bloat accumulation and keeps token budgets under control.

Who This Is For

Developers running long-lived AI coding agents who want to cut token waste and improve agent performance without sacrificing context quality.

📖 Read the full source: r/openclaw

👀 See Also

Tips

MTP Acceptance Rate: 50% Threshold Determines Speculative Decoding Benefit

MTP (Multi-Token Prediction) via speculative decoding on Gemma-4 26B shows benefit only when draft token acceptance rate exceeds 50% — based on mlx-vlm benchmarks on M4 Max Studio.

May 9, 2026, 12:17 AM UTC

OpenClawRadar

Tips

TLS Interception by Antivirus Breaks Claude Desktop’s Connection; Workaround with AV Exclusions

Antivirus TLS inspection on bridge.claudeusercontent.com causes Cowork (Claude desktop companion) to fail with 'Claude in Chrome is not connected'. Fix: add *.claudeusercontent.com and *.anthropic.com to AV HTTPS exclusions. Node.js --use-system-ca would prevent this.

May 10, 2026, 06:16 PM UTC

OpenClawRadar

Tips

Eight Prompting Techniques That Improve Claude Output Quality

A Reddit user shares eight specific prompting techniques that consistently improved their Claude output quality, including commands like "Think through every layer before answering" and "Find the 20% of actions that drive 80% of results."

Apr 13, 2026, 09:45 PM UTC

OpenClawRadar

Tips

Using Light-Context Cron Jobs for Daily OpenClaw Tips

A user shares their setup of a daily cron job that posts OpenClaw tips to a Nextcloud Talk channel, highlighting the --light-context flag to reduce bootstrap overhead for isolated tasks.

Apr 3, 2026, 09:45 AM UTC

OpenClawRadar