Slash Agent Start-Up Tokens by 60%: Clean Up Your Bot's Workspace
A developer on r/openclaw shared a practical method to drastically reduce start-up token consumption for LLM-based coding agents. The approach: run an LLM over all markdown files in the workspace root to identify and eliminate bloat and duplication.
Key Actions
- Reviewed every markdown file at the workspace root for redundant or overly verbose content (e.g., change logs, duplicated memories, user info).
- Structured the remaining files similarly to a memory system for consistency.
- Used a CLI tool (
codex) instead of going through the agent to keep the process objective. - Created a
TOOLSfile with quick notes and a separatetools/folder with per-tool details that the agent can pull on demand. - Added new files like
voiceto maintain a consistent tone across different models.
Results
Start-up tokens dropped from 80k to 31k—a 61% reduction. The workspace became leaner and the agent more responsive without losing essential context.
Why This Matters
High start-up tokens translate to slower response times and higher costs. Regularly auditing workspace files with an LLM—outside the agent loop—prevents bloat accumulation and keeps token budgets under control.
Who This Is For
Developers running long-lived AI coding agents who want to cut token waste and improve agent performance without sacrificing context quality.
📖 Read the full source: r/openclaw
👀 See Also

MTP Acceptance Rate: 50% Threshold Determines Speculative Decoding Benefit
MTP (Multi-Token Prediction) via speculative decoding on Gemma-4 26B shows benefit only when draft token acceptance rate exceeds 50% — based on mlx-vlm benchmarks on M4 Max Studio.

TLS Interception by Antivirus Breaks Claude Desktop’s Connection; Workaround with AV Exclusions
Antivirus TLS inspection on bridge.claudeusercontent.com causes Cowork (Claude desktop companion) to fail with 'Claude in Chrome is not connected'. Fix: add *.claudeusercontent.com and *.anthropic.com to AV HTTPS exclusions. Node.js --use-system-ca would prevent this.

Eight Prompting Techniques That Improve Claude Output Quality
A Reddit user shares eight specific prompting techniques that consistently improved their Claude output quality, including commands like "Think through every layer before answering" and "Find the 20% of actions that drive 80% of results."

Using Light-Context Cron Jobs for Daily OpenClaw Tips
A user shares their setup of a daily cron job that posts OpenClaw tips to a Nextcloud Talk channel, highlighting the --light-context flag to reduce bootstrap overhead for isolated tasks.