mcp-optimizer reduces token waste from idle MCP servers in Claude Code

mcp-optimizer is a plugin that addresses token waste in Claude Code when using Model Context Protocol (MCP) servers. The problem: every connected MCP server loads its full tool schema into every conversation, even if tools are never used. With three servers, this can waste approximately 6,500+ tokens per session.
Key features
mcp-doctor– health check your MCP serversmcp-audit– see which tools you actually use versus which waste tokensmcp-optimize– generate a project-local configuration with only what you needmcp-to-skills– convert MCP tools into on-demand local Skills (zero idle cost)
The core concept: Skills only load when invoked, while MCP schemas load every time. This optimization reduces unnecessary token consumption.
Installation
Install via the plugin marketplace:
/plugin marketplace add choam2426/mcp-optimizer
/plugin install mcp-optimizerThe plugin is available on GitHub at https://github.com/choam2426/mcp-optimizer.
📖 Read the full source: r/ClaudeAI
👀 See Also

Reddit User Shares AI Tool for Gathering Financial Account Balances
A Reddit post on r/openclaw presents an AI agent designed to streamline the collection of financial account balances using Python. Users discuss automation potential via custom scripts leveraging APIs like Plaid.

Meera: A Fully Offline AI Assistant for Linux Gnome Built on Qwen3.5-2B
Meera is an offline AI assistant for Gnome Desktop that uses Qwen3.5-2B-Q4_K_M (1.2 GB) and llama-cpp with Vulkan support. It leverages a second tiny embedding model for tool selection and RAG, avoiding prompt embedding bloat. Works on Ubuntu 24.04 with RTX 5090 and Fedora Silverblue on Intel i3.

Auto-Fix System Uses Claude Code Headless to Detect and Fix Production Errors
A developer built an automated production error-fixing system using Claude Code CLI in headless mode. The system detects errors from logs, creates isolated git worktrees for each issue, prompts Claude to write fixes, and requires manual approval via Telegram before creating PRs.

User-built PTC for Claude Code shows 40-65% token savings on analysis tasks, not code writing
A developer built a local PTC implementation called Thalamus for Claude Code and analyzed 79 real sessions, finding 40-65% token savings on analysis tasks but near-zero savings on code-writing tasks. The agent used execute() primarily for general Python computation rather than batching tool calls.