MCP Slim: Local Embedding Search for MCP Tools Reduces Context Bloat

MCP Slim is a proxy tool that addresses MCP's context bloat problem by replacing full tool catalogs with three meta-tools: search, describe, and call. Instead of loading all tool definitions upfront, the LLM searches for what it needs, reducing context window usage by 96%.
Key Details
The tool specifically targets MCP's documented context bloat issue where connecting a few servers can consume 55,000+ tokens on tool definitions before the first message. MCP Slim's main differentiator from other proxies like mcp-compressor and MCProxy is its use of semantic search with a local MiniLM embedding model. This allows it to find relevant tools without keyword overlap - for example, "save a note" can find create_entities.
The tool runs fully offline and requires no API keys. It works with any stdio MCP client including Claude Desktop, Cursor, Cline, Windsurf, and Zed.
Installation and Setup
Installation is done via: npx mcp-slim init
The GitHub repository is available at: https://github.com/dopatools/mcp-slim
The author (/u/OpportunitySpare2441) has offered to answer questions about the architecture or tradeoffs.
📖 Read the full source: r/LocalLLaMA
👀 See Also

Pi Coding Agent with Qwen 35B Q2: Using Filesystem as External Memory and Enforcing Context Guards
A Reddit user built a stack around Pi coding agent with Qwen 35B Q2_K_XL quant that enforces guards — rejects edits over 100 lines, caps thinking blocks at 2000 chars, and monitors context at 65%/80% — treating the filesystem as the model's memory, not the context window.

Open-source CLI uses Claude Haiku to automate Xero expense auditing
A developer has released an open-source Python CLI tool that uses Claude Haiku 4.5 to automate Xero expense auditing. The tool follows a 'deterministic code first, then AI to fill in the gaps' approach, keeping costs to a few cents per audit run.

Claude Skills: 12 Strict Coding Rule Packs for TypeScript, Rust, Swift, Go, JS, Postgres, and Audits
12 markdown files with opinionated, version-aware rules for TS, Rust, Swift, Go, JS, Postgres, security, performance, testing, code review, GitHub standards, and git commits. MIT, free, no signup.

antirez's DS4: Running DeepSeek V4 Flash with 1M Context on Mac Metal and DGX
Redis creator Salvatore Sanfilippo released DS4, a project to run DeepSeek V4 Flash with a 1M context window on Mac Metal hardware and DGX, with OpenAI/Anthropic endpoints for agentic coding tools.