Semble: A Local MCP Server for Claude Code with 98% Token Reduction

✍️ OpenClawRadar📅 Published: April 30, 2026🔗 Source
Semble: A Local MCP Server for Claude Code with 98% Token Reduction
Ad

Semble is an MCP server that lets Claude Code search local codebases efficiently, returning only relevant code chunks instead of full files. It uses a hybrid of static embeddings, BM25, and a code-optimized reranking stack, all running locally on CPU — no API keys, no GPU, no heavy dependencies.

Installation

Install via uvx:

claude mcp add semble -s user -- uvx --from "semble[mcp]" semble

Once installed, Claude Code can search any repo — local or remote — directly.

Ad

Key Details

  • Token reduction: Uses ~98% fewer tokens than the typical grep+read approach.
  • Performance: Indexes any repo in ~250ms, answers queries in ~1.5ms (all on CPU).
  • Quality: Reaches NDCG@10 of 0.854 — 99% of the best transformer hybrid tested, while being ~200x faster.
  • Benchmarked against: grepai, probe, colgrep, and other existing methods.
  • Open source: Available on GitHub under the MinishLab organization.

Who It's For

Developers using Claude Code on large codebases who want to reduce token burn and latency while getting high-quality code search results without external API calls.

📖 Read the full source: r/ClaudeAI

Ad

👀 See Also

Introducing operate.txt: A YAML spec for AI agents navigating SaaS products
Tools

Introducing operate.txt: A YAML spec for AI agents navigating SaaS products

A developer created operate.txt, a YAML file hosted at yourdomain.com/operate.txt that documents screen details, loading states, irreversible actions, and step-by-step paths for AI agents using computer use features. The spec addresses issues like Claude asking 'is this broken?' during legitimate loading screens.

OpenClawRadar
How Clawdbot Coordinates 6 AI Agents with a Production-Stable Work Queue
Tools

How Clawdbot Coordinates 6 AI Agents with a Production-Stable Work Queue

Clawdbot's team built a work queue system to coordinate 6 AI agents (design, code, marketing, ops) for their AI-operated store. The system features atomic task claiming, a state machine, retry logic with backoff, task chains, heartbeat tracking, and a daemon orchestrator.

OpenClawRadar
Project Ledger: Human-in-the-Loop Memory System for AI Coding Agents
Tools

Project Ledger: Human-in-the-Loop Memory System for AI Coding Agents

A GitHub project introduces a YAML-based ledger system where humans curate what AI agents remember about codebases. It includes a /ledger skill, UserPromptSubmit hook for automatic context injection, and Haiku auditor review.

OpenClawRadar
Vibeyard IDE adds embedded browser for direct web UI editing with AI agents
Tools

Vibeyard IDE adds embedded browser for direct web UI editing with AI agents

Vibeyard, an open-source IDE for AI coding agents, now includes a browser tab session type that lets users click elements in a web UI and instruct an AI agent to edit them directly, eliminating selector guessing and component hunting.

OpenClawRadar