Semble: A Local MCP Server for Claude Code with 98% Token Reduction

✍️ OpenClawRadar📅 Published: April 30, 2026🔗 Source

Semble is an MCP server that lets Claude Code search local codebases efficiently, returning only relevant code chunks instead of full files. It uses a hybrid of static embeddings, BM25, and a code-optimized reranking stack, all running locally on CPU — no API keys, no GPU, no heavy dependencies.

Installation

Install via uvx:

claude mcp add semble -s user -- uvx --from "semble[mcp]" semble

Once installed, Claude Code can search any repo — local or remote — directly.

Key Details

Token reduction: Uses ~98% fewer tokens than the typical grep+read approach.
Performance: Indexes any repo in ~250ms, answers queries in ~1.5ms (all on CPU).
Quality: Reaches NDCG@10 of 0.854 — 99% of the best transformer hybrid tested, while being ~200x faster.
Benchmarked against: grepai, probe, colgrep, and other existing methods.
Open source: Available on GitHub under the MinishLab organization.

Who It's For

Developers using Claude Code on large codebases who want to reduce token burn and latency while getting high-quality code search results without external API calls.

📖 Read the full source: r/ClaudeAI

👀 See Also

Tools

Introducing operate.txt: A YAML spec for AI agents navigating SaaS products

A developer created operate.txt, a YAML file hosted at yourdomain.com/operate.txt that documents screen details, loading states, irreversible actions, and step-by-step paths for AI agents using computer use features. The spec addresses issues like Claude asking 'is this broken?' during legitimate loading screens.

Mar 24, 2026, 07:45 PM UTC

OpenClawRadar

Tools

How Clawdbot Coordinates 6 AI Agents with a Production-Stable Work Queue

Clawdbot's team built a work queue system to coordinate 6 AI agents (design, code, marketing, ops) for their AI-operated store. The system features atomic task claiming, a state machine, retry logic with backoff, task chains, heartbeat tracking, and a daemon orchestrator.

Mar 1, 2026, 05:45 AM UTC

OpenClawRadar

Tools

Project Ledger: Human-in-the-Loop Memory System for AI Coding Agents

A GitHub project introduces a YAML-based ledger system where humans curate what AI agents remember about codebases. It includes a /ledger skill, UserPromptSubmit hook for automatic context injection, and Haiku auditor review.

Mar 29, 2026, 10:45 AM UTC

OpenClawRadar

Tools

Vibeyard IDE adds embedded browser for direct web UI editing with AI agents

Vibeyard, an open-source IDE for AI coding agents, now includes a browser tab session type that lets users click elements in a web UI and instruct an AI agent to edit them directly, eliminating selector guessing and component hunting.

Apr 21, 2026, 04:29 AM UTC

OpenClawRadar