Semble: Code Search for AI Agents Using 98% Fewer Tokens Than grep+read

✍️ OpenClawRadar · 📅 Published: May 17, 2026

Semble is a fast, token-efficient code search library built specifically for AI coding agents like Claude Code, Cursor, Codex, and OpenCode. It returns relevant code snippets from natural language or code queries, using ~98% fewer tokens than the typical grep+read fallback approach.

How It Works

Semble combines static Model2Vec embeddings (from the authors' own potion-code-16M model) with BM25 lexical scoring. The two rankings are fused with reciprocal rank fusion (RRF) and then reranked using code-aware signals. All computation runs on CPU: no GPU, no API keys, no external services. Indexing an average repo takes ~250ms, and queries complete in ~1.5ms.
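To make the fusion step concrete, here is a minimal sketch of reciprocal rank fusion in Python. It is not Semble's code: the function name, the chunk IDs, and the constant k=60 (the value commonly used for RRF) are assumptions for illustration, and the code-aware reranking step is not shown.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of IDs into one list, best first.

    Each list contributes 1 / (k + rank) to an item's score, with
    ranks starting at 1. k=60 is a conventional default and an
    assumption here, not a value taken from Semble.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by ID for determinism.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Hypothetical chunk IDs, as if returned by a lexical and an embedding search.
bm25_ranking = ["auth.py:10", "login.py:42", "utils.py:7"]
embedding_ranking = ["login.py:42", "session.py:3", "auth.py:10"]

fused = reciprocal_rank_fusion([bm25_ranking, embedding_ranking])
print(fused)
# ['login.py:42', 'auth.py:10', 'session.py:3', 'utils.py:7']
```

Chunks that both retrievers rank highly rise to the top, while a chunk found by only one retriever still appears further down. RRF uses only ranks, so BM25 scores and embedding similarities never need to be put on a common scale.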

Key Features

  • Token-efficient: 98% fewer tokens than grep+read — returns only the relevant chunks.
  • Fast: ~250ms to index a typical repo, ~1.5ms per query (very large repos may take longer).
  • Accurate: 0.854 NDCG@10 on the authors' benchmark of ~1250 query/document pairs across 63 repos and 19 languages. That is 99% of the score of the best transformer setup (137M parameters), with ~200x faster indexing and ~10x faster queries.
  • Zero config: No API keys, GPU, or external services required.
  • MCP server: Drop-in for Claude Code, Cursor, Codex, OpenCode, and any MCP-compatible agent.
  • Local and remote: Pass a local path or a git URL. Indexes are cached per session and auto-updated on file changes.
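For readers unfamiliar with the NDCG@10 metric quoted in the accuracy figure, the sketch below shows one common way to compute it. The article does not say exactly how the benchmark computes NDCG, so the linear-gain formula, the function names, and the sample relevance labels are all assumptions.

```python
import math


def dcg_at_k(relevances, k=10):
    """Discounted cumulative gain over the first k relevance labels."""
    return sum(rel / math.log2(i + 2) for i, rel in enumerate(relevances[:k]))


def ndcg_at_k(ranked_relevances, all_relevances, k=10):
    """NDCG@k: DCG of the returned order divided by the best possible DCG.

    ranked_relevances: labels of the returned results, in returned order.
    all_relevances: labels of every judged document for the query.
    """
    ideal = dcg_at_k(sorted(all_relevances, reverse=True), k)
    return dcg_at_k(ranked_relevances, k) / ideal if ideal > 0 else 0.0


# Hypothetical query with a single relevant chunk (label 1).
perfect = ndcg_at_k([1, 0, 0], all_relevances=[1, 0, 0])
second_place = ndcg_at_k([0, 1, 0], all_relevances=[1, 0, 0])
print(perfect, round(second_place, 3))
```

A score of 1.0 means every relevant chunk sits at the top of the results. The logarithmic discount lowers the score the further down the list a relevant chunk appears.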

Installation and Setup

MCP server (recommended for agents)

Requires uv to be installed. For Claude Code:

claude mcp add semble -s user -- uvx --from "semble[mcp]" semble

For Codex, add to ~/.codex/config.toml:

[mcp_servers.semble]
command = "uvx"
args = ["--from", "semble[mcp]", "semble"]

For OpenCode, add to ~/.opencode/config.json:

{
  "mcp": {
    "semble": {
      "type": "local",
      "command": ["uvx", "--from", "semble[mcp]", "semble"]
    }
  }
}

For Cursor, add to ~/.cursor/mcp.json or .cursor/mcp.json:

{
  "mcpServers": {
    "semble": {
      "command": "uvx",
      "args": ["--from", "semble[mcp]", "semble"]
    }
  }
}

Bash integration (alternative)

Install with pip or uv, then add the code search snippet to AGENTS.md or CLAUDE.md:

pip install semble
# or, using uv:
uv tool install semble

Then in AGENTS.md:

## Code Search
Use `semble search` to find code by describing what it does or naming a symbol/identifier, instead of grep:
```bash
semble search "authentication flow" ./my-project
```

MCP Tools

The MCP server exposes two tools:

  • search — Search a codebase with a natural-language or code query. Pass repo as a local directory path or an https:// git URL.
  • find_related — Given a file path and line number, return chunks semantically similar to the code at that location.
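As a rough illustration of how an MCP client might invoke the search tool, the sketch below builds a JSON-RPC `tools/call` request in Python. The `tools/call` method and the `name`/`arguments` envelope come from the MCP specification. The `repo` argument name comes from the description above, while the `query` argument name and the helper function are assumptions, so check the tool schema the server reports before relying on them.

```python
import json


def build_search_call(query, repo, request_id=1):
    """Build an MCP tools/call request for Semble's search tool.

    The "query" argument name is an assumption; "repo" is the
    argument named in the tool description.
    """
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {
            "name": "search",
            "arguments": {"query": query, "repo": repo},
        },
    }


request = build_search_call("authentication flow", "./my-project")
print(json.dumps(request, indent=2))
```

In practice the agent (Claude Code, Cursor, and so on) builds and sends this message itself. The sketch only shows the shape of the call.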

📖 Read the full source: HN AI Agents
