Offline-web-search: A Local Google Search Alternative for AI Agents

✍️ OpenClawRadar📅 Published: March 10, 2026🔗 Source
Offline-web-search: A Local Google Search Alternative for AI Agents
Ad

What This Is

offline-web-search is a fully local Google Search alternative designed specifically for AI agents that need offline web search capabilities. It was created as a drop-in replacement for Claude's web tools, allowing LLMs to use it without complex prompting.

Core Problem Being Solved

The developer identified that existing offline search solutions for AI agents have significant limitations. Most tools either dump raw HTML files into the context window or have basic search functionality that prevents agents from finding specific documentation. This is particularly problematic in air-gapped environments, when handling sensitive data, or when building fully local stacks.

Ad

Key Technical Features

  • Search Engine Behavior: Instead of text dumps, it indexes content into a local SQLite FTS5 database using BM25 ranking, title boosting, synonym expansion, prefix matching, and non-English demotion.
  • Content Sources: Natively supports Kiwix ZIM archives (containing offline snapshots of Stack Overflow, Python docs, DevDocs, Wikipedia) and includes an indexing API and crawler for custom content like internal Confluence, company docs, or random HTML pages.
  • Architecture: Uses a client-server model with an HTTP API. The "heavy" content server runs centrally on your network, while lightweight clients connect via MCP server for Claude Desktop or native Claude Code skill.
  • Exposed Tools: Provides two standard tools: Google Search for BM25 ranked search and visit_page to return clean Markdown of full pages.

How It Works

The tool was built by reverse-engineering Claude's Web-Fetch and Web-Search tools, their system prompts, and functionality. Search results provide highly relevant, ranked snippets to the LLM, which can then use the visit_page tool to access clean, readable Markdown versions of full pages.

Who It's For

Developers building offline AI agents or heavily local setups who need reliable search capabilities without internet access.

📖 Read the full source: r/ClaudeAI

Ad

👀 See Also

nah: A context-aware permission guard for Claude Code
Tools

nah: A context-aware permission guard for Claude Code

nah is a PreToolUse hook that intercepts every tool call in Claude Code, classifying commands by action type like filesystem_read or git_history_rewrite and applying policies based on context. It runs a deterministic classifier in milliseconds with optional LLM escalation for ambiguous cases.

OpenClawRadar
Conduid: Trust Infrastructure Layer for MCP Servers Built with Claude
Tools

Conduid: Trust Infrastructure Layer for MCP Servers Built with Claude

Conduid indexes over 25,000 MCP servers across GitHub, npm, PyPI, and major directories, scoring each 0-100 based on GitHub activity, security posture, documentation quality, and maintenance signals. The entire codebase was written with Claude by a solo founder.

OpenClawRadar
Pleng: Self-Hosted Cloud Platform with AI-Driven Infrastructure Management
Tools

Pleng: Self-Hosted Cloud Platform with AI-Driven Infrastructure Management

Pleng is an AGPL-3.0 licensed, self-hosted cloud platform that uses an AI agent (currently Claude) to manage infrastructure via Telegram bot commands. It deploys from GitHub repos or local directories with automated Traefik routing, Let's Encrypt SSL, and basic analytics.

OpenClawRadar
Mind Keg MCP: Persistent Memory for Claude Code and MCP-Compatible Agents
Tools

Mind Keg MCP: Persistent Memory for Claude Code and MCP-Compatible Agents

Mind Keg MCP v0.1.1 is an open-source MCP server that provides persistent memory for Claude Code and other MCP-compatible agents. It stores learnings locally via SQLite and retrieves them via semantic search, allowing AI coding assistants to remember context between sessions.

OpenClawRadar