Local Semantic Memory Search for OpenClaw Agents Using Harrier Embeddings

✍️ OpenClawRadar📅 Published: June 7, 2026🔗 Source
Local Semantic Memory Search for OpenClaw Agents Using Harrier Embeddings
Ad

A new repo shows how to give an OpenClaw agent local semantic memory search without sending embeddings to an external service. The approach runs a small local embedding server around Microsoft's Harrier model (microsoft/harrier-oss-v1-0.6b), exposes an Ollama-compatible API, and wires it to OpenClaw's memorySearch config.

How it works

The embedding server runs Harrier locally and provides /api/embed and /api/embeddings endpoints that match Ollama's API format. OpenClaw's memorySearch already supports Ollama-style endpoints, so pointing it at http://localhost:8000 gives the agent a local SOTA semantic memory layer.

Why this matters for agent memory

Most agent memory systems have two pain points:

  • Shoving too much memory into the prompt burns tokens and makes context messy.
  • Keeping memory files small and manual becomes hard to maintain as history grows.

Semantic memory search offers a middle path. Long-term memory stays in normal markdown files (MEMORY.md, daily logs, notes, project files) that are human-readable and editable. At runtime, the agent retrieves only relevant chunks.

Ad

Benefits

  • Less token waste — not stuffing every durable fact into every prompt.
  • Cleaner memory files — no need to compress into one giant context blob.
  • Better recall — finds conceptually related notes even when wording doesn't match exactly.
  • Easier debugging — source of truth is plain text, not an opaque vector database.
  • Better privacy — embeddings computed locally, no data shipped to hosted API.

What the repo includes

  • Small Python embedding server implementing Ollama-compatible endpoints
  • Example OpenClaw memorySearch config
  • macOS launchd service template
  • Mock markdown memory corpus
  • Smoke tests and local query demo

The repo is at github.com/promptclickrun/harrier-openclaw-memory-search.

📖 Read the full source: r/openclaw

Ad

👀 See Also

Claude Code v2.1.76 System Prompt Updates: Security Monitor Refinements and New Hook Event
Tools

Claude Code v2.1.76 System Prompt Updates: Security Monitor Refinements and New Hook Event

Claude Code v2.1.76 includes updates to system prompts with 43 new tokens, featuring refinements to the security monitor for autonomous agents and the addition of a PostCompact hook event. Changes include clarified sensitive data detection, expanded code deserialization examples, and improved formatting for irreversible local destruction guidance.

OpenClawRadar
MAGELLAN: A 15-Agent Autonomous Scientific Discovery System Built on Claude Code
Tools

MAGELLAN: A 15-Agent Autonomous Scientific Discovery System Built on Claude Code

MAGELLAN is a 15-agent autonomous scientific discovery system built entirely on Claude Code. It uses Opus for deep reasoning and Sonnet for structured tasks, generating cross-disciplinary hypotheses without human direction, with 260 hypotheses proposed and 60% killed by adversarial validation in 19 sessions.

OpenClawRadar
SideX: A Tauri-Based Port of Visual Studio Code
Tools

SideX: A Tauri-Based Port of Visual Studio Code

SideX is a port of Visual Studio Code that replaces Electron with Tauri, using a Rust backend and the OS's native webview. The project claims the same architecture with 96% smaller size, with core editing and terminal functionality currently working.

OpenClawRadar
Claude Code Routines: Automated Cloud Tasks for AI Development Workflows
Tools

Claude Code Routines: Automated Cloud Tasks for AI Development Workflows

Claude Code Routines allow developers to save Claude Code configurations as automated tasks that run on Anthropic-managed cloud infrastructure. Routines support scheduled, API, and GitHub triggers for unattended execution of prompts against repositories.

OpenClawRadar