Signet: An Open-Source Local-First Memory Substrate for AI Agents

What Signet Solves
Most current AI agent memory systems operate like RAG (Retrieval-Augmented Generation): user message → search memory → retrieve results → answer. This works for explicit queries but breaks when relevant context is implicit.
Examples from the source:
- "Set up the database for the new service" should surface that PostgreSQL was already chosen
- "My transcript was denied, no record under my name" should surface that the user changed their name
- "What time should I set my alarm for my 8:30 meeting?" should surface commute time
The problem isn't storage - it's that these systems wait for the current message to contain enough query signal to retrieve the right past context.
How Signet Works
Signet handles memory outside the agent loop with this architecture:
- Preserves raw transcripts
- Distills sessions into structured memory
- Links entities, constraints, and relations into a graph
- Uses graph traversal + hybrid retrieval to build a candidate set
- Reranks candidates for prompt-time relevance
- Injects context before the next prompt starts
The agent isn't deciding what to save or when to search - it starts with context already available. This moves from query-dependent retrieval toward ambient recall.
Technical Details
Signet is:
- Local-first (SQLite + markdown)
- Inspectable and repairable
- Works across Claude Code, Codex, OpenCode, and OpenClaw
On LoCoMo, it currently achieves 87.5% answer accuracy with 100% Hit@10 retrieval on an 8-question sample. The developer notes this is a small sample but shows the approach is promising.
📖 Read the full source: r/openclaw
👀 See Also

Efficient Token Management with Open-Source MCP Servers: Pare
Pare MCP servers reduce token waste and enhance efficiency when AI coding agents use developer tools by providing structured output.

harshal-mcp-proxy Now on npm: Single Daemon Replaces 12 MCP Server Configs
harshal-mcp-proxy is now available as a 54 kB npm package. Install globally, run as a daemon, and replace 12 separate MCP server configs with 6 tools, saving ~2.7 GB RAM and ~50K tokens per session.

Using Obliteratus toolkit to remove refusal weights from AI models
A Reddit user used the Obliteratus toolkit to surgically remove specific weights responsible for refusal behavior in AI models, demonstrating on Alibaba's Qwen 1.5B model that it can reveal training origins without retraining.

Clawdwatch: Open-source OSINT tool for real-time flight tracking, news scraping, and alerts
Clawdwatch is a CLI tool that pulls live flight data from OpenSky Network, scrapes news from Al Jazeera and AP, and can send Telegram alerts for military aircraft or emergency squawks. It runs locally with npm install and tracks 204+ flights over the Middle East in real-time.