agent-recall: Local SQLite MCP for Persistent Claude Code Memory

Solving Claude Code's Session Memory Problem
Claude Code forgets everything between sessions, requiring users to re-explain team details, decisions, and blockers. While CLAUDE.md works for single projects, it becomes messy across multiple projects with conflicting context.
agent-recall addresses this with a local SQLite-based memory layer that persists across sessions without cloud dependencies.
How It Works
- Single SQLite file in WAL mode - no cloud, no vector DB, nothing leaves your machine
- 9 MCP tools that let Claude save entities, relationships, and observations as you work
- At session start, an LLM generates a structured briefing covering key people, recent decisions, and active blockers instead of dumping raw data
- Versioned slot history - old values get archived, not overwritten
Key Features
Compared to claude-memory-mcp (which is great for free-form memories), agent-recall offers:
- Entity-relation model for structured data
- Scoped visibility to control which project sees what context
- LLM-summarized briefings rather than loading raw facts into context
Setup
Installation is straightforward:
pip install 'agent-recall[mcp]'Then add the server to your .mcp.json configuration file. The tool is MIT licensed with 321 tests and was developed from running approximately 30 agents daily for work.
The author raises an interesting question about implementation approaches: are LLM-summarized briefings actually better than loading raw facts into context? This reflects ongoing experimentation in the MCP ecosystem about optimal memory management strategies.
📖 Read the full source: r/ClaudeAI
👀 See Also
ClaudeAI Brainstorming Mode Gets Visual Companion for Mockups and UI Approval
A user discovers a new 'Visual companion' feature in ClaudeAI brainstorming mode that serves mockups on a local web server, enabling back-and-forth UI tweaks before building.

Master-plan: A Minimal Terminal Task System Built for Claude Code Users
A developer has built master-plan, a Claude Code plugin with four slash commands that manages tasks directly in the terminal using a markdown file and git. The system captures ideas mid-session without context switching and auto-detects test runners.

Using a Local LLM as a Claude Code Subagent to Reduce Context Usage
A Reddit user demonstrates how Claude Code can delegate tasks to a local LLM running via LM Studio, keeping file content out of Claude's context. The setup uses a ~120-line Python script with LM Studio's tool-calling API to handle file operations locally.

Claude Banana: A Claude Code plugin for image generation with design system awareness
Claude Banana is a Claude Code plugin that generates images using Google's Gemini API with context-aware prompt crafting. It reads Tailwind configs, CSS variables, design tokens, and existing assets to understand project visual styles.