Engram: Open-source memory layer for Claude Code and MCP clients

What Engram is
Engram is an intelligent memory layer built using Claude Code that functions as an MCP (Model Context Protocol) server compatible with any MCP client. It's free and open-source under the AGPL license.
How it differs from Claude's native auto memory
Anthropic recently launched auto memory for Claude Code, which validates the need for persistent memory in agents. However, Engram offers several key differences:
- No cap: Stores unlimited memories with semantic vector search (only retrieves relevant content)
- 96% fewer tokens: Uses approximately 800 tokens per query versus 5K+ for loading an entire markdown file
- Higher accuracy: Achieves 80% accuracy on the LOCOMO benchmark (standard memory benchmark from Snap Research) versus 29% for flat-file approaches
- Cross-project: One vault across all projects, not siloed per repository
- Intelligence layer: Includes automatic extraction, consolidation, contradiction detection, and bi-temporal recall
- Client compatibility: Works with any MCP client, not locked to Claude Code
Claude's native system writes markdown files with a 200-line cap (approximately 5K tokens loaded every session) and operates per-project without semantic search.
Technical implementation
Behind the scenes, Engram uses:
- SQLite + sqlite-vec for vector search
- Knowledge graph for entity relationships
- LLM-powered consolidation that produces insights rather than just storage
Core functions
Your agent interacts with Engram through these primary calls:
engram_remember: Store memoriesengram_recall: Retrieve memories
Additional MCP tools
engram_ask: Question answeringengram_consolidate: Merge related memoriesengram_audit: Cross-reference external contentengram_briefing: Session start context
Getting started
Installation takes about 2 minutes:
npm install -g engram-sdk
engram initThis auto-detects Claude Code and Cursor, writes the MCP configuration, and creates your vault.
Availability
Engram is listed in the official MCP Registry as io.github.tstockham96/engram.
📖 Read the full source: r/ClaudeAI
👀 See Also

KV Cache Reuse for Long Conversations on Apple Silicon Delivers 200x Speedup
A developer implemented session-based KV cache reuse for local LLM inference using Apple's MLX framework, achieving a 200x improvement in time-to-first-token at 100K context length. The approach keeps the KV cache in memory across conversation turns, processing only new tokens.

Maggy: An Autonomous Engineering Platform on Claude Code with Cross-Session Memory and P2P Team Learning
Maggy sits at Level 4 of the AI coding tool spectrum: multi-model orchestration, cross-session memory, process intelligence from CI/reviews, and P2P team learning. Benchmarks show 83% reduction in Claude usage while catching 7 security issues missed by single-pipeline Claude Code.

Fino: Open-Source MCP Server for Personal Finance Analysis with Claude
Fino is a free, open-source MCP server that connects Claude to bank accounts through Plaid, stores transaction data locally in SQLite, and provides Claude with tools for financial analysis.

Developer builds local AI research agent that creates podcasts from topics or YouTube links
A developer built a fully local AI agent that takes topics or YouTube links and generates deep-dive reports, conversational podcast scripts, and audio. The system dynamically researches, extracts insights, refines summaries, and creates natural back-and-forth conversations.