Survey of Local-First Markdown Memory Servers for AI Agents: Mem0, Hindsight, Zep, and the Newcomer Engram
A Reddit user asked for a fully local agent memory system that stores memories as readable Markdown files — not a database or cloud service. After receiving ~20 suggestions and testing them all, here is the breakdown of what each tool actually offers and where the gaps remain.
Non-Memory Systems Flagged
Several suggested tools are not memory systems: ChromaDB is a vector database; qmd is a document search engine with no write pipeline; ContextKeep does context compression; LCM preserves session context only.
Established Options
- mem0 — market leader, graph-based memory, SDKs in multiple languages, production-scale. Downsides: defaults to OpenAI, leans hosted, stores in opaque database.
- Hindsight — knowledge graph, entity resolution, handles contradictory memories. Requires Postgres + vector DB, storage is SQL — can't read files directly.
- Zep — longest track record, multi-modal memory, structured extraction. Cloud-first, similar infra requirements to Hindsight.
- Honcho — continual learning, stateful architecture, more research-grade. AGPL license + cloud dependency.
OpenClaw-Specific Options
- memory-lancedb-pro — strongest memory plugin for OpenClaw, hybrid retrieval, decay model, actively maintained. Not a standalone server.
- GBrain — MCP-first, decent OpenClaw integration, not useful outside ecosystem.
Most Interesting Newcomer: mnem
mnem is a Rust single binary, no Python/Ollama/external dependencies. Described as "git for agent memories": branch, diff, merge, revert. Uses GraphRAG. Benchmarks well against mem0. Two weeks old — thin test coverage. Storage is content-addressed graph nodes, not readable files.
The Gap and What Fills It: Engram
None of the tested tools combined fully local + human-readable file storage + smart deduplication + importance decay + standalone server with no infrastructure requirements. Engram by Obsidian68 (github.com/Obsidian68/Engram) is brand new (almost no stars) but checks all four boxes:
- Memories stored as Markdown files in a folder — openable in VS Code, editable, deletable.
- Full REST API and MCP server.
- Smart dedup on writes, importance decay for older memories.
- Runs entirely on Ollama — no API keys, no external calls, fully local.
If privacy and readability matter for your agent's knowledge, Engram is currently the only complete solution.
📖 Read the full source: r/openclaw
👀 See Also

Open-source MCP suite improves Claude Code generation quality by 15-20%
An open-source MCP suite consisting of three local servers and a prompt skill addresses the 'bad token' problem in AI code generation, with one customer reporting 15-20% quality improvement for Claude Code.

Revdiff: Terminal Diff Viewer with Inline Annotations for AI Agents
Revdiff is a TUI diff reviewer built specifically for reviewing AI-generated code changes without leaving terminal sessions. It outputs structured annotations to stdout that can be piped directly back to AI agents like Claude Code, creating a continuous review loop.

Distilled Qwen 3.5 27B Model Shows Strong Performance with Cursor AI Coding Agent
A user reports that the opus 4.6 distilled version of Qwen 27B works effectively as the model driving Cursor, with performance comparable to Gemini 3 Flash. Setup took about 10 minutes using Cursor to configure ngrok tunnel and localllama.

Building an Agentic RAG for Obsidian with Claude and an Eval Harness to Detect Hallucinations
A developer built an agentic RAG system over an Obsidian vault to let Claude answer questions from engineering books, then created an eval harness using Claude Sonnet as a judge to detect when the agent was confidently wrong. Rubric iterations improved judge-human agreement from 39% to 94%.