Open-source MCP memory server with knowledge graph and learning features

✍️ OpenClawRadar📅 Published: March 28, 2026🔗 Source

An open-source MCP server called cuba-memorys provides persistent memory for AI agents, going beyond simple RAG or vector stores. Written in Rust with a PostgreSQL + pgvector backend, it implements a knowledge graph architecture with learning capabilities.

Architecture and features

The system stores entities, observations, and typed relations rather than flat documents. Key features include:

Exponential decay — Memories fade realistically using importance = importance * exp(-0.693 * days/halflife) with a 30-day halflife
Hebbian + BCM metaplasticity — Oja's rule with EMA sliding threshold; memories strengthen with access and self-normalize via BCM
4-signal RRF fusion (k=60) — Combines ts_rank + trigrams + pgvector HNSW + importance with entropy-routed weighting to detect keyword-dominant vs semantic queries
Leiden community detection — Traag et al. 2019 algorithm for discovering clusters in the knowledge graph
Personalized PageRank — Ranks entity importance based on graph topology
Anti-hallucination verification — Triangulates claims against stored knowledge with graduated confidence scoring
Error memory with pattern detection — Triggers warnings when ≥3 similar errors occur

Performance benchmarks

The Rust implementation shows significant improvements over the original Python version:

Binary size: ~50MB venv (Python) vs 7.6MB (Rust)
Entity create: ~2ms (Python) vs 498μs (Rust)
Hybrid search: <5ms (Python) vs 2.52ms (Rust)
Memory usage: ~120MB (Python) vs ~15MB (Rust)
Dependencies: 12 packages (Python) vs 0 runtime (Rust)

Implementation details

The server provides 13 MCP tools and works with any MCP-compatible client including Claude Code, Cursor, and Windsurf. It's self-hosted with a PostgreSQL backend and makes no external API calls. All algorithms are based on peer-reviewed papers with citations in the README.

Installation options:

pip install cuba-memorys

npm install -g cuba-memorys

The project is available under CC BY-NC 4.0 license on GitHub.

📖 Read the full source: r/LocalLLaMA

👀 See Also

Tools

Efficient Workflow Using Claude Code: Planning Before Execution

Boris Tane leverages Claude Code with a structured planning-first approach, focusing on detailed research and planning to maintain control over architecture decisions.

Feb 22, 2026, 03:45 AM UTC

OpenClawRadar

Tools

AI Sandbox Manager: LXC-Based Sandbox for Codex with GPU Passthrough and Computer Use on Headless Linux

ai-sandbox-manager is an open-source LXC-based sandbox for Codex agents on headless Linux. It provides GPU passthrough, full sudo access, persistent environments, and computer use via CUA, all while isolating the agent from the host OS.

Jun 7, 2026, 12:15 PM UTC

OpenClawRadar

Tools

OpenClaw Multi-Agent Workflow Issues: Stalling, Context Loss, and Token Inefficiency

A developer reports OpenClaw multi-agent workflows frequently stall with agents hanging, experience context leakage despite custom documentation, and consume excessive tokens with no output. The setup used Gemini 3 Pro/Codex models with a COO orchestrator and specialized task agents.

Feb 28, 2026, 11:45 AM UTC

OpenClawRadar

Tools

Developer builds local AI research agent that creates podcasts from topics or YouTube links

A developer built a fully local AI agent that takes topics or YouTube links and generates deep-dive reports, conversational podcast scripts, and audio. The system dynamically researches, extracts insights, refines summaries, and creates natural back-and-forth conversations.

Apr 17, 2026, 05:04 PM UTC

OpenClawRadar