Open-source MCP memory server with knowledge graph and learning features

An open-source MCP server called cuba-memorys provides persistent memory for AI agents, going beyond simple RAG or vector stores. Written in Rust with a PostgreSQL + pgvector backend, it implements a knowledge graph architecture with learning capabilities.
Architecture and features
The system stores entities, observations, and typed relations rather than flat documents. Key features include:
- Exponential decay — Memories fade realistically using importance = importance * exp(-0.693 * days/halflife) with a 30-day halflife
- Hebbian + BCM metaplasticity — Oja's rule with EMA sliding threshold; memories strengthen with access and self-normalize via BCM
- 4-signal RRF fusion (k=60) — Combines ts_rank + trigrams + pgvector HNSW + importance with entropy-routed weighting to detect keyword-dominant vs semantic queries
- Leiden community detection — Traag et al. 2019 algorithm for discovering clusters in the knowledge graph
- Personalized PageRank — Ranks entity importance based on graph topology
- Anti-hallucination verification — Triangulates claims against stored knowledge with graduated confidence scoring
- Error memory with pattern detection — Triggers warnings when ≥3 similar errors occur
Performance benchmarks
The Rust implementation shows significant improvements over the original Python version:
- Binary size: ~50MB venv (Python) vs 7.6MB (Rust)
- Entity create: ~2ms (Python) vs 498μs (Rust)
- Hybrid search: <5ms (Python) vs 2.52ms (Rust)
- Memory usage: ~120MB (Python) vs ~15MB (Rust)
- Dependencies: 12 packages (Python) vs 0 runtime (Rust)
Implementation details
The server provides 13 MCP tools and works with any MCP-compatible client including Claude Code, Cursor, and Windsurf. It's self-hosted with a PostgreSQL backend and makes no external API calls. All algorithms are based on peer-reviewed papers with citations in the README.
Installation options:
pip install cuba-memorys
or
npm install -g cuba-memorys
The project is available under CC BY-NC 4.0 license on GitHub.
📖 Read the full source: r/LocalLLaMA
👀 See Also

Efficient Workflow Using Claude Code: Planning Before Execution
Boris Tane leverages Claude Code with a structured planning-first approach, focusing on detailed research and planning to maintain control over architecture decisions.

AI Sandbox Manager: LXC-Based Sandbox for Codex with GPU Passthrough and Computer Use on Headless Linux
ai-sandbox-manager is an open-source LXC-based sandbox for Codex agents on headless Linux. It provides GPU passthrough, full sudo access, persistent environments, and computer use via CUA, all while isolating the agent from the host OS.

OpenClaw Multi-Agent Workflow Issues: Stalling, Context Loss, and Token Inefficiency
A developer reports OpenClaw multi-agent workflows frequently stall with agents hanging, experience context leakage despite custom documentation, and consume excessive tokens with no output. The setup used Gemini 3 Pro/Codex models with a COO orchestrator and specialized task agents.

Developer builds local AI research agent that creates podcasts from topics or YouTube links
A developer built a fully local AI agent that takes topics or YouTube links and generates deep-dive reports, conversational podcast scripts, and audio. The system dynamically researches, extracts insights, refines summaries, and creates natural back-and-forth conversations.