cortex-engine MCP server adds persistent memory and multi-agent support

Persistent memory for AI agents
cortex-engine v0.4.0 is an open-source MCP server that provides persistent long-term memory for AI agents. Unlike typical agents that lose state when sessions end, this tool enables agents to learn and retain information over time.
Core memory tools
The server offers several key functions for MCP-compatible agents:
observe()— records facts in real-timequery()— performs semantic search through past memoriesbelieve()— tracks when understanding of a concept evolvesdream()— consolidates short-term observations into a structured knowledge graph
There are 40+ additional memory tools available. According to the developer, after several months of use, query() can retrieve context from weeks-old sessions that users had forgotten about, with the knowledge graph becoming more useful over time rather than degrading.
Multi-agent support
The latest release adds multi-agent functionality. You can run different agents (like a researcher and a coder) in the same project with isolated memory namespaces. Agents won't share memories unless explicitly configured to collaborate.
Technical implementation
The system is local by default, using SQLite with built-in embeddings and no external services required. It's both LLM and database agnostic, providing flexibility in implementation.
cortex-engine is available on npm as version 0.4.0.
📖 Read the full source: r/clawdbot
👀 See Also

Hypura: Storage-tier-aware LLM inference scheduler for Apple Silicon
Hypura is a Rust-based inference scheduler that places model tensors across GPU, RAM, and NVMe tiers to run models exceeding physical memory on Apple Silicon Macs. It enables running a 31GB Mixtral 8x7B on a 32GB Mac Mini at 2.2 tok/s and a 40GB Llama 70B at 0.3 tok/s where vanilla llama.cpp crashes.

Open-source CLI uses Claude Haiku to automate Xero expense auditing
A developer has released an open-source Python CLI tool that uses Claude Haiku 4.5 to automate Xero expense auditing. The tool follows a 'deterministic code first, then AI to fill in the gaps' approach, keeping costs to a few cents per audit run.

Maggy: An Autonomous Engineering Platform on Claude Code with Cross-Session Memory and P2P Team Learning
Maggy sits at Level 4 of the AI coding tool spectrum: multi-model orchestration, cross-session memory, process intelligence from CI/reviews, and P2P team learning. Benchmarks show 83% reduction in Claude usage while catching 7 security issues missed by single-pipeline Claude Code.
CTOP: Terminal UI to Monitor Claude Code Sessions, Zero Deps
CTOP is a zero-dependency Node.js TUI that shows CPU, memory, context window saturation, token breakdown, and cost estimates for all running Claude Code and Codex sessions.