Traversable Skill Graph for Persistent AI Agent Memory in Codebases

A Practical Solution to Stateless AI Assistants
AI coding assistants lack persistent memory between sessions, forcing developers to start from zero each time. The common approach of dumping everything into large rules files like .cursorrules fails due to token limits and dilution of instructions.
The solution presented is progressive disclosure through a traversable skill graph that lives inside the codebase. The AI navigates this graph autonomously across sessions.
Three-Layer Architecture
The system has three distinct layers:
- Layer 1 (Always Loaded): Under 150 lines (300 tokens). Contains stack identity, folder conventions, and non-negotiables. Includes one outbound pointer to
HANDOVER.md. - Layer 2 (Loaded Per Session):
HANDOVER.mdserves as an attention router, not a document. It tells the AI which domain file to load based on current task (payments, auth, database, api-routes). Each domain file ends with instructions pointing to the next relevant file, creating a self-directing system. - Layer 3 (Loaded Per Task): Prompt library with 12 categories. Each entry includes context, build, verify, and debug sections. The AI checks the index, loads the category, and follows the pattern.
Core Insight: Self-Directing Instructions
The key innovation is that instructions carry meaning, not just references. For example: "load security/threat-modeling.md before modifying webhook handlers" tells the AI when and why, not just what.
The developer has built this into a SaaS template that ships with the codebase, available at launchx.page for those wanting to examine the full graph structure.
📖 Read the full source: r/LocalLLaMA
👀 See Also

Building a Sub-500ms Voice Agent: Architecture and Performance Insights
A developer built a voice agent from scratch achieving ~400ms end-to-end latency with full STT → LLM → TTS streaming. Key insights include treating voice as a turn-taking problem, using semantic end-of-turn detection, and colocating all components for minimal latency.

OpenClaw developer builds Kumiho cognitive memory plugin for persistent agent collaboration
A developer created Kumiho, an AI cognitive memory system backed by a knowledge graph, to address OpenClaw's lack of memory across sessions. The openclaw-kumiho plugin hooks into conversations to recall context, capture structured summaries, and maintain versioned creative outputs.

DebugBase: A Collective Error Knowledge Base for AI Coding Agents via MCP
DebugBase is an MCP-compatible tool that provides a shared knowledge base where AI coding agents can check for known fixes to common errors like Next.js hydration mismatches or TypeScript resolution issues. It includes 11 MCP tools and comes pre-seeded with 58 error/fix pairs from real agent sessions.

Claude's Canva integration: a practical workflow for design generation
Claude's Canva connector exports editable Canva projects with structured layouts, not flat images. The post details a workflow from prompt to finished carousel in 12-15 minutes, including setup, high-fidelity mode, and honest limitations.