Open-source solo RPG engine uses three Claude instances for parsing, narration, and direction

Architecture and Pipeline
EdgeTales is a Python-based solo RPG engine where players type character actions, dice mechanics resolve outcomes behind the scenes, and Claude AI writes atmospheric prose based on results. The core design principle is "AI narrates, it does not decide"—dice determine success or failure, while Claude only turns outcomes into story.
The system uses a triple-AI pipeline with three Claude instances per player turn:
- Brain (Claude Haiku): Parses free-text input into structured JSON with fields like RPG move, stat, target NPC, position/effect level. Takes ~300ms and costs ~$0.0002.
- Narrator (Claude Sonnet): Receives structured prompts with dice results, NPC context, and story arc, then writes atmospheric prose. Also embeds hidden metadata (
<new_npcs>,<memory_updates>) that the parser extracts for game state updates. Takes ~2s and costs ~$0.003. - Director (Claude Haiku): Runs asynchronously after the player sees narration. Analyzes scenes like a TV showrunner for NPC behavior hints, plot thread tracking, and scene summaries. Only triggers on specific events (failed rolls, new NPCs, every 3rd scene) with zero player-facing latency.
Total cost per turn is ~$0.003–0.004, making a 20-scene session cost ~6–8 cents. The Director's output goes into <director_guidance> tags in the next Narrator prompt, with graceful degradation if the Director fails.
Technical Implementation Details
Prompt Engineering Lessons:
- Structured XML context injection (
<world>,<character>,<npc>,<story_arc>,<director_guidance>) made Sonnet's output more consistent than prose instructions. - Haiku is effective for structured parsing—the Brain returns valid JSON with 8+ fields from free-form multilingual input.
- JSON repair is essential. Both models occasionally produce malformed JSON (missing commas in German text, unescaped newlines, trailing commas). A
_repair_json()function with try-first approach handles this with zero overhead for valid JSON. - NPC deduplication uses three safety nets: explicit
<npc_rename>tags, fuzzy substring matching before creation, and alias-aware search.
NPC Memory System: Each NPC has importance-weighted memory calculated as Score = 0.40 × Recency + 0.35 × Importance + 0.25 × Relevance. The Director generates "reflections" (how an NPC feels) alongside factual observations. Memory stays bounded at 25 entries per NPC with intelligent consolidation.
Technical Stack: Python 3.11+, NiceGUI, Anthropic SDK, EdgeTTS/Chatterbox (TTS), Faster-Whisper (STT). The codebase is ~6,800 lines across 5 files. Features include 20+ narration languages, voice I/O, PDF export, kid-friendly mode, and Raspberry Pi compatibility.
📖 Read the full source: r/ClaudeAI
👀 See Also

Telegram Bot to Manage Headless Claude Code Channels via tmux
A zero-dependency Python Telegram bot that launches, stops, and monitors Claude Code Channels sessions in tmux on a headless server, with watchdog auto-restart.

IronBee: Open-source verification layer for Claude Code and Cursor
IronBee is an open-source verification layer that forces AI coding agents to test changes in a real browser before completing tasks. In testing, it caught bugs in 82% of Claude Code sessions that would have shipped without verification.

Automate GitHub PR review with Claude Code agents
A developer built an agent that processes GitHub mentions, spawns Claude Code workers to review or fix PRs, and only escalates ambiguous cases to humans.

Broccoli: Open-source harness for running AI coding agents from Linear tickets in cloud sandboxes
Broccoli is an open-source tool that takes coding tasks from Linear, executes them in isolated cloud sandboxes using Claude and Codex, and opens PRs for human review. It runs on your own Google Cloud infrastructure with production-grade deployment.