Manual-Driven Development: A Method to Prevent Claude Code's Confident Divergence

Manual-Driven Development (MDD) is a methodology designed to prevent confident divergence in Claude Code: the AI produces incorrect code that compiles and passes its own tests, because both the code and the tests rest on incorrect assumptions about the system.
The Problem: Confident Divergence
When Claude Code works on a large production codebase, it reads a few files, infers patterns, and starts coding from those inferences. The result is confident divergence: code that looks correct, runs, and passes tests, but is wrong because Claude wrote the tests against its own assumptions rather than the system's actual behavior. The divergence surfaces only in production, when real users hit edge cases Claude never knew existed.
The author notes that the problem isn't only that Claude doesn't know your system; developers can't reliably narrate their entire system either. At 11pm, you won't remember to mention that operators are scoped to specific groups, that ROLE_HIERARCHY is defined in three different files, or that base-tier policies are system-only and cannot be created via the API.
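To make the failure mode concrete, here is a minimal Python sketch built on one rule from the example above (base-tier policies are system-only and cannot be created via the API). Every name in it, including the function names and the tier labels "base" and "custom", is hypothetical and invented for illustration; none of it comes from the audited codebase.

```python
# Hypothetical illustration of confident divergence. Only the rule itself
# (base-tier policies are system-only) comes from the text; all names are
# assumptions made for this sketch.
SYSTEM_ONLY_TIERS = {"base"}


def create_policy_assumed(tier: str, via_api: bool) -> bool:
    """What an assistant might write after skimming a few files:
    it assumes any tier can be created through the API."""
    return True


def create_policy_documented(tier: str, via_api: bool) -> bool:
    """Behavior that follows the documented rule."""
    return not (via_api and tier in SYSTEM_ONLY_TIERS)


def check_from_assumption(create) -> bool:
    """A check derived from the same assumption as the code under test."""
    return create("custom", via_api=True) is True


def check_from_manual(create) -> bool:
    """A check derived from the documented rule."""
    return create("base", via_api=True) is False


print(check_from_assumption(create_policy_assumed))  # True: the wrong code passes
print(check_from_manual(create_policy_assumed))      # False: the documented rule exposes it
print(check_from_manual(create_policy_documented))   # True
```

The assumption-based check passes against the wrong implementation because it shares that implementation's blind spot. Only a check written from documented behavior can fail for the right reason.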
MDD Results in Production
In a production audit using MDD:
- Seven sections audited
- 190 findings identified
- 876 new tests written
- 7 hours and 48 minutes of Claude Code session time
- Estimated 234 to 361 hours of human developer time saved (30-46x compression ratio)
- Zero CLAUDE.md rule violations across all sections
The codebase had 200+ routes, 80+ models, and a daemon enforcement pipeline that converts network policies into live nftables rules on the host.
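The 30-46x compression ratio follows directly from the reported figures. As a quick check, this short Python sketch divides the estimated human hours by the session time; the variable names are ours, and the only inputs are the numbers listed above.

```python
# Reported figures from the audit summary above.
session_hours = 7 + 48 / 60          # 7 hours 48 minutes = 7.8 hours
human_hours_low, human_hours_high = 234, 361

# Compression ratio = estimated human hours / Claude Code session hours.
ratio_low = human_hours_low / session_hours
ratio_high = human_hours_high / session_hours

print(f"{ratio_low:.0f}x to {ratio_high:.0f}x")  # 30x to 46x
```

The upper bound works out to roughly 46.3, which the source rounds down to 46.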
How MDD Differs from Other Tools
MDD addresses a different problem than existing tools:
- GSD solves context rot (quality degradation as sessions fill up)
- Mem0 and Claude-Mem solve session amnesia (forgetting everything when sessions end)
- MDD solves confident divergence (wrong code based on incorrect assumptions)
All three approaches can run together without conflict. RuleCatch, which monitors rule enforcement in real time, reported 60% fewer rule violations during the SwarmK build with MDD than in sessions without it, with the same model, rules, and codebase in both cases.
According to the source, the methodology, prompts, and section-by-section data are documented and reproducible.
📖 Read the full source: r/ClaudeAI