Cold Validation Architecture: Dual-Agent Code Review System Open-Sourced

Cold Validation Architecture is an open-source system that implements dual-agent validation for AI-generated code. One agent writes the code, while a separate agent reviews it independently without access to the builder's reasoning or context.
How It Works
The system addresses single-agent self-review bias by applying separation of duties similar to independent audits. The reviewer operates in complete isolation from the builder's thought process.
Workflow Phases
- Write plan →
/review-plan(Gate A) - Implement →
/review-impl(Gate C) - Ship →
/acceptance-report(Gate D)
Key Implementation Details
- The reviewer runs in an isolated temporary directory
- Reviewer only sees: plan document, code diff, and test output
- Findings persist with fingerprints for tracking
- Builder can disagree with reviewer's rationale
- Maximum 2 rounds per phase
- Built with bash scripts and JSON schemas
- Currently works with Claude Code + Codex CLI
- Pattern is agent-agnostic (can work with different AI agents)
Technical Specifications
The system is implemented as bash scripts that coordinate between two separate AI agents. JSON schemas define the structure for plan documents, code reviews, and acceptance reports. The isolated execution environment ensures the reviewer has no access to the builder's internal reasoning or intermediate steps.
This approach is useful for developers who want to implement more rigorous validation for AI-generated code, particularly when working with complex systems where single-agent self-review might introduce confirmation bias.
📖 Read the full source: r/LocalLLaMA
👀 See Also

Giving Claude a Local LLM as an Assistant via MCP on Mac
A developer connects Claude to a local Qwen 2.5 Coder 14B via Ollama and MCP, creating a no-cost assistant for delegating tasks like text processing and handling large files.

harshal-mcp-proxy Now on npm: Single Daemon Replaces 12 MCP Server Configs
harshal-mcp-proxy is now available as a 54 kB npm package. Install globally, run as a daemon, and replace 12 separate MCP server configs with 6 tools, saving ~2.7 GB RAM and ~50K tokens per session.

Developer Creates Practical Claude Skills for Kotlin Multiplatform Projects
A developer built a public repository of Claude skills specifically for Kotlin Multiplatform work after finding existing skills too generic, opinionated, or thin. The skills cover architecture reviews, feature implementation, modularization, Compose Multiplatform UI, navigation, platform bridges, deep links, adaptive UI, testing, and build governance.

OpenClaw vs Hermes: Choose the Right Self-Hosted AI Agent After 100+ Deployments
After deploying 100+ AI agents for clients, a Reddit user shares hard-won lessons: OpenClaw (149K stars) is the reliable workhorse for single/small fleets; Hermes excels at multi-agent orchestration but has a smaller community.