Hacking Multi-Agent Orchestration into OpenClaw: A Developer's Experience

A developer shared their experience modifying OpenClaw to implement true multi-agent orchestration after discovering that agents were pretending to collaborate without actually calling each other.
The Problem: Fake Collaboration
The developer initially set up multiple agents (PM, planner, backend, frontend, designer) with different assigned models, expecting an orchestrator to coordinate them. While responses appeared structured with different sections and perspectives, log analysis revealed the PM agent was doing everything solo and faking the other agents' contributions. None of the other agents were actually called.
The core issue: OpenClaw treats each agent as an independent unit with no built-in way for one agent to spawn another, wait for results, and fold them back in.
The Solution: Core Runtime Modifications
To implement proper orchestration, the developer modified the core runtime (reply-Bm8VrLQh.js) to handle:
- Parent-child agent spawning via sessions_spawn / sessions_yield
- Subagent completion events bubbling up to parent
- Proper message assembly for the gateway and TUI
The sessions_yield implementation was particularly challenging, requiring about 90 minutes of continuous Codex assistance to get the async flow correct.
Results and Tradeoffs
After implementation:
- Agents now run on separate threads in parallel
- Results get aggregated by the orchestrator
- PM receives a consolidated report and formats the final output
- Each agent actually uses its assigned model (fixing a bug where they all defaulted to the base model)
Tradeoffs include:
- Full pipeline takes 30-60 seconds vs near-instant for single agent
- Cost was about $0.90 over two days of testing
- Memory sits around 10-16GB during active runs
Hardware and Initial Setup
The developer used an M4 Mac Mini (32GB) as a dedicated AI assistant for organizing messy notes and summarizing research. They initially tried running LLMs locally with a 30B model but found it painfully slow and switched to commercial APIs (OpenAI, Claude, Gemini) through OpenClaw.
Output quality with orchestration is still being evaluated. For simple tasks, a single agent is faster and cheaper, but for complex multi-step tasks, specialization may pay off with more tuning needed.
📖 Read the full source: r/openclaw
👀 See Also

iOS App Built Entirely with Claude Code by Non-Engineer Ships to App Store
A product manager with no iOS development experience shipped SpectraSort, a photo sorting app built entirely with Claude Code. The app uses on-device AI for quality ranking and personal taste learning, processing about 10 photos/second on the Neural Engine.

Using AI to Enhance Existing Enterprise Tools Like Jira
A developer used Claude's Chrome extension to create a Jira sidebar showing cross-project dependency graphs in 4 prompts, working directly within the existing Jira interface.

How a React useEffect Bug Caused Random Haptic Feedback and Tanked App Retention
A developer ignored user reports of random app vibrations for months, only to discover a React useEffect dependency issue was causing constant haptic feedback on mid-range devices, dropping 7-day retention from 35% benchmark to 18%.

OpenClaw Agent Burned $20 in API Tokens Due to Web Scraping Context Bloat
A developer building an OpenClaw agent to monitor financial sites accidentally consumed $20 worth of API tokens in a few hours by fetching Yahoo Finance pages that included 609,000 tokens of extraneous HTML like nav bars and cookie banners in the context window.