Local Multi-Agent AI Setup on WSL Using OpenClaw and Ollama

Architecture Overview
A developer has documented their local multi-agent AI setup running entirely on WSL Ubuntu 24.04 on Windows. The system uses OpenClaw 2026.2.26 as an open-source gateway that connects AI agents to messaging apps like Telegram, creating private AI infrastructure under full user control.
Agent Configuration
The setup consists of four specialized agents:
- Pluto - The coordinator that routes tasks to appropriate agents. Runs on OpenRouter (free tier).
- Hermes - Handles research, writing, web browsing, content tasks, and API integrations like YouTube. Uses OpenRouter.
- Vulcan - Coding and automation agent running 100% locally on Ollama with qwen2.5-coder model, resulting in zero API costs.
- Aegis - Security monitoring and read-only system auditing. Uses OpenRouter.
Technical Implementation Details
The stack includes:
- OpenClaw 2026.2.26
- Ollama with models: qwen2.5-coder, codellama, llama3.2
- OpenRouter API
- Telegram bots (one per agent)
- WSL Ubuntu 24.04
- systemd for process management
Cost and Configuration
Total spend is less than $0.01, with Vulcan being completely free (local Ollama). The other three agents use OpenRouter's auto-routing feature that selects the most cost-effective models. The developer set a $5/month hard cap on OpenRouter as a safety net.
Key Learnings
- WSL + systemd works effectively for running the gateway as a background service that survives reboots
- Ollama model auto-discovery in WSL has quirks - manual registration of provider config was required
- Delegation between agents works well once the coordinator's instructions are properly tuned
- The Chrome browser relay for live web access requires port 18792, not the gateway port (caused an hour of troubleshooting)
📖 Read the full source: r/openclaw
👀 See Also

OpenClaw User Switches to RunLobster for Managed Infrastructure
A developer spent 4 months troubleshooting OpenClaw issues including agent stalling, config breaks, and unpredictable API costs before switching to RunLobster. The same models and framework worked reliably with multi-step task completion and faster integrations.

Freight Driver Builds iOS App with Claude Code, Shares Practical Lessons
A freight driver in Japan with minimal coding experience used Claude Code to build an iOS app for new recordkeeping regulations, shipping it to the App Store in six months. He shares specific lessons about prompt engineering, unexpected costs with Expo and Supabase, and managing burnout.

Building a Voice Assistant with OpenClaw, Alexa, and Local LLM
A developer built a voice-first assistant using OpenClaw as the AI agent backbone, Alexa for voice input, and a local LLM (Ollama with Qwen 2.5 3B) to handle general knowledge queries with sub-second response times and reduced API costs.

Using Markdown Files as a Memory System for AI Coding Agents
A developer shares a method using {topic}_LOG.md and {topic}_SUMMARY.md files to persist conversations with Claude Code, solving compaction and agent restart issues by creating a dual memory system with detailed logs and indexed summaries.