Agent Harness Outside the Sandbox: Durable Execution & Cold Starts

Mendral's blog argues that the agent harness — the loop that drives an LLM by sending prompts, executing tool calls, and feeding results back — should run outside the sandbox, especially for multi-user agents. They contrast two architectures and detail the three challenges they solved when adopting the outside model.
Two Architectures
- Harness inside the sandbox: The loop lives in the same container as the code it works on. Tool calls (bash, read, write) execute locally. Skills and memories are files on the container's filesystem. This is what Claude Code does locally. Simple execution model, but credentials are inside the sandbox, the sandbox is the session (losing it loses progress), and multi-user becomes a distributed filesystem problem.
- Harness outside the sandbox: The loop runs on the backend and calls into a sandbox over an API to execute tools. Credentials stay out of the sandbox (no permission model needed). Sandboxes can be suspended when idle, become cattle (survive failures), and multi-user sharing is a shared database problem, not a distributed filesystem one.
Three Challenges Solved
- Durable execution: Agent sessions can run hours and must survive deploys and failures. Mendral uses Inngest for checkpointing — each turn is a step, and the loop picks up where it left off if the server restarts.
- Sandbox lifecycle with low cold starts: The loop is suspended most of the time (e.g., during LLM calls). They use Blaxel to resume sandboxes from standby in ~25ms, avoiding seconds-long cold starts during interactive turns.
- Filesystem abstraction: With harness and sandbox on different machines, a shared filesystem is no longer available. Mendral notes they had to handle this, but the post focuses on the first two as the key solved problems.
The post concludes that the outside model is superior for multi-user setups despite the complexity of durable execution and cold start handling.
📖 Read the full source: HN AI Agents
👀 See Also

Claude CLI Directive Drift Issue Reported by Developer
A developer reports Claude CLI consistently ignores project directives stored in .claude folder files, particularly after auto-compact operations. The tool runs prohibited background processes and deletes task/session data despite explicit instructions.

AI Coding Agents Can Fragment Workflow and Drain Attention, Developer Warns
A 12-year web dev reports that using Claude Code daily leads to micro interruptions, loss of focus, and mental exhaustion — without measurable productivity gains.

FFmpeg Developer Accuses OxideAV of AI License Laundering in MagicYUV Issue
An FFmpeg developer has opened an issue on OxideAV's magicyuv repo, challenging the project's licensing and alleging AI-assisted license laundering of GPL code.

AI Reimplementation of chardet Library Raises Copyleft Licensing Questions
Dan Blanchard used Anthropic's Claude to reimplement the chardet Python library from scratch, changing the license from LGPL to MIT. The resulting code shows less than 1.3% similarity to previous versions, sparking debate about whether AI-assisted reimplementation erodes copyleft protections.