Handoffs Pattern in Claude Workflows: Two-File Split vs One-Doc Summary

Long Claude sessions degrade as their context decays. A handoff counters this by compressing the key information into a document and starting a fresh agent from it. Two implementations are under discussion in the community: Matt Pocock's /handoff skill and an alternative two-file split used in the APM multi-agent framework.
Matt Pocock's Handoff Skill
Pocock's skill compacts the conversation into a single document. It points at existing artifacts instead of restating them, and the next agent picks up from there. It also chains between threads: /grill-with-docs → /handoff → /prototype → /handoff back. The repo is available at mattpocock/skills.
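The "point at artifacts instead of restating them" idea can be illustrated with a short Python sketch. This is not Pocock's skill, which the source does not reproduce; the function name `build_handoff_doc`, its parameters, and the document layout are all hypothetical and chosen only for illustration.

```python
def build_handoff_doc(summary: str, artifacts: list[str], next_steps: list[str]) -> str:
    """Return a compact handoff document (hypothetical layout).

    Artifacts are listed by path only, so the document stays small and the
    next agent reads the originals instead of a lossy restatement.
    """
    lines = ["# Handoff", "", summary, "", "Read these first (not restated here):"]
    lines += [f"- {path}" for path in artifacts]
    lines += ["", "Next steps:"]
    lines += [f"{i}. {step}" for i, step in enumerate(next_steps, start=1)]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Example values are made up for the demonstration.
    print(build_handoff_doc(
        summary="Auth refactor is half done; token refresh is still failing.",
        artifacts=["docs/plan.md", "src/auth.py"],
        next_steps=["Fix token refresh", "Re-run the auth tests"],
    ))
```

Listing paths rather than pasting file contents keeps the handoff short, at the cost of relying on those artifacts still being accurate when the next agent reads them.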
Two-File Split Approach (APM Framework)
An alternative approach, built into the APM multi-agent framework for Claude Code back in May 2025, splits the handoff into two artifacts:
- Persistent narrative file — records what was done, decisions made, and why. This lives in the project and leaves a durable trail.
- Ephemeral prompt — tells the incoming agent how to rebuild context from the codebase and the persistent narrative file.
The key difference: the incoming agent reconstructs context from durable project state (codebase plus narrative), not only from a compressed chat conversation. Because the narrative persists, it stays visible when multiple agents are involved, so you can track which agent is working from a summary and which has firsthand context. That makes context gaps easier to manage.
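A minimal Python sketch of the two-file split follows. It is not the APM framework's implementation, which the source does not show. The `Handoff` class, the `context` field, the function names, and the file layout are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from pathlib import Path


@dataclass
class Handoff:
    """Hypothetical record of what the outgoing agent hands over."""
    agent: str
    done: list[str]
    decisions: dict[str, str] = field(default_factory=dict)  # decision -> rationale
    context: str = "firsthand"  # assumed label: "firsthand" or "summary"


def append_narrative(path: Path, handoff: Handoff) -> None:
    """Append a durable entry to the persistent narrative file in the project."""
    lines = [f"## Handoff from {handoff.agent} (context: {handoff.context})", "", "Done:"]
    lines += [f"- {item}" for item in handoff.done]
    lines += ["", "Decisions:"]
    lines += [f"- {what}: {why}" for what, why in handoff.decisions.items()]
    with path.open("a", encoding="utf-8") as fh:
        fh.write("\n".join(lines) + "\n\n")


def build_prompt(narrative_path: Path, next_task: str) -> str:
    """Build the ephemeral prompt: instructions for rebuilding context, not the context itself."""
    return (
        f"Read {narrative_path} for prior work and the reasons behind decisions.\n"
        "Inspect the codebase to confirm the current state before changing anything.\n"
        f"Then continue with: {next_task}\n"
    )


if __name__ == "__main__":
    narrative = Path("handoff_narrative.md")  # hypothetical file name
    append_narrative(narrative, Handoff(
        agent="agent-1",
        done=["Extracted the auth module"],
        decisions={"Kept sessions in Redis": "avoids a schema migration"},
    ))
    print(build_prompt(narrative, "Fix token refresh"))
```

The narrative file is appended to rather than overwritten, so each handoff adds to the trail. The prompt is only returned as a string, since it is meant to be discarded once the incoming agent has started.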
The author opened an issue on Pocock's repo with these ideas: mattpocock/skills#235.
Key Questions
- Is a single compressed document enough for handoffs?
- Or does the two-file split (persistent narrative + ephemeral prompt) provide better context reconstruction and multi-agent traceability?
The discussion is ongoing. Both approaches are valid: a single document suits a quick resume, while the two-file split suits long-running multi-agent workflows where context gaps need to be managed.
📖 Read the full source: r/ClaudeAI