2-Prompt System to Carry Context Between Claude Chats Without Token Waste

A Reddit user posted a practical system to solve the problem of hitting Claude's chat length limit mid-project. The solution uses two prompts: one for context extraction from the old chat, another for initialization in the new chat.
The Problem
Long Claude conversations slow down, hit length limits, or get expensive on API. Common workarounds—manual summarization, copy-pasting entire chats, or starting fresh—lose critical decisions, waste tokens, or require re-explaining.
The System
Prompt 1: Context Extraction (paste in old chat)
Instructs Claude to compress the entire conversation into a structured 9-section summary: Objective, Key Context, Decisions Made, Work Completed, Current State, Next Steps, Open Questions/Blockers, Critical Data/Assets, and Style & Preferences. The output must be placed inside one clean code block. Target length: 300–600 words, preserving specifics over generalities.
Prompt 2: Chat Initialization (paste in new chat)
You paste the extracted context block and instruct the new chat to treat it as source of truth. It asks Claude to verify understanding, flag any gaps, and resume from Next Steps rather than restarting.
The exact prompts are provided in the source. For example, the extraction prompt's output structure uses these headers:
- Objective
- Key Context
- Decisions Made (with reasons)
- Work Completed
- Current State
- Next Steps (mark immediate next with →)
- Open Questions / Blockers
- Critical Data / Assets
- Style & Preferences
The initialization prompt includes: "Treat the context as established. Do not re-frame or restart. Maintain all decisions and preferences listed." and asks for a confirmation reply with the objective, immediate next action, and any gaps.
Who It's For
Developers using Claude (API or web) who need to continue long-running projects without losing context or burning tokens on redundant explanations.
📖 Read the full source: r/ClaudeAI
👀 See Also

Be My Butler: Multi-Agent Pipeline for AI Code Verification
Be My Butler is an open-source multi-agent pipeline where different AI models review each other's code through blind verification. The system addresses the problem of AI agents incorrectly reporting their own code as functional.

Claude Code Auto Mode: Safer Alternative to Skipping Permissions
Claude Code now offers auto mode, a permissions mode where Claude makes permission decisions with safeguards monitoring actions before execution. It's available as a research preview for Team plan users, with Enterprise and API rollout coming soon.

OpenSwarm: Multi-Agent Claude CLI Orchestrator for Linear and GitHub
OpenSwarm orchestrates multiple Claude Code CLI instances as autonomous agents that pull Linear issues and run Worker/Reviewer/Test/Documenter pipelines. It uses LanceDB with multilingual-e5 embeddings for memory and includes Discord bot control, PR auto-improvement, and a web dashboard.

Visual Studio 2022 Extension Adds Native Ollama Integration for Local LLMs
A free extension for Visual Studio 2022 connects directly to local Ollama endpoints, enabling private AI coding assistance without switching between tools. It supports models like DeepSeek and Llama 3 with cloud fallback options.