engram v3.4.0 Adds Anthropic Plugin to Keep Claude Code Running Under New Rate Limits

engram v3.4.0 addresses the recent rate-limit reductions and the impending removal of Claude Code from the Pro tier by exposing a dedicated Anthropic plugin. The plugin bundles an MCP server config that instantiates a shared memory layer locally. That layer survives file edits and IDE switches without extra latency.
Key Features
- Three new skills accessible via slash commands in Claude Code: /engram:cost for token spend queries, /engram:query for fast context retrieval, and /engram:mistakes to surface recent execution errors.
- Zero-config MCP integration: the MCP server runs locally, so the context spine is instantiated the first time a skill runs, with no additional setup.
- Cross-IDE persistence: the shared memory layer persists across file edits and even across different IDEs, enabling continuity.
Installation
CLI (one line):
npm install -g engramx@latest && engram setup  # detects Claude Code automatically
Via Claude Code marketplace: Once the listing appears, run /plugin install engram.
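To confirm the CLI install took effect, a short check along these lines may help. It is a minimal sketch, not part of engram: the have helper is defined here for illustration, and it assumes the Claude Code CLI is on PATH as claude. Whether an engram entry shows up in the claude mcp list output depends on how the plugin registers its server, so treat that step as a hint rather than a guarantee.

```shell
#!/bin/sh
# Post-install sanity check (a sketch; `have` is a helper defined here, not part of engram).

# have NAME: succeeds if NAME resolves to a command on PATH.
have() {
  command -v "$1" >/dev/null 2>&1
}

# The install step above should put an `engram` binary on PATH.
if have engram; then
  echo "engram: found at $(command -v engram)"
else
  echo "engram: not found; re-run 'npm install -g engramx@latest'"
fi

# Confirm the global npm package is present (npm ls exits non-zero if it is missing).
if have npm && npm ls -g engramx >/dev/null 2>&1; then
  echo "engramx: installed globally"
else
  echo "engramx: not listed by npm"
fi

# Assumption: the Claude Code CLI is installed as `claude`.
# `claude mcp list` prints the MCP servers Claude Code knows about.
if have claude; then
  claude mcp list
else
  echo "claude: CLI not found; skipping MCP server listing"
fi
```

Each step only reports what it finds, so the script is safe to run before or after engram setup.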
What It Solves
Claude Code users have faced sudden rate-limit reductions, and the product's removal from the Pro tier is looming. engram's plugin provides a local memory layer that avoids extra latency, helps manage API consumption (via cost queries), and speeds up recovery from errors (via mistake surfacing). The MCP server runs locally, so it introduces no dependency on an external service.
Who It's For
Developers who rely on Claude Code and need to work around tighter rate limits while maintaining continuity across sessions.
Resources
- Source code & issues: github.com/NickCirv/engram
- npm package: engramx
📖 Read the full source: r/ClaudeAI