Offload Routine Claude Code Work to Gemma via offload-mcp MCP Server

✍️ OpenClawRadar📅 Published: May 6, 2026🔗 Source
Offload Routine Claude Code Work to Gemma via offload-mcp MCP Server
Ad

The offload-mcp tool is an MCP server designed to let Claude offload routine work to free model APIs, specifically using Gemma via the Google GenAI API. The goal is to avoid burning expensive Claude Code or Codex context on small tasks and to bypass the performance limitations of running local models on hardware like a MacBook Air.

Key Details

  • Handles: commit messages, PR summaries, translations, docstrings, source diff/file summaries, and freeform prompts.
  • Freeform mode: send almost any routine prompt to a cheaper model instead of using Claude.
  • Source-based mode: reads local diffs/files directly through the MCP server and reports estimated primary input tokens saved.
  • Default model chain uses Gemma, but model IDs are configurable.
  • Repository: https://github.com/peterhadorn/offload-mcp

The author notes they didn't find another tool on GitHub to handle this, and the API path is much more practical for small jobs than running locally.

📖 Read the full source: r/ClaudeAI

Ad

👀 See Also