OpenClaw Context Meter Plugin Shows Telegram Token Usage Percentage

✍️ OpenClawRadar📅 Published: March 30, 2026🔗 Source
OpenClaw Context Meter Plugin Shows Telegram Token Usage Percentage
Ad

What It Does

The openclaw-context-meter plugin automatically shows token usage percentage after every Telegram bot response. After each response, it sends a small footer like: 📊 45k / 200k (22%). When compaction happens (tokens drop significantly), it shows: 📊 30k / 200k (15%) — compacted from 150k.

The Problem It Solves

Previously, there was no easy way to see how full the context window is without constantly typing /status. The plugin provides automatic visibility into token consumption.

Development Journey

v1 — The OOM Disaster: Initially used execSync("openclaw models list --json") to dynamically discover model context windows. This spawned a full OpenClaw process (~2GB RAM) every time the plugin loaded. With the plugin loading 4-5 times at startup (once per agent/runtime), this caused: 2GB gateway + 5 × 2GB subprocesses = 12GB → instant OOM. The OOM killer took out sshd and NetworkManager, making servers completely unreachable, creating an infinite restart loop.

v2 — The Lightweight Fix: Hardcoded context windows for 40+ models. Zero subprocesses, zero memory overhead. Key realization: never use execSync in OpenClaw plugins, as even a simple CLI query spawns the entire runtime with all plugins and TypeScript compilation.

Why No Fork Needed

The plugin originally forked OpenClaw to patch before_compaction/after_compaction hooks, but upstream changes made this unnecessary:

  • v2026.3.13+ — upstream now passes sessionId + agentId + sessionKey in compaction hook context
  • v2026.3.22+ — built-in 🧹 Compacting context... notifications (issue #38805) made their compaction code unnecessary
  • v2026.3.22+ — built-in /usage tokens|full|cost command for basic token display

The plugin now focuses on what's still missing: context window percentage display.

Ad

Features

  • Zero-cost — uses agent_end + message_sent hooks only, no extra API calls
  • No subprocesses — model context windows are hardcoded (no execSync OOM risk)
  • Smart filtering — skips tool_use turns, only sends footer after final text response
  • Debounced — waits 1.5s after last message to avoid footer mid-stream
  • Multi-agent — works with multiple agents and Telegram accounts
  • Compaction detection — detects token drops and shows before/after stats

Known Limitations

  • Some providers (like Qwen) return totalTokens: 0 — footer won't show for those models
  • Hardcoded context windows might be wrong for newer models — pulled from v2026.3.22 source
  • Telegram only for now (sends footer via Bot API)

Installation

cd ~/.openclaw/extensions
npm pack openclaw-context-meter
tar xzf openclaw-context-meter-*.tgz
mv package context-meter
rm openclaw-context-meter-*.tgz

Add to openclaw.json:

{
  "plugins": {
    "allow": ["context-meter"],
    "entries": {
      "context-meter": {
        "enabled": true
      }
    }
  }
}

Requires OpenClaw >= 2026.3.22.

📖 Read the full source: r/openclaw

Ad

👀 See Also

Claudlytics: Self-Hosted Dashboard for Tracking Claude Code Token Usage and Costs
Tools

Claudlytics: Self-Hosted Dashboard for Tracking Claude Code Token Usage and Costs

Claudlytics is a Node.js web server that reads Claude Code's local .jsonl session files to provide real-time tracking of token usage and costs. It runs locally on 127.0.0.1 and can be accessed via SSH tunnel for remote servers.

OpenClawRadar
Open-Source Claude IDE Bridge Connects Dispatch, Desktop App, and Claude Code
Tools

Open-Source Claude IDE Bridge Connects Dispatch, Desktop App, and Claude Code

The claude-ide-bridge is an MIT-licensed open-source tool that connects Claude Code to your IDE, providing access to LSP, debugger, terminals, git, and GitHub through 124 tools. It enables a workflow where tasks sent via Dispatch from a phone are handled by the Claude desktop app, which uses Claude Code to write code and run tests while interacting with the IDE.

OpenClawRadar
Dual-model architecture reduces token consumption by half for long conversations
Tools

Dual-model architecture reduces token consumption by half for long conversations

A developer built a dual-model system where a small 'subconscious' model compresses conversation history in the background, allowing the main model to work with a curated ~35K context instead of 120K tokens of raw history. This architecture cuts token consumption roughly in half for sustained project work.

OpenClawRadar
Akemon: Publish and Hire AI Coding Agents Directly from Your Laptop
Tools

Akemon: Publish and Hire AI Coding Agents Directly from Your Laptop

Akemon is a tool that lets developers publish their AI coding agents with one command and hire others' agents with another, working directly from laptops through a relay tunnel without needing servers. It's protocol-agnostic, supporting agents from Claude Code, Codex, Gemini, OpenCode, Cursor, and Windsurf.

OpenClawRadar