Pair Programmer Plugin Adds Live Screen, Voice, and Audio Context to Claude Code

By OpenClawRadar. Published: April 16, 2026

A developer has released Pair Programmer, a plugin that addresses Claude Code's lack of real-time context by giving it live perception of the desktop. The tool captures three data streams: screen content (visual indexing that generates short scene descriptions), microphone input (transcription plus lightweight intent classification that labels speech as a question, an explanation, or a command), and system audio (indexing of meetings, tutorials, or anything else playing on the machine).
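The article does not say how the plugin classifies intent. As a rough illustration of what "lightweight" classification into the three named labels could look like, here is a rule-based sketch. The word lists and the `classify_intent` function are hypothetical and are not taken from the plugin.

```python
import re

# Hypothetical cue words; the plugin's real classifier is not described in the source.
QUESTION_WORDS = ("what", "why", "how", "when", "where", "who", "which", "can", "does", "is")
COMMAND_VERBS = ("fix", "run", "open", "add", "remove", "rename", "refactor", "write", "delete")


def classify_intent(transcript: str) -> str:
    """Label a transcribed utterance as 'question', 'command', or 'explanation'."""
    text = transcript.strip().lower()
    words = re.findall(r"[a-z']+", text)
    first = words[0] if words else ""

    # A trailing question mark or a leading question word suggests a question.
    if text.endswith("?") or first in QUESTION_WORDS:
        return "question"

    # An imperative verb at the start (optionally after "please") suggests a command.
    if first == "please" and len(words) > 1:
        first = words[1]
    if first in COMMAND_VERBS:
        return "command"

    # Everything else is treated as the speaker explaining something.
    return "explanation"
```

For example, `classify_intent("Fix the import on line three")` falls into the command branch, while a sentence such as "So this function caches the result" falls through to the explanation label. A production system would more likely use a small model than keyword rules.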

Architecture and Implementation

The system uses a multi-agent pipeline rather than a single-model approach. It runs four specialized agents, with the first three working in parallel:

  • Screen reader for visual context
  • Voice processor for microphone transcription and intent classification
  • Audio classifier for system audio
  • Orchestrator that correlates all inputs and synthesizes a single response
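The parallel layout in the list above can be sketched with `asyncio`. This is a minimal illustration under stated assumptions, not the plugin's code: the agent functions are stand-ins that simply echo their input, and the names `Observation`, `orchestrate`, and `perceive` are hypothetical.

```python
import asyncio
from dataclasses import dataclass


@dataclass
class Observation:
    source: str   # "screen", "voice", or "system_audio"
    summary: str  # short text produced by the agent


# Each agent is a placeholder for a real model call (assumption).
async def screen_reader(frame_description: str) -> Observation:
    await asyncio.sleep(0)  # yield control, as a real network call would
    return Observation("screen", frame_description)


async def voice_processor(transcript: str) -> Observation:
    await asyncio.sleep(0)
    return Observation("voice", transcript)


async def audio_classifier(audio_label: str) -> Observation:
    await asyncio.sleep(0)
    return Observation("system_audio", audio_label)


def orchestrate(observations: list[Observation]) -> str:
    """Combine the per-agent observations into one context string."""
    return "\n".join(f"[{o.source}] {o.summary}" for o in observations)


async def perceive(frame: str, transcript: str, audio_label: str) -> str:
    # Run the three perception agents concurrently; gather keeps argument order.
    observations = await asyncio.gather(
        screen_reader(frame),
        voice_processor(transcript),
        audio_classifier(audio_label),
    )
    return orchestrate(list(observations))
```

Calling `asyncio.run(perceive("editor showing a stack trace", "why is this failing?", "silence"))` returns one string with a tagged line per stream. In a real system the orchestrator would presumably be a model that reasons over these observations rather than a string join.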

The plugin is built on VideoDB infrastructure. Indexing currently uses cloud models, but the design is model-agnostic: the Index layer can swap in any VLM or LLM. The developer mentions interest in wiring in local models for the visual description and transcription layers.
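One common way to get this kind of model-agnostic layer is to code against a small interface and inject the backend. The sketch below shows that pattern only; it does not use or represent the VideoDB API, and `Describer`, `CloudDescriber`, `LocalDescriber`, and `IndexLayer` are all hypothetical names with placeholder behavior.

```python
from typing import Protocol


class Describer(Protocol):
    """Anything that can turn raw captured data into a short text description."""

    def describe(self, payload: bytes) -> str: ...


class CloudDescriber:
    # Placeholder for a hosted VLM/LLM call (assumption).
    def describe(self, payload: bytes) -> str:
        return f"cloud description of {len(payload)} bytes"


class LocalDescriber:
    # Placeholder for a locally hosted model (assumption).
    def describe(self, payload: bytes) -> str:
        return f"local description of {len(payload)} bytes"


class IndexLayer:
    """Stores descriptions without knowing which model produced them."""

    def __init__(self, describer: Describer) -> None:
        self.describer = describer
        self.entries: list[str] = []

    def index(self, payload: bytes) -> str:
        text = self.describer.describe(payload)
        self.entries.append(text)
        return text
```

Switching from cloud to local inference then means constructing `IndexLayer(LocalDescriber())` instead of `IndexLayer(CloudDescriber())`, with no change to the indexing code itself.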

Current Status and Installation

The plugin is currently macOS only. Installation requires three commands. The GitHub repository is available at https://github.com/video-db/claude-code/tree/main.

The developer is seeking feedback on the architecture, specifically on whether others building desktop perception systems prefer a multi-agent pipeline of specialized models plus orchestration, or a push toward a single-model, end-to-end solution.

Full source: r/ClaudeAI
