Pair Programmer Plugin Adds Live Screen, Voice, and Audio Context to Claude Code

A developer has released Pair Programmer, a plugin that addresses Claude Code's lack of real-time context by providing live desktop perception. The tool captures three data streams: screen content (with visual indexing generating short scene descriptions), microphone input (transcription plus lightweight intent classification for questions, explanations, or commands), and system audio (indexing meetings, tutorials, or other audio playing on the machine).
Architecture and Implementation
The system uses a multi-agent pipeline rather than a single model approach. It runs specialized agents in parallel:
- Screen reader for visual context
- Voice processor for microphone transcription and intent classification
- Audio classifier for system audio
- Orchestrator that correlates all inputs and synthesizes a single response
The plugin is built on VideoDB infrastructure. While indexing currently uses cloud models, the design is model-agnostic—the Index layer can swap in any VLM or LLM. The developer mentions interest in wiring local models for visual description and transcription layers.
Current Status and Installation
The plugin is currently macOS only. Installation requires three commands. The GitHub repository is available at https://github.com/video-db/claude-code/tree/main.
The developer is seeking feedback on architectural approaches, specifically whether developers prefer the multi-agent pipeline with specialized models and orchestration or pushing toward a single model end-to-end solution for desktop perception systems.
📖 Read the full source: r/ClaudeAI
👀 See Also

SmallClaw v1.0.2 adds background task system for local LLMs
SmallClaw v1.0.2 introduces a background task engine that runs multi-step workflows autonomously, with step verification to address small model reliability issues. The update has been tested on 4B-class models like qwen3:4b on 8GB machines.

Reddit user shares detailed prompt for exporting personal knowledge from AI assistants
A Reddit user has created a comprehensive prompt for extracting structured personal knowledge from AI assistants like Claude, addressing perceived limitations in Anthropic's ChatGPT import feature. The prompt generates three distinct JSON artifacts covering personal knowledge bases, intellectual frameworks, and knowledge graphs.

CostClaw: Free Local Cost Tracking Dashboard for OpenClaw Agents
CostClaw is a free, local plugin that captures every LLM call via OpenClaw's native hooks and provides a dashboard showing model breakdowns, per-session costs, and hourly spend charts. The developer discovered their heartbeat agent was running Claude Sonnet every 3 minutes 24/7, costing $60/month, and switching to Haiku cut their bill by ~65%.

Local 35B MoE Model Drops Agent OS Code Failure Rate to 0%
A developer reports that switching a multi-agent system's runtime to Qwen 3.6 35B A3B (MoE, 3B active params) eliminated code failures, achieving 100% success rate through a 5-layer validation gate.