AgentLens: Observability Tool for Multi-Agent AI Workflows

AgentLens is an observability tool designed for multi-agent AI workflows that combine local and cloud-based models. It addresses the challenge of tracking activities across different providers like Ollama, vLLM, Anthropic, and OpenAI.
Key Features
- Unified tracing across Ollama, vLLM, Anthropic, OpenAI, and other providers
- Cost tracking that includes local compute time converted to estimated cost
- MCP server for querying statistics from inside Claude Code
- CLI interface for quick inline checks with commands like
agentlens q stats - Self-hosted - runs on your machine with data staying local
Deployment and Usage
Deploy using Docker:
docker run -d -p 3100:3100 phoenixaihub/agentlens-collector
Wrap Ollama calls with one line of code:
const { client } = wrapOllama(ollama, { client: lens });
The dashboard displays agent flow, cost breakdowns, and latency by provider.
This tool is particularly useful for developers running mixed AI workflows where some tasks use local models (like Ollama) while others require cloud-based models (like Claude or GPT) for complex reasoning.
📖 Read the full source: r/LocalLLaMA
👀 See Also

OpenClaw Skill Usage Tracker: Monitor Which Skills You Actually Use
A developer built a tool to track OpenClaw skill usage analytics, including invocation counts, breakdowns by agent and channel, and top skill rankings over different time periods.

Gullivr Travel App Integrates with Claude via Remote MCP Server
A developer built Gullivr, a travel planning app with a remote MCP server that allows Claude to create and manage trips directly within the app. The integration enables real-time updates while chatting with Claude, eliminating manual copying between tools.

Keyoku Plugin Replaces OpenClaw's Static Heartbeat with Memory-Driven Autonomy
Keyoku is a free OpenClaw plugin that changes the agent's heartbeat from reading a static HEARTBEAT.md file to scanning the agent's actual memory store for stalled work, dropped commitments, conflicting information, and quiet relationships. It uses a local Go engine with SQLite + HNSW and offers three autonomy levels: observe, suggest, and act.

Browser-native real-time coherence control system for Claude with SDE bands and Kalman filtering
A developer has built a real-time coherence control harness that runs entirely as a Claude artifact in the browser, treating conversation as a stochastic process with live Monte Carlo SDE paths, dual Kalman filtering, and behavioral signal detection.