Torrix: Self-Hosted LLM Observability Without Postgres or Redis

Torrix is a self-hosted LLM observability tool designed for teams who want to see what their agents are doing in production without the overhead of Postgres, Redis, or complex infrastructure. It runs as a single Docker container backed by SQLite. The full install is:
curl -o docker-compose.yml https://raw.githubusercontent.com/torrix-ai/install/main/docker-compose.community.yml
docker compose upNo external dependencies. All data stays in a local SQLite file on your machine. After startup, open http://localhost:8088 and create an account.
Key Features
- LLM call logging via HTTP proxy or Python/Node.js SDK: tokens, cost, latency, full prompt and response traces, reasoning token capture.
- Provider support: OpenAI, Anthropic, Gemini, Groq, Mistral, Azure OpenAI, and any OpenAI API-compatible endpoint.
- Cost forecasting and hard budget caps
- PII masking
- Model routing rules
- Evals with golden runs and AI judge
- Prompt library with version history
- Run tags for filtering by environment
- MCP server so AI Assistants can query your own logs
- OTLP/HTTP ingestion for apps already using OpenTelemetry
SDK Usage Example (Python)
pip install torrix
import torrix
from openai import OpenAI
torrix.init(api_key="<your-torrix-api-key>", base_url="http://localhost:8088")
client = torrix.wrap(OpenAI(api_key="<your-openai-key>"))
response = client.chat.completions.create(
model="gpt-4o-mini",
messages=[{"role": "user", "content": "Hello!"}],
torrix_name="my-run",
)
print(response.choices[0].message.content)
The Node.js SDK is also available via npm install.
Licensing and Scaling
Community edition is free for one user with 7-day retention. Pro adds teams, RBAC, 30-day retention, API key management, full text search, and audit logs. SQLite doesn't scale to high write throughput; this is aimed at teams logging hundreds to low thousands of LLM calls per day, not millions.
📖 Read the full source: HN LLM Tools
👀 See Also

Benchmark Results for Small Local and OpenRouter Models on Agentic Text-to-SQL Task
A developer tested multiple small local and OpenRouter models using a custom agentic text-to-SQL benchmark that converts English queries to SQL with debugging rounds. The benchmark includes 25 questions, runs in under 5 minutes, and reveals top performers like kimi-k2.5 and Qwen 3.5 variants.

Swarm Orchestra v2 Plugin Adds Inter-Agent Messaging to Fix Claude Code Agent Team Chaos
Swarm Orchestra is a plugin that addresses Claude Code's experimental TeamCreate feature, which can spawn runaway agents. Version 2 adds inter-agent messaging via a PreToolUse hook and self-configuration via a /teammate skill.

Open-source structural hallucination checker for AI agent pipelines
A new open-source tool provides four suppressors to catch structural failures in AI agent pipelines, including grounding enforcement, prompt injection detection, JSON validation, and tool response verification. Available as both a REST API and MCP server with a free tier of 500 requests/month.

Nakkas MCP Server Generates Animated SVGs from AI Descriptions
Nakkas is an MCP server where AI constructs complete animated SVG configurations from descriptions, rendering clean animated SVGs with shapes, gradients, animations, and filters. It supports parametric curves, 15 filter presets, CSS @keyframes and SMIL animations, and works anywhere SVG renders.