VoidLLM: Zero-Knowledge Proxy for Ollama and vLLM with Team Access Control

VoidLLM is a proxy server that sits between your applications and local LLM servers like Ollama and vLLM. It adds organization and team access control, API key management, usage tracking, and rate limiting without ever seeing your prompts or content.
Key Features
- OpenAI-compatible — works with any SDK that supports the OpenAI API format
- Provider adapters for Ollama, vLLM, Anthropic, Azure, and OpenAI
- <2ms proxy overhead
- Rate limiting per organization, team, or API key (distributed via Redis)
- Cost tracking and analytics dashboard
- Zero content logging — only metadata (who accessed what model and how many tokens were used)
Use Case
If you're running Ollama or vLLM locally and want to share it across a team with proper access control and usage visibility, this proxy provides those capabilities while maintaining privacy through its zero-knowledge architecture.
The tool is available on GitHub at github.com/voidmind-io/voidllm.
📖 Read the full source: r/LocalLLaMA
👀 See Also

Claude Code vs. Codex: Real-World Build Test – 36 Files vs. 28, Infinite Loop, and $0.46 Cost Difference
A developer pits Claude Code against Cursor's Codex on two real tasks: a PR triage bot and a WebSocket code review UI. Claude built 36 files in 12 minutes with zero TypeScript errors; Codex produced a working UI but hit an infinite React loop. Cost difference: ~$0.46.

Sense: Go SDK for LLM-powered test assertions and structured text extraction
Sense is a Go SDK that uses Claude for two main functions: evaluating non-deterministic output in tests with plain English assertions, and extracting typed structs from unstructured text through reflection and forced tool_use.

Data Analyst Builds Prompt Calibrator Tool with Claude, No Prior Frontend Experience
A data analyst with no HTML, CSS, or JavaScript experience built Prompt Calibrator, a client-side web tool that structures AI prompts through a form with four fields and four modes. The tool was developed using Claude as a code review partner and is hosted on GitHub Pages.

Detecting Silent Tool Failures in AI Coding Agents with Vibeyard
Vibeyard is a tool that detects when AI coding agents experience silent tool failures—where agents fall back to alternative strategies without alerting developers—and surfaces these inefficiencies during sessions. It can suggest fixes to prevent repeated inefficient workflows.