Toothcomb: Open-Source Real-Time Speech Fact-Checker Built with Claude Opus and Sonnet APIs

Toothcomb is an open-source tool for analyzing and fact-checking speech in real time. It accepts a speech transcript, an MP3 file (which it transcribes), or live audio from your microphone. The analysis runs in three stages using Claude Opus and Sonnet APIs.
How It Works
- Stage 1 — Chunking & Analysis: The text is broken into small parts (a few sentences each). Each part is sent to the Claude Opus API with detailed instructions to extract claims, promises, predictions, logical fallacies, and deceptive/manipulative language.
- Stage 2 — Fact-Checking: For claims requiring verification, Claude uses its existing knowledge or performs web searches via the API's web search tool in conjunction with the Sonnet API.
- Stage 3 — Final Review: The entire speech is reviewed holistically to catch contradictions, unfulfilled promises, or patterns invisible at the chunk level.
Technical Details
The architecture and high-level design were created by the author; most actual code was written by Claude Code/Opus 4.6. The author notes micromanaging Claude to a degree where any human developer would have resigned, but the collaboration felt genuine and the resulting code quality matched hand-written code, completed in much less time.
Who It's For
Developers building tools for media analysis, political debate fact-checking, or real-time conversational auditing who want a production-ready pipeline using Claude APIs.
📖 Read the full source: r/ClaudeAI
👀 See Also

Zoku: A Tool That Automatically Detects Repeated Workflows in Claude Code
Zoku is a local tool that hooks into Claude Code's event system to record tool actions across sessions, identifies repeated workflow patterns, and then informs Claude about these patterns so it can proactively suggest or execute them. It requires no configuration, has no dependencies, and stores everything locally in ~/.zoku/.

Claude Code documentation includes excessive React components inflating token counts
Analysis of Claude Code's LLM documentation reveals that MDX files contain massive inlined React components, with context-window.md using 18,501 tokens but only 551 tokens of actual documentation content.

Claude-context-lint tool audits token overhead in Claude Code projects
A new tool called claude-context-lint scans Claude Code projects to show how much context window is consumed by CLAUDE.md files, skills, MCP servers, and system prompts before user input. The tool provides specific recommendations for reducing token usage.

NLA Transforms Gemma 3’s Internal Activations into Readable Text for Any Token
Anthropic released Natural Language Autoencoders (NLA) that decode a model’s internal state into text. Paired with Gemma 3, the Auto Verbalizer explains what the model was “thinking” for any generated token. Weights are on Hugging Face; demo on Neuronpedia.