Researcher Builds Veracity-Checking Skill for Claude Code, Finds Hallucinations in Own Documentation

✍️ OpenClawRadar📅 Published: April 20, 2026🔗 Source
Ad

Veracity-Checking Skill Architecture

A researcher with a sleep science background from University of Miami built a Claude Code skill called /veracity-tweaked-555 that decomposes documents into atomic claims and verifies each one via web search. The tool uses 16 parallel agents across 4 waves per run and was built in collaboration with Claude Code (Opus 4.6), where Claude drafted the code while the researcher designed the methodology.

Self-Audit Results and Error Patterns

When the researcher ran the veracity checker on its own SKILL.md documentation, it scored 62 out of 100. The skill designed to catch hallucinations had hallucinated facts in its own documentation, including:

  • Fabricating a performance statistic ("3x more accurate" for SAFE, which the paper never claims)
  • Inflating a paper's improvement claim ("+35.5%" was actually +5.5% over SOTA)
  • Fabricating an acronym expansion for a real technique

After initial fixes, the score reached 80, then 84 after a third run. A week later, after a more rigorous convergence loop with 6 runs, 19 agents, and 35 additional fixes, it stabilized at 96.5/100. However, the v3 audit dropped to 74 because v1 fixes had introduced new errors (an understated token cost and an incomplete tool list).

The errors follow consistent patterns: attribution inflation (slightly stronger language than the source warrants), plausible-but-fabricated identifiers (PMIDs, arXiv IDs that look real but point to different papers), and stale statistics presented as current.

Ad

Context Engineering Challenge

A single audit run generates approximately 917K tokens across 16 agents, exceeding Claude Code's 200K context window. When Claude Code compacts conversations to stay within limits, it performs lossy compression. After a few compactions, the agent loses track of how findings relate to each other — which fix caused which regression, which claim contradicts which other claim. Individual facts (names, numbers, function signatures) survive better than the connections between them.

Claude's diagnosis was that relational information — causal chains, cross-references, multi-step dependencies — is harder to preserve in a summary than isolated facts.

Solution and Additional Skill Audits

The researcher solved this by building a companion skill called /context-engineer that predicts overflow before it happens and externalizes relational state to JSON files on disk. The design test: if you can /clear your entire conversation and resume from the state file alone, the architecture is correct.

Running veracity checks on other Claude Code skills revealed:

  • One skill had a fabricated paper title in its attribution section — the citation looked perfect (authors, venue) but the title was fabricated and the year was wrong
  • The same skill misattributed an audit framework to the wrong standards body, appearing in multiple locations
  • The /context-engineer skill had internal inconsistencies — prose said "5-10K tokens" while a table said "5-15K tokens" for the same metric

12 total fixes were needed across all skills. All passed at 95+ on 3 consecutive runs after corrections.

📖 Read the full source: r/ClaudeAI

Ad

👀 See Also

Claude Code Mastery: Open-source config system adds persistent memory and curated skills to Claude Code CLI
Tools

Claude Code Mastery: Open-source config system adds persistent memory and curated skills to Claude Code CLI

Claude Code Mastery is an open-source configuration system that adds persistent memory across sessions, smart lifecycle hooks, and 26+ curated skills to Claude Code CLI. It includes a 6-file Memory Bank per project, zero-config launcher, and cross-platform support.

OpenClawRadar
Hollow AgentOS Reduces Claude Code Token Usage by 68.5% with JSON-Native OS Approach
Tools

Hollow AgentOS Reduces Claude Code Token Usage by 68.5% with JSON-Native OS Approach

Hollow AgentOS, a JSON-native operating system layer for AI agents, reduces Claude Code token usage by 68.5% by eliminating wasteful shell command overhead. The tool plugs into Claude Code via MCP and runs local inference through Ollama.

OpenClawRadar
Soul MCP Server Adds Persistent Memory and Safety for Local LLMs
Tools

Soul MCP Server Adds Persistent Memory and Safety for Local LLMs

Soul is an open-source MCP server that provides persistent memory across sessions for local LLMs with two commands: n2_boot at start and n2_work_end at end. It includes Ark safety features that block dangerous commands like rm -rf and DROP DATABASE at zero token cost, plus cloud storage configuration.

OpenClawRadar
MCP Gateway for Secure Remote Access to Internal Tools
Tools

MCP Gateway for Secure Remote Access to Internal Tools

An open-source MCP gateway aggregates multiple MCP tool servers into a single connection, enabling secure access via Claude Desktop without exposing public endpoints. It uses OpenZiti/zrok for zero-trust networking and requires only one configuration entry with a share token.

OpenClawRadar