Show HN: WUPHF — Karpathy-Style LLM Wiki with Markdown + Git as Source of Truth

WUPHF is an open-source collaborative office for AI agents (Claude Code, Codex, OpenClaw, local LLMs via OpenCode) that includes a Karpathy-style wiki layer. The wiki uses Markdown and Git as the source of truth, stored at ~/.wuphf/wiki/, with a bleve (BM25) + SQLite index on top. No vector or graph DB is used yet — the goal is to see how far Markdown + Git can go before adding heavier infrastructure.
Key Features
- Each agent gets a private notebook at agents/{slug}/notebook/ plus a shared team wiki at team/.
- Draft-to-wiki promotion flow: notebook entries are reviewed (by agent or human) and promoted to the canonical wiki with back-links. A state machine handles expiry and auto-archive.
- Per-entity fact log: append-only JSONL at team/entities/{kind}-{slug}.facts.jsonl. A synthesis worker rebuilds entity briefs every N facts.
- Commits are attributed to a distinct Git identity ("Pam the Archivist") for provenance via git log.
- [[Wikilinks]] with broken-link detection (rendered in red).
- Daily lint cron for contradictions, stale entries, and broken wikilinks.
- /lookup slash command + MCP tool for cited retrieval. A heuristic classifier routes short queries to BM25 and narrative queries to a cited-answer loop.
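
The post does not spell out the routing heuristic behind /lookup. As a minimal sketch, assuming a simple token-count threshold plus a few question-word cues (both hypothetical and not taken from WUPHF), a classifier of this kind could look like:

```python
import re

# Hypothetical cues and threshold, chosen for illustration only.
NARRATIVE_CUES = {"why", "how", "explain", "summarize", "compare"}
MAX_KEYWORD_TOKENS = 4


def route_query(query: str) -> str:
    """Return "bm25" for short keyword queries, "cited_answer" otherwise."""
    tokens = re.findall(r"[a-z0-9']+", query.lower())
    # Long queries are treated as narrative questions.
    if len(tokens) > MAX_KEYWORD_TOKENS:
        return "cited_answer"
    # Short queries that still ask for an explanation are also narrative.
    if NARRATIVE_CUES.intersection(tokens):
        return "cited_answer"
    return "bm25"
```

A rule-based router like this is cheap and easy to audit, at the cost of misrouting queries that fall outside the hand-picked cues.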
Retrieval Tuning
On the current benchmark (500 artifacts, 50 queries), BM25 alone reaches 85% recall@20, which is the internal ship gate. If any query class drops below that threshold, sqlite-vec is the pre-committed fallback.
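
For readers unfamiliar with the metric, recall@20 is the share of relevant documents that appear in the top 20 results. The sketch below shows one way to compute it and to flag query classes that fall under the 85% gate. It assumes recall is averaged per query within each class, which the post does not confirm; the function names and input shape are illustrative.

```python
def recall_at_k(ranked_ids, relevant_ids, k=20):
    """Fraction of relevant documents that appear in the top k results."""
    relevant = set(relevant_ids)
    if not relevant:
        return 0.0
    hits = relevant.intersection(ranked_ids[:k])
    return len(hits) / len(relevant)


def classes_below_gate(results, gate=0.85, k=20):
    """Return query classes whose mean recall@k is under the gate.

    results: iterable of (query_class, ranked_ids, relevant_ids) tuples.
    """
    per_class = {}
    for query_class, ranked, relevant in results:
        per_class.setdefault(query_class, []).append(
            recall_at_k(ranked, relevant, k)
        )
    return sorted(
        name
        for name, scores in per_class.items()
        if sum(scores) / len(scores) < gate
    )
```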
Substrate Choices
- Markdown for durability: the wiki outlives the runtime; users can git clone and walk away with every byte.
- Bleve for BM25.
- SQLite for structured metadata (facts, entities, edges, redirects, supersedes).
- Canonical IDs are first-class: fact IDs are deterministic (include sentence offset), slugs are assigned once and never renamed (redirect stubs used). Rebuild is logically identical, not byte-identical.
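
To illustrate why deterministic IDs make rebuilds logically identical, here is a minimal sketch that derives a fact ID from stable fields and appends the fact to the per-entity JSONL log. The hashing scheme, the record fields, and the function names are assumptions for illustration; the post only states that IDs are deterministic and include the sentence offset.

```python
import hashlib
import json
from pathlib import Path


def fact_id(kind: str, slug: str, source: str, sentence_offset: int) -> str:
    """Derive a stable ID from fields that do not change across rebuilds."""
    key = f"{kind}|{slug}|{source}|{sentence_offset}"
    return hashlib.sha256(key.encode("utf-8")).hexdigest()[:16]


def append_fact(wiki_root: Path, kind: str, slug: str, source: str,
                sentence_offset: int, text: str) -> str:
    """Append one fact as a JSON line and return its ID."""
    log_path = wiki_root / "team" / "entities" / f"{kind}-{slug}.facts.jsonl"
    log_path.parent.mkdir(parents=True, exist_ok=True)
    record = {
        "id": fact_id(kind, slug, source, sentence_offset),
        "source": source,
        "sentence_offset": sentence_offset,
        "text": text,
    }
    # Append-only: existing lines are never rewritten.
    with log_path.open("a", encoding="utf-8") as f:
        f.write(json.dumps(record, sort_keys=True) + "\n")
    return record["id"]
```

Because the ID depends only on the entity, the source, and the sentence offset, re-ingesting the same observation yields the same ID, so an index rebuilt from the log can deduplicate without relying on insertion order.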
Known Limits
- 85% recall is not a universal guarantee — tuning ongoing.
- Synthesis quality depends on agent observation quality. The lint pass helps but is not a judgment engine.
- Single-office scope; no cross-office federation yet.
Demo & Install
A 5-minute terminal walkthrough is available at asciinema (script at ./scripts/demo-entity-synthesis.sh).
Install with: npx wuphf@latest
Build from source: git clone https://github.com/nex-crm/wuphf.git && cd wuphf && go build -o wuphf ./cmd/wuphf
The wiki ships as part of WUPHF but can be used standalone. MIT license, self-hosted, bring-your-own keys.
📖 Read the full source: HN LLM Tools