Caveman: A Claude Code Skill That Cuts 75% of Tokens by Using Caveman-Style Speech

What Caveman Does
Caveman is a Claude Code skill that makes Claude talk like a caveman, cutting approximately 75% of tokens while keeping full technical accuracy. The approach is based on the observation that caveman-speak dramatically reduces LLM token usage without losing technical substance.
Before and After Examples
The source provides specific examples of token reduction:
- Normal Claude (69 tokens): "The reason your React component is re-rendering is likely because you're creating a new object reference on each render cycle. When you pass an inline object as a prop, React's shallow comparison sees it as a different object every time, which triggers a re-render. I'd recommend using useMemo to memoize the object."
- Caveman Claude (19 tokens): "New object ref each render. Inline object prop = new ref = re-render. Wrap in useMemo."
- Normal Claude: "Sure! I'd be happy to help you with that. The issue you're experiencing is most likely caused by your authentication middleware not properly validating the token expiry. Let me take a look and suggest a fix."
- Caveman Claude: "Bug in auth middleware. Token expiry check use < not <=. Fix:"
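As a quick check on the headline figure, the token counts quoted for the React example can be turned into a percentage. This is a minimal sketch; `reduction_pct` is a hypothetical helper name, and the only inputs are the 69 and 19 token counts given above.

```python
def reduction_pct(before_tokens: int, after_tokens: int) -> float:
    """Return the percentage of tokens removed going from before to after."""
    if before_tokens <= 0:
        raise ValueError("before_tokens must be positive")
    return (before_tokens - after_tokens) / before_tokens * 100


# Token counts quoted in the React re-render example above.
print(f"{reduction_pct(69, 19):.1f}% fewer tokens")  # prints "72.5% fewer tokens"
```

For this one example the saving works out to about 72.5%, close to the approximate 75% quoted for the skill overall.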
Installation
Install with either of these commands:
npx skills add JuliusBrussee/caveman
Or through the Claude Code plugin system:
claude plugin marketplace add JuliusBrussee/caveman
claude plugin install caveman@caveman
Usage
Trigger caveman mode with these phrases:
- /caveman
- "talk like caveman"
- "caveman mode"
- "less tokens please"
Stop caveman mode with:
- "stop caveman"
- "normal mode"
What Caveman Changes and Keeps
Caveman removes:
- Filler words
- Articles (a, an, the)
- Pleasantries (e.g., "Sure I'd be happy to")
- Hedging (e.g., "It might be worth considering")
Caveman keeps:
- Code blocks (writes normally)
- Technical terms (e.g., polymorphism stays polymorphism)
- Error messages (quotes exactly)
- Git commits & PRs (writes normally)
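The source does not describe the skill's internals, so the following is only a toy illustration of the remove/keep rules listed above, not the skill's implementation. It strips articles and two sample pleasantries from prose while leaving fenced code blocks untouched. The names `cavemanize`, `ARTICLES`, `PLEASANTRIES`, and `FENCE` are all hypothetical.

```python
import re

# Articles to drop from prose (assumption: a simple word-boundary match is enough).
ARTICLES = re.compile(r"\b(?:a|an|the)\s+", re.IGNORECASE)
# Two sample pleasantries taken from the article's examples; a real list would be longer.
PLEASANTRIES = ("Sure! ", "I'd be happy to help you with that. ")
# Fenced code blocks are captured so they can be passed through unchanged.
FENCE = re.compile(r"(```.*?```)", re.DOTALL)


def cavemanize(text: str) -> str:
    """Strip articles and pleasantries from prose, leaving code fences as-is."""
    parts = FENCE.split(text)
    out = []
    for i, part in enumerate(parts):
        if i % 2 == 1:
            # Odd indices are the captured code fences: keep them verbatim.
            out.append(part)
            continue
        for phrase in PLEASANTRIES:
            part = part.replace(phrase, "")
        out.append(ARTICLES.sub("", part))
    return "".join(out)


print(cavemanize("Sure! The bug is in the parser."))  # prints "bug is in parser."
```

A rule-based filter like this is far cruder than what a language model does when asked to compress its own phrasing; it only makes the "remove prose filler, keep code" split concrete.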
Benefits and How It Works
The source claims these benefits:
- 75% tokens saved
- 100% technical accuracy maintained
- ~3x speed increase
- 75% less cost on output
- Faster responses due to fewer tokens to generate
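The cost and speed claims follow from simple arithmetic on output tokens, which can be sketched as below. The price used here is a made-up placeholder, not a figure from the source or from any real price list, and the assumption that generation time scales linearly with output tokens is a simplification.

```python
def output_cost(tokens: int, price_per_million: float) -> float:
    """Cost in USD of generating `tokens` output tokens at a per-million price."""
    return tokens / 1_000_000 * price_per_million


HYPOTHETICAL_PRICE = 15.0  # USD per million output tokens (assumption, for illustration only)
CLAIMED_SAVING = 0.75      # the 75% token reduction claimed by the source

normal_tokens = 1_000_000
caveman_tokens = int(normal_tokens * (1 - CLAIMED_SAVING))

normal_cost = output_cost(normal_tokens, HYPOTHETICAL_PRICE)
caveman_cost = output_cost(caveman_tokens, HYPOTHETICAL_PRICE)

# If generation time were purely proportional to output tokens,
# this ratio would be the upper bound on the speedup.
max_speedup = normal_tokens / caveman_tokens

print(f"normal: ${normal_cost:.2f}, caveman: ${caveman_cost:.2f}")
print(f"upper-bound speedup: {max_speedup:.1f}x")
```

Under these assumptions the output bill drops by the same 75% as the token count, and the speedup is capped at 4x. The ~3x figure claimed above sits below that cap, which would be consistent with some fixed per-request overhead, though the source does not say how the figure was measured.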
Caveman eliminates wasted tokens on phrases like:
- "I'd be happy to help you with that" (8 wasted tokens)
- "The reason this is happening is because" (7 wasted tokens)
- "I would recommend that you consider" (7 wasted tokens)
- "Sure, let me take a look at that for you" (10 wasted tokens)
Repository Details
The repository has 746 stars, 14 forks, and uses the MIT license. The latest release is v1.0.0 from April 4, 2026.
📖 Read the full source: HN AI Agents