sourcecode: Open-Source CLI to Compress Large Java/Spring Monorepos for Claude

An open-source CLI tool called sourcecode was released to compress large enterprise repositories into Claude-friendly context. Aimed at Java/Spring monorepos, it targets the problem of excessive token usage when feeding codebases to LLMs like Claude.
What It Does
The tool provides several features out of the box:
- Repo context compression — reduces a real Java Spring Boot + Angular monorepo (~4k files) from roughly ~3M tokens (manual estimate) to
--agentmode: ~5k structured tokens, or--compactmode: ~1.7k tokens. - Git hotspots / churn detection — identifies files with frequent changes.
- TODO / FIXME extraction — pulls all code annotations.
- Symbol lookup — across duplicated modules and projects.
- PR delta workflows — produces structured JSON/YAML output for Claude workflows.
Current Limitations
The author notes that deep Java semantics, Spring/MyBatis understanding, cross-file reasoning, and full architecture comprehension are still major works in progress. The tool is far more useful today as a context compression + navigation tool than as a generic AI code intelligence platform.
Future Focus
Going forward, the project will concentrate on:
- Java/Spring monorepos specifically
- Semantic symbol graphs
- Impact analysis
- Working-tree awareness
Installation
Available via PyPI and npm:
pip install sourcecode==1.0.0
# or
npm install sourcecode-npmGitHub repo: HarounDominique/sourcecode
Who It's For
Developers using Claude with large Java/Spring monorepos who need to reduce context size while preserving navigability and key code structure.
📖 Read the full source: r/ClaudeAI
👀 See Also

bunx ccusage Shows $18,450 in Credits Burned — Flat Plans Absorb the Cost
A user on r/ClaudeAI ran bunx ccusage and discovered $18,450 in credits used in May — 248M input tokens, 42M output tokens, 21.7B with cache reads — while paying only €400/month flat-rate for Claude Code and Codex.

Claude Design vs Huashu-Design: A Head-to-Head on HTML Layouts and Rate Limits
Claude Design builds HTML prototypes fast but hits rate limits quickly. Huashu-Design, an open-source Claude Code skill, runs on the normal subscription with no separate rate limit—but takes 20 minutes vs 5.

memv: Open-Source Memory System for AI Agents
memv is an open-source memory system designed for AI agents that stores only unexpected information from interactions, reducing noise and redundancy.

GitAgent: An Open Standard for Portable AI Agents in Git Repos
GitAgent is an open specification that defines AI agents through three core files in a git repository: agent.yaml for configuration, SOUL.md for personality/instructions, and SKILL.md for capabilities. The CLI allows running any agent repo directly with commands like npx @open-gitagent/gitagent run -r https://github.com/user/agent -a claude.