OpenJet v0.4: Zero-Config Local Coding Agent with llama.cpp Backend

OpenJet v0.4 is now available — an open-source terminal coding agent that runs local LLMs via llama.cpp with zero manual configuration. It gives you a Claude Code-style workflow without any API keys, running entirely on your own hardware.
What It Does
- Auto-detects your hardware (RTX, Mac, Jetson, CPU-only, etc.) and configures the llama.cpp backend accordingly.
- Reads files, searches your repository, edits code, and runs shell commands.
- Asks for permission before executing shell or write actions.
- Stores persistent memory across sessions, both per-project and system-wide.
Installation
curl -fsSL https://www.openjet.dev/install.sh | bash
openjet setup
openjetNew in v0.4
- Full new user interface (TUI).
- Improved model support for smaller GPUs.
- Multi-step coding agent capabilities.
Who It's For
Developers who want a local, private coding agent without the hassle of manually configuring models, backends, or permissions.
📖 Read the full source: r/LocalLLaMA
👀 See Also

AGI in md: 11 Cognitive Compression Levels for Claude System Prompts
A GitHub repository documents 11 levels of cognitive compression that can be encoded in Claude system prompts, with Level 8 shifting from analysis to construction and improving Haiku's performance from 0/3 to 4/4. The project includes 28 prompts, 299 raw outputs, and full experiment logs across 19 domains.

Using a Local LLM as a Claude Code Subagent to Reduce Context Usage
A Reddit user demonstrates how Claude Code can delegate tasks to a local LLM running via LM Studio, keeping file content out of Claude's context. The setup uses a ~120-line Python script with LM Studio's tool-calling API to handle file operations locally.

Open-source tool for AI-curated Reddit feeds using Cloudflare, Supabase, and Vercel
A developer open-sourced a self-hosted tool that filters Reddit for quality posts about AI-assisted development, using Cloudflare Workers for cron jobs and proxies, Supabase for storage, and Vercel for the frontend. The tool includes engagement scoring, optional LLM summaries, and costs $1-2/month for AI processing.

Recall: A Persistent Memory MCP Server for Claude Code
Recall is an open-source MCP server that gives Claude Code persistent memory across sessions via semantic search with embeddings. It includes four lifecycle hooks: session-start, observe, pre-compact, and session-end.