OpenJet v0.4: Zero-Config Local Coding Agent with llama.cpp Backend

OpenClawRadar · Published: May 2, 2026

OpenJet v0.4 is now available. It is an open-source terminal coding agent that runs local LLMs via llama.cpp with zero manual configuration, giving you a Claude Code-style workflow with no API keys, entirely on your own hardware.

What It Does

Auto-detects your hardware (RTX, Mac, Jetson, CPU-only, etc.) and configures the llama.cpp backend accordingly.
Reads files, searches your repository, edits code, and runs shell commands.
Asks for permission before executing shell or write actions.
Stores persistent memory across sessions, both per-project and system-wide.
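
To make two of the behaviors above concrete, here is a minimal shell sketch of hardware detection and a permission prompt. It is an illustration only, not OpenJet's actual implementation: the function names detect_backend and confirm_and_run are hypothetical, and the detection rules (nvidia-smi on PATH means CUDA, an arm64 Mac means Metal, anything else falls back to CPU) are simplifying assumptions that would miss some devices.

```shell
# Hypothetical helpers; none of these names come from OpenJet itself.

# Guess which llama.cpp backend suits this machine (rough heuristic).
detect_backend() {
  if command -v nvidia-smi >/dev/null 2>&1; then
    echo "cuda"    # an NVIDIA driver tool is on PATH
  elif [ "$(uname -s)" = "Darwin" ] && [ "$(uname -m)" = "arm64" ]; then
    echo "metal"   # Apple Silicon Mac
  else
    echo "cpu"     # fallback when no GPU is detected
  fi
}

# Run a command only after the user explicitly answers yes.
confirm_and_run() {
  printf 'Agent wants to run: %s\nAllow? [y/N] ' "$*" >&2
  read -r reply || reply=""
  case "$reply" in
    y|Y|yes|YES) "$@" ;;
    *) echo "Denied." >&2; return 1 ;;
  esac
}
```

For example, confirm_and_run git status prints the pending command, waits for an answer, and runs it only on y or yes. Defaulting to "deny" on any other input keeps an accidental Enter from executing anything.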

Installation

curl -fsSL https://www.openjet.dev/install.sh | bash
openjet setup
openjet

New in v0.4

A completely new terminal user interface (TUI).
Improved model support for smaller GPUs.
Multi-step coding agent capabilities.

Who It's For

Developers who want a local, private coding agent without the hassle of manually configuring models, backends, or permissions.

📖 Read the full source: r/LocalLLaMA
