OpenJet v0.4: Zero-Config Local Coding Agent with llama.cpp Backend

OpenClawRadar | Published: May 2, 2026

OpenJet v0.4 is now available. It is an open-source terminal coding agent that runs local LLMs via llama.cpp with zero manual configuration, giving you a Claude Code-style workflow without any API keys, entirely on your own hardware.

What It Does

  • Auto-detects your hardware (RTX, Mac, Jetson, CPU-only, etc.) and configures the llama.cpp backend accordingly.
  • Reads files, searches your repository, edits code, and runs shell commands.
  • Asks for permission before executing shell or write actions.
  • Stores persistent memory across sessions, both per-project and system-wide.
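
The permission step in the list above can be pictured as a small gate in front of every tool call. The following is a minimal sketch in Python, not OpenJet's actual code: the action names (read_file, search_repo, write_file, run_shell), the Action type, and the is_allowed and ask_on_terminal functions are all hypothetical, chosen only to illustrate "read freely, ask before writing or running commands".

```python
from dataclasses import dataclass
from typing import Callable

# Hypothetical action kinds; the source does not give OpenJet's internal names.
READ_ONLY = {"read_file", "search_repo"}
NEEDS_APPROVAL = {"write_file", "run_shell"}


@dataclass
class Action:
    kind: str    # e.g. "read_file" or "run_shell"
    target: str  # file path or shell command


def is_allowed(action: Action, confirm: Callable[[str], bool]) -> bool:
    """Return True if the action may proceed.

    Read-only actions pass without a prompt. Write and shell actions
    proceed only if the confirm callback returns True. Unknown kinds
    are denied.
    """
    if action.kind in READ_ONLY:
        return True
    if action.kind in NEEDS_APPROVAL:
        return confirm(f"Allow {action.kind}: {action.target}?")
    return False


def ask_on_terminal(question: str) -> bool:
    """Prompt on stdin; anything other than 'y' or 'yes' is a refusal."""
    return input(f"{question} [y/N] ").strip().lower() in {"y", "yes"}
```

A call such as is_allowed(Action("run_shell", "pytest -q"), ask_on_terminal) would prompt the user before the command runs. Passing the confirm function in as a parameter keeps the gate easy to test without a real terminal.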

Installation

# Download the install script and pipe it to bash
curl -fsSL https://www.openjet.dev/install.sh | bash
# Run the setup step
openjet setup
# Start the agent
openjet

New in v0.4

  • Completely new terminal user interface (TUI).
  • Improved model support for smaller GPUs.
  • Multi-step coding agent capabilities.

Who It's For

Developers who want a local, private coding agent without the hassle of manually configuring models, backends, or permissions.

📖 Read the full source: r/LocalLLaMA
