Codeset improves coding agents with repo-specific context from git history

What Codeset does
Codeset runs a pipeline over your git history and generates files that live directly in your repository. These files include: past bugs per file with root causes, known pitfalls, co-change relationships, and test checklists. The coding agent reads these files as part of its normal context window. No RAG, no vector database at query time, no runtime infrastructure required—just static files your agent picks up like any other file in the repo.
Benchmark results
The team tested Codeset with two benchmarks:
- codeset-gym-python (150 tasks, same subset as Claude eval): 60.7% → 66% (+5.3 percentage points)
- SWE-Bench Pro (400 randomly sampled tasks): 56.5% → 58.5% (+2 percentage points)
This shows consistent improvement across both benchmarks, with smaller gains on SWE-Bench Pro compared to codeset-gym. The codeset-gym benchmark is public with full task list and verifiers available for methodology verification.
Pricing and availability
Codeset costs $5 per repository, one-time payment. Use code CODESETLAUNCH for a free trial. Full evaluation artifacts are available at https://github.com/codeset-ai/codeset-release-evals.
📖 Read the full source: r/LocalLLaMA
👀 See Also

BrowserKing: Open-Source Chrome Extension for Browser Control via Claude and Other Models
BrowserKing is a free, open-source Chrome extension that lets Claude and 15+ other models see and control your browser from a side panel. It takes screenshots, sends them to the model, and then acts on decisions to click buttons, fill forms, scroll, and navigate tabs.

Claude IDE Bridge: WebSocket Tool for Real-Time IDE Access
claude-ide-bridge is a WebSocket bridge that connects Claude Code directly to IDE internal state, enabling live diagnostics, go-to-definition, find references, type hovering, file opening, breakpoint management, and debugger state streaming.

OpenClaw as Infrastructure-as-Code Interface for Home Lab Management
OpenClaw transforms from AI gadget to primary computer interface for home lab management, executing tasks like configuring Traefik containers, creating Dashy configurations, and setting up Tailscale access with direct machine access.

OpenClaw Nerve WebUI adds voice control and team management dashboard
Nerve is a WebUI for OpenClaw that provides an all-in-one dashboard for monitoring and managing AI agents, with voice control via double-tap shift for Whisper and sub-agent team building capabilities.