Qwen3.6:27b + Custom Go Agent: A Local Alternative to Claude Code

A developer (codehamr) who runs a local LLM integration business describes experimenting with local models as a fallback for Claude Code. They report that Qwen3.6:27b at Q8, running on an RTX 6000 with 96GB and a 128k context, delivers a coding experience similar to Claude Code, and they expect a consumer RTX 5090 with 32GB at Q4_M to come close, provided the prompting is disciplined.
For the agent layer, they built codehamr, a minimal single Go binary released as open source under the MIT license, with no plugins, no MCP, and no themes. The agent handles search, dependencies, and file work by running bash on demand. The repo is available at https://github.com/codehamr/codehamr
Key Details
- Model: Qwen3.6:27b at Q8 quantisation, running on an RTX 6000 (96GB) with 128k context, which is overkill for a model of roughly 30B parameters.
- Consumer alternative: an RTX 5090 (32GB) at Q4_M should give a similar coding experience with good prompting discipline.
- Agent build: a custom Go binary that is minimal, with no plugins and no MCP. It uses bash for search, dependencies, and file operations.
- License: MIT, open source. Readers are free to fork it or ignore it.
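The hardware figures in the list above can be sanity-checked with rough arithmetic. The sketch below assumes that "27b" means 27 billion parameters and uses nominal 8 and 4 bits per weight. Real quantisation formats store extra scaling data, and the KV cache for a long context needs memory on top of the weights, so treat the results as lower bounds rather than measurements.

```go
package main

import "fmt"

// weightGB estimates the size of the model weights in gigabytes
// (1 GB = 1e9 bytes) from a parameter count and a nominal number of
// bits per weight. Quantisation overhead and KV cache are NOT included.
func weightGB(params, bitsPerWeight float64) float64 {
	return params * bitsPerWeight / 8 / 1e9
}

func main() {
	const params = 27e9 // assumption: "27b" = 27 billion parameters

	fmt.Printf("Q8 (nominal 8 bits): %.1f GB\n", weightGB(params, 8))
	fmt.Printf("Q4 (nominal 4 bits): %.1f GB\n", weightGB(params, 4))
}
```

Under these assumptions the weights come to about 27 GB at 8 bits and about 13.5 GB at 4 bits. That is consistent with the claims above: a 96GB card has ample room left for a 128k context at Q8, while a 32GB card has headroom for the model only at around 4 bits.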
The developer emphasizes that every step toward local LLMs reduces dependency on cloud tools. This setup, combined with disciplined prompting, is the first local configuration where they don't miss Claude Code.
📖 Read the full source: r/LocalLLaMA