Developer Switches from Cursor Composer 2 and Kimi 2.6 to Qwen3.6:35b-a3b for Enterprise Workloads

A developer on r/LocalLLaMA reports successfully replacing Cursor Composer 2 and Kimi 2.6 with Qwen3.6:35b-a3b for daily software development on a 500,000-700,000 line enterprise codebase (60 hours/week). The user previously tried Kimi 2.6 and DeepSeek 4 Pro/Flash but found Qwen3.6:35b-a3b to be the best fit.
Key Details
- Model: Qwen3.6:35b-a3b (the 3.6 version with 35b parameters and a 3b activated subset via MoE? — the user's notation is ambiguous; likely Qwen2.5-32B or a custom variant). The model supports image/screenshot input.
- Hosting: Run via OpenRouter at approximately $0.08 per 1M tokens averaged after caching and billing adjustments. The user lacks hardware for local inference.
- Workload: Full-time development on a large enterprise software suite. The user claims the model “actually understands” the codebase and task context, surpassing prior options.
- Missing feature: The only drawback noted is the lack of Cursor's cloud agents functionality and high throughput on Composer 2.
Cost Comparison
At ~$0.08/1M tokens, Qwen3.6:35b-a3b is described as “insanely cheap” for its capability level. No exact breakdown is given, but caching and usage discounts apply.
Who It's For
Developers working with large proprietary codebases who want a capable, low-cost model for AI-assisted coding without requiring local GPU hardware.
📖 Read the full source: r/LocalLLaMA
👀 See Also

Claude Code v2.1.74 System Prompt Updates: Security Rules, Memory Selection, and New Skills
Claude Code v2.1.74 adds 1,750 tokens to system prompts including new security monitor rules blocking unauthorized external writes, a /stuck skill for diagnosing frozen sessions, and memory selection improvements that skip redundant API references.

Reddit discussion highlights shift from chatbots to autonomous agents with local execution
A Reddit post distinguishes chatbots from autonomous agents using concrete examples and notes the trend toward local execution with models like LLaMA running on private workstations.

Claude Opus 4.6 accuracy drops on BridgeBench hallucination test
Claude Opus 4.6 shows a significant drop in accuracy on the BridgeBench hallucination test, falling from 83% to 68% according to BridgeMind AI's Twitter post.

Minimax M2.7 and Scaling to 100k+ OpenClaw Instances Discussed in Ecosystem Session
Jim and AndyML hosted the Minimax team to discuss Minimax M2.7 and how they scaled their hosting environment to support over 100,000 OpenClaw instances. The session attracted 100-110 users from Discord and 350,000+ viewers on a Chinese simulcast.