Kimi K2.6 vs Claude Opus 4.7: A Practical Coding Showdown on a Minetest Mod + Google Sheets Integration

What's the test?
A developer compared Kimi K2.6 and Claude Opus 4.7 on a two-part coding task: building a Minetest/Luanti bounty board game mod with a TypeScript backend, then extending it with Google Sheets logging through Composio. Both models got identical prompts and were measured on working result, code quality, debugging pain, time, token usage, and cost.
Setup: Claude Opus 4.7 via Claude Code, Kimi K2.6 via OpenCode on OpenRouter. Same repo, same success criteria.
Test 1: Local bounty board
Claude Opus 4.7 built an Express/Zod/Vitest backend, Lua mod, /bounty flow, rewards, and leaderboard with passing tests.
- Cost: ~$3.59
- Time: 12 min API, 23 min wall
- Code: +1,688 / -0
- Output: 54.8k tokens
- Cache read: 2.8M tokens
Kimi K2.6 also got the local bounty board working — backend routes, Lua mod, basic game flow — but the code was messier. It wrote secure.http_mods = bountykimi in the global config, but also created a world-level config with a different mod name, so the HTTP API wasn't enabled for the actual running mod. Debugging took 30+ minutes.
- Cost: ~$0.39
- Duration: ~9 min 27 sec
- Code changes: +4,671 / -0 (2.7x more than Opus)
- Context used: 52,073 tokens
- Context window: 20%
Verdict: Both passed Test 1, but Opus's output was cleaner and smaller.
Test 2: Composio + Google Sheets
Claude Opus 4.7 got the Google Sheets sync working after some back-and-forth on tsx watch and env loading. The backend could complete a bounty and append to Google Sheets through Composio.
- Cost: $16.03 (painful)
- Time: 28 min API, 1 hr 17 min wall
- Code: +1,848 / -507
- Cache read: 22.3M tokens
- Output: 123.3k tokens
Kimi K2.6 failed. It got stuck on dev server issues, tests, and build problems, and never wired the Composio integration into a clean working state. After ~25 minutes and 135k+ tokens, the test was stopped.
- Cost: ~$5.03
- Time: ~25 min
- Tokens: 135k+
Key takeaways
- Best local MVP: Opus (cleaner), but Kimi is far better value.
- Best real integration: Opus by a large margin.
- Cleaner code: Opus (1.7k vs 4.7k lines for the same task).
- Cheapest experiment model: Kimi K2.6.
- Most painful cost: Opus ($16 for Google Sheets sync).
Kimi K2.6 is interesting for cheap local coding tasks — $0.39 for a working Lua + TypeScript mod is impressive. But when external tools, config issues, and real integration are involved, Opus 4.7 remains clearly ahead.
📖 Read the full source: r/LocalLLaMA
👀 See Also

AI Vendor Lock-In Escalates: Switching Models Now Costs More Than Most Expected
A Zapier survey of 542 US executives shows 90% thought they could switch AI vendors in under 4 weeks, but 58% of actual migrations failed or took far longer. Meanwhile, OpenAI raised GPT-5.2 input token pricing from $1.25 to $5.75, and Anthropic moved Claude enterprise to dynamic pricing, potentially doubling or tripling costs for heavy users.

Nvidia reportedly developing open-source NemoClaw to compete with OpenClaw
Recent reports suggest Nvidia is working on an open-source project called NemoClaw aimed at directly competing with OpenClaw in AI development tools. The project is expected to focus on improving performance, scalability, and developer flexibility while maintaining compatibility with modern AI workflows.

Georgia Court Order Contains AI-Hallucinated Legal Citations
A Georgia Supreme Court appeal revealed a trial court order contained at least five citations to nonexistent cases and five more to cases that don't support their cited propositions, with the prosecutor's proposed order containing the same errors.

Stripe's Minions: One-Shot AI Coding Agents
Minions are Stripe's one-shot AI coding agents aiming to enhance developer productivity by leveraging end-to-end automation using LLMs.