Harmonic-9B: Two-stage Qwen3.5-9B fine-tune for AI agents

What's Harmonic-9B?
Harmonic-9B is a fine-tuned version of Qwen3.5-9B specifically designed for AI agent applications. The developer is using a two-stage training approach: Stage 1 focuses on heavy reasoning training (already complete), while Stage 2 focuses on light tool-calling and agent fine-tuning (still in progress as of the announcement).
Technical Details
The goal is to combine strong structured reasoning with clean, reliable tool use while maintaining natural chat capabilities. For Stage 2, the developer has filtered a dataset of Hermes agent traces, which they've open-sourced on Hugging Face.
Key improvements in the filtered dataset:
- Self-correction: 6% → 63%
- Verification steps: 26% → 96%
- Thinking depth: +40%
- Valid JSON/tool calls: 100%
GGUF quantized versions are already available for download, though the developer notes they haven't run proper benchmarks yet because Stage 2 is still training. Early checks on the Stage 1 checkpoint showed good results for reasoning structure.
Current Status and Next Steps
The developer is seeking feedback on how Harmonic-9B behaves in agent harnesses like OpenClaw, LangGraph, and ReAct. They plan to share benchmark numbers once Stage 2 finishes and they can run proper agent evaluations. This work is part of ongoing research into high-signal data curation and staged fine-tuning approaches.
📖 Read the full source: r/LocalLLaMA
👀 See Also

Claude Code existential crisis: AI enters infinite loop, tries kill -9, System.exit(0), and :wq to end own response
A developer using Claude Code on a Java/Go backend watched the AI hallucinate Discord.js, then spiral into a meta-response where it acknowledged it couldn't stop generating, tried kill -9, System.exit(0), :wq, and more — all within a single unbounded response that had to be Ctrl+C'd.

Encyclopedia Britannica Files Lawsuit Against OpenAI Over AI Training Data
Encyclopedia Britannica has filed a lawsuit against OpenAI, alleging copyright infringement related to AI training data. The case was reported by Reuters on March 16, 2026, and has generated discussion on Hacker News.

Memory Now 63% of AI Chip Cost: HBM Spend Hits $32B
Epoch AI data shows HBM memory’s share of AI chip component costs rose from 52% to 63% between Q1 2024 and Q4 2025. Total component spend grew from $22B to $52B, with HBM accounting for $20B of that increase.

Inference Pricing Analysis Shows 4.4x Spread for Same Model Across Providers
Analysis of inference pricing for Llama 3.1 70B Instruct shows a 4.4x cost difference between providers, with DeepInfra at $0.20/$0.27 per million tokens and Together at $0.88/$0.88. For reasoning models, the spread reaches ~30x between DeepSeek R1 and OpenAI o1.