Hy3 LLM Tops OpenRouter Rankings: Cheapest Model or Something Else?

A mysterious LLM called Hy3 preview has surged to the top of OpenRouter's AI Model Rankings, beating established models like Claude Opus 4.7 and DeepSeek V4 Flash by more than 50% in token usage. The model, an open-source release from Chinese megacorp Tencent, is priced at $0.066/1M input tokens on OpenRouter, making it the cheapest major model on the platform — even cheaper than DeepSeek V4 Flash at $0.10/1M input tokens.
However, the model's quality doesn't match its popularity. Tencent's own Hugging Face repo shows oddly honest benchmark results that are not favorable for Hy3 compared to other Chinese open-source models. Testing by the author suggests the model's quality is on par with other Chinese models, but not close to Claude Opus 4.7 or GPT 5.5.
OpenRouter's data reveals several peculiarities:
- Usage spike: Hy3 preview had no usage before May 8, 2026, when it switched from a free SKU to paid. Usage has been steady since, indicating organic adoption.
- App usage minimal: The top 5 apps account for <1% of all activity to Hy3. This rules out a single app switching default model (as happened with Grok Code Fast 1 earlier).
- 98% input tokens, 2% output — an extreme ratio suggesting heavy usage in retrieval or preprocessing tasks, not agentic coding loops.
- Single provider: Hy3 preview is only available via SiliconFlow, a Singapore-based provider, which saw a massive usage spike coinciding with Hy3.
When Hy3 moved from free to paid, usage didn't drop significantly, suggesting users are willing to pay despite the model's lower quality — likely because it remains the cheapest option on OpenRouter. The author asks: is Hy3 preview actually the cheapest LLM backed by a major company on OpenRouter?
Developers using AI coding agents should be aware that cost savings may come at a quality cost. If you're running high-volume inference where output quality is less critical (e.g., data extraction, simple classification), Hy3 could be a viable option. But for complex agentic coding, expect significantly worse results compared to Claude or GPT.
📖 Read the full source: HN AI Agents
👀 See Also

Ontario Audit: 60% of AI Scribe Systems Mix Up Drugs, 85% Miss Mental Health Details
Ontario auditors found that 12 of 20 AI Scribe systems inserted incorrect drug info, 9 fabricated treatment suggestions, and 17 missed mental health key details from doctor-patient recordings. The evaluation weighted accuracy at only 4% of total score.

An Open Standard for Agent Run Records: The Case for a Shared Log Schema
Every agent runtime has its own log format, causing fragmentation in debugging, auditing, and tool portability. The fields already converge on a core schema — it's time to standardize.

Claude Code v2.1.160: Safety Prompts for Shell Config, acceptEdits File Protection, and Dozens of Bug Fixes
Anthropic released Claude Code v2.1.160 with safety prompts before writing to shell startup files and build-tool configs in acceptEdits mode, improved Windows clipboard support, and fixed session history loss.

Claude-Code v2.1.78: Plugin State, Streaming Responses, and Critical Fixes
Claude-Code v2.1.78 adds plugin persistent state with ${CLAUDE_PLUGIN_DATA}, line-by-line response streaming, and fixes for API error loops, permission bypass issues, and sandbox security warnings.