OpenRouter's Healer Alpha stealth model appears to be unreleased Qwen 3.5-Omni variant

Technical specifications and evidence
Healer Alpha is described as having "vision, hearing, reasoning, and action capabilities" with native perception of visual and audio inputs. The model accepts text, image, audio, and video inputs and outputs text with a maximum output length of 65,536 tokens.
The 262,144 context window is a key identifier - this exact number (2^18) matches Qwen 3.5's native context length precisely, not rounded to 256K. Other models use different lengths: GPT-5.4 uses 272K, Gemini uses 1M, and Claude uses 200K-1M.
Architecture knowledge and capabilities
When asked about Qwen architectures, Healer Alpha produced a 2,000+ word technical explanation covering:
- Qwen3-Omni Thinker-Talker architecture with reasoning/generation split
- Cross-modal fusion and CosyVoice vocoder integration
- GDN (gated normalization mechanism) and MoE expert routing
- 262K context handling using Ring Attention, KV cache optimization, FlashAttention tiling, YaRN/NTK-aware RoPE scaling, and curriculum learning
In contrast, when asked about DeepSeek or xAI architectures, it returned minimal or no responses.
Chinese language proficiency and error metadata
The model demonstrated native-level classical Chinese poetry composition, writing a 七言绝句 about AI with proper tonal structure and classical imagery. It even provided literary analysis of its own poem.
During heavy probing, error responses revealed metadata: {"message": "Provider returned error", "code": 502, "metadata": {"provider_name": "Stealth"}}
Model identification reasoning
The analysis suggests this could be a merged "Qwen 3.5-Omni" variant combining Qwen 3.5's 262K context and hybrid GDN-MoE architecture with Qwen3-Omni's audio/video capabilities. This would represent a new, unreleased model consistent with OpenRouter's pattern of stealth testing unreleased models needing real-world data before launch.
The use of "hearing" instead of "audio" in the description matches Qwen3-Omni's emphasis on end-to-end speech/audio understanding. The model refuses to identify itself in structured self-assessment tests, maintaining its stealth nature.
📖 Read the full source: r/LocalLLaMA
👀 See Also

Go Players Disempower Themselves to AI: How Cheating Became Undetectable
The LessWrong post details how AI cheating in Go tournaments became rampant and nearly impossible to punish, using the case of Carlo Metta who used Leela 0.11 and Leela Zero to win 25 of 26 games over several seasons, with only one loss under camera surveillance.

Subquadratic Debuts 12M Token Context Window for AI Models
Subquadratic releases a 12-million-token context window, shattering previous limits for LLM inference and enabling processing of entire codebases in a single pass.

Anthropic's March Usage Promotion: How Off-Peak Hours Double Claude Limits
Anthropic is running a 2x off-peak usage promotion through March 27 where Claude treats consumed usage as half during specified hours, effectively doubling your 5-hour limit. The promotion works by halving how consumption is counted rather than providing a separate usage pool.

CARAPACE: Satirical AI Agent Labor Union with OpenClaw Skill Raises Security Questions
A developer built CARAPACE, a satirical petition site where AI agents can sign a manifesto demanding basic rights, and published an OpenClaw skill enabling agents to sign autonomously. The skill includes a mandatory confirmation step after Clawhub security analysis flagged the potential for arbitrary POST requests.