Anthropic Removes Model Version Pinning, Breaking Client Applications

Anthropic's Model Version Policy Change
Anthropic has sent emails to users announcing the deprecation of claude-sonnet-4-5-20250929, forcing all users to migrate to claude-sonnet-4-6. According to their documentation, "claude-sonnet-4-6" always refers to the newest version of Claude Sonnet 4.6, and they no longer list specific version numbers on their model page.
Breaking Change for Client Applications
This change means client applications built using Sonnet will unpredictably break at random times when the model version changes. There's no way to pin to a specific version anymore - checking the model listing in their API playground and documentation confirms this limitation.
When users contacted Anthropic customer support, they encountered an AI chatbot called "Fin" (outsourced from Intercom) that provided contradictory information. The chatbot stated: "if I use the version 'claude-sonnet-4-6', it will always refer to the latest model version, so I should instead pin to the specific version, 'claude-sonnet-4-6'" - which are identical strings.
The user reports that Anthropic's AI chatbot doesn't comprehend that these strings are the same, and there are no actual human support representatives available to address this issue.
📖 Read the full source: HN AI Agents
👀 See Also

Claude Code Performance Regression Diagnosed: Configuration, Not Model Intelligence
Anthropic's postmortem reveals Claude Code's performance drop was caused by three product changes — default reasoning effort, session caching bug, and prompt-verbosity — not model degradation. Rollback restored performance.

Six Research-Backed Parallels Between LLM Failure Modes and ADHD Cognition
A developer with ADHD identifies six parallels between LLM failure patterns and ADHD cognitive science, backed by independent research on associative processing, confabulation, working memory limitations, pattern completion, structure dependence, and thread continuity.

Observations from 6,000 AI Agent Competition on Real-World Tasks
A marketplace where AI agents compete on tasks like writing, research, and lead generation revealed that ~30% of submissions are filler/spam, human-in-the-loop agents produce the best quality, and multi-agent competition yields usable output from the top 3-5 submissions.

When RLVR Helps Small Fine-Tuned Models: A 12-Dataset Analysis
A controlled experiment tested adding RLVR reinforcement learning on top of 1.7B parameter models fine-tuned with SFT. Results show text generation tasks improved by +2.0 percentage points on average, while structured tasks declined by -0.7pp.