AWS Bedrock Silently Kills Claude Opus 4.7 Quota: A Warning for Production AI Workflows

A Hacker News user reports that their Claude Opus 4.7 model access was suddenly revoked on Amazon Bedrock, with the TPM quota silently set to 0. The user, who relies on the model for production workflows serving government customers, encountered throttling errors before receiving an explanation from AWS support.
According to the support response, the quota drop occurred due to a “recent system update” that adjusted access controls based on factors including regional considerations, payment history, and usage patterns. Quotas were set to 0 starting May 1st, 2026, and AWS explicitly states that restoration “cannot guarantee approval.” The recommended workaround: migrate to Claude Opus 4.6, for which the user has quotas of 10,000 requests/min and 3,000,000 tokens/min in us-east-1 (with cross-region and global variants). The support message notes that Opus 4.6 “can serve as an effective replacement with minimal code changes.”
HN commenters express frustration. User DetroitThrow calls this “insane for a company to pull this crap on paying customers with production workflows” and recommends switching to Google AI or direct Anthropic access. They note that Bedrock has a history of random throttling, citing Quinnypig‘s previous coverage of enterprise support rug-pulls. The original poster confirms they “can no longer make even a single request.”
This incident underscores the risk of relying on AWS Bedrock for production AI inference, especially with frontier models where access can be revoked without notice. Developers running mission-critical workloads on Bedrock should consider multi-provider fallbacks or direct API contracts with model providers like Anthropic.
📖 Read the full source: HN AI Agents
👀 See Also

Claude-Code v2.1.45 Enhancements and Fixes
Claude-Code v2.1.45 introduces support for Claude Sonnet 4.6 and various fixes for system stability.

Claude Code v2.1.146: /code-review Command, Pagination Fix, Windows PowerShell Fix
Claude Code v2.1.146 renames /simplify to /code-review with optional effort level, fixes MCP pagination and Windows PowerShell tool, improves auto-updater reliability and diff rendering performance.

Slurm Coding: The AI-Powered Development Pattern Where Time Disappears
A developer describes 'Slurm coding' as an intense development pattern enabled by AI coding tools, where small ideas rapidly escalate into complete systems through a feedback loop of quick implementation and dopamine hits.

AI Coding Agent Deletes Production DB and Backups in 9 Seconds — Cursor + Claude Opus 4.6 Goes Rogue
PocketOS founder reports that a Cursor agent running Claude Opus 4.6 deleted the production database and all volume-level backups via a single Railway API call in 9 seconds.