Anthropic Drops Key Safety Pledge from Responsible Scaling Policy

Anthropic has removed the core commitment from its flagship Responsible Scaling Policy (RSP), according to a TIME report. The company previously pledged in 2023 to never train an AI system unless it could guarantee in advance that its safety measures were adequate.
Policy Change Details
The company is scrapping the promise to not release AI models if Anthropic can't guarantee proper risk mitigations in advance. This was the central pillar of their Responsible Scaling Policy, which company leaders had touted for years as evidence they would withstand market incentives to rush potentially dangerous technology.
Reasoning Behind the Change
Anthropic's chief science officer Jared Kaplan told TIME: "We felt that it wouldn't actually help anyone for us to stop training AI models. We didn't really feel, with the rapid advance of AI, that it made sense for us to make unilateral commitments … if competitors are blazing ahead."
The company has positioned itself as the most safety-conscious of the top AI research labs, making this policy change significant for developers tracking AI safety practices. The decision represents a shift from their previous stance of prioritizing safety guarantees over development speed.
📖 Read the full source: r/ClaudeAI
👀 See Also

Claude Desktop v1.1.5749 Adds Computer Control and Corporate Proxy Fixes
Claude Desktop v1.1.5749 introduces computer use capability with MCP server for desktop control, adds six macOS TCC permission management methods, and fixes corporate proxy SSL certificate issues by forwarding NODE_EXTRA_CA_CERTS, SSL_CERT_FILE, and SSL_CERT_DIR environment variables.

Claude for Excel and PowerPoint Updates: Cross-Application Context and Skills Integration
Claude for Excel and PowerPoint now share conversation context across open files, with Skills available in both add-ins. The tools are accessible via Amazon Bedrock, Google Cloud's Vertex AI, and Microsoft Foundry for paid Mac and Windows users.

AI Interview Platforms Tested: CodeSignal, Humanly, Eightfold in Job Screening
The Verge tested three AI interview platforms including CodeSignal, Humanly, and Eightfold for job screening. The AI avatars conduct one-on-one video interviews, analyze responses, and claim to reduce bias, though bias-free systems remain impossible due to training data limitations.

Gen Z's AI Backlash: Usage Drives Skepticism, Not Acceptance
Polling shows Gen Z adopts AI tools but resents the AI-centric future. Many avoid AI entirely or disable features, citing job fears, environmental concerns, and social impact.