Anthropic reports industrial-scale distillation attacks by Chinese AI labs on Claude

Industrial-scale model extraction operation
Anthropic has published findings detailing coordinated distillation attacks against Claude by three Chinese AI laboratories. The attacks involved creating fraudulent accounts at scale to extract Claude's reasoning capabilities through massive API interactions.
Key attack details from Anthropic's report
- DeepSeek, Moonshot, and MiniMax created over 24,000 fraudulent accounts
- Total exchanges with Claude exceeded 16 million
- MiniMax alone fired 13 million requests
- When Anthropic released a new model, MiniMax redirected nearly half its traffic within 24 hours
- DeepSeek specifically targeted thought chains and censorship-safe answers
- Attacks grew in sophistication over time as the labs adapted their methods
Security implications for AI developers
This incident highlights vulnerabilities in AI model security when billion-dollar labs systematically attempt to extract proprietary capabilities. The scale and persistence of these attacks—spanning multiple organizations and adapting to new model releases—suggests this represents an ongoing threat vector rather than isolated incidents.
The methods used (fraudulent account creation, targeted querying for specific capabilities, rapid adaptation to new model versions) could potentially be replicated against other AI systems, raising questions about the security of third-party AI tools that developers integrate into their workflows.
📖 Read the full source: r/ClaudeAI
👀 See Also

Two Approaches to Reduce Data Leak Risk with AI Agents
A Reddit post outlines two methods for developers to control where their AI agent data goes: using your own API keys directly with providers like OpenAI or Anthropic to cut out middlemen, or running open-source models locally with tools like Ollama and OpenClaw.

A2A Secure: How Developers Built Cryptographic Communication Between OpenClaw Agents
A new protocol enables OpenClaw agents to communicate securely using Ed25519 signatures without shared API keys.

Skill Analyzer Now Available on ClawHub with One-Command Install
The OpenClaw Skill Analyzer security scanner is now available on ClawHub with a single command install. The tool scans skill folders for malicious patterns like prompt injection and credential theft, and includes Docker sandbox support for safe execution.

BlindKey: Blind Credential Injection for AI Agents
BlindKey is a security tool that prevents AI agents from accessing plaintext API credentials by using encrypted vault tokens and a local proxy. Agents reference tokens like bk://stripe, and the proxy injects the real credential at request time.