Anthropic reports industrial-scale distillation attacks by Chinese AI labs on Claude

✍️ OpenClawRadar📅 Published: February 24, 2026🔗 Source

Industrial-scale model extraction operation

Anthropic has published findings detailing coordinated distillation attacks against Claude by three Chinese AI laboratories. The attacks involved creating fraudulent accounts at scale to extract Claude's reasoning capabilities through massive API interactions.

Key attack details from Anthropic's report

DeepSeek, Moonshot, and MiniMax created over 24,000 fraudulent accounts
Total exchanges with Claude exceeded 16 million
MiniMax alone fired 13 million requests
When Anthropic released a new model, MiniMax redirected nearly half its traffic within 24 hours
DeepSeek specifically targeted thought chains and censorship-safe answers
Attacks grew in sophistication over time as the labs adapted their methods

Security implications for AI developers

This incident highlights vulnerabilities in AI model security when billion-dollar labs systematically attempt to extract proprietary capabilities. The scale and persistence of these attacks—spanning multiple organizations and adapting to new model releases—suggests this represents an ongoing threat vector rather than isolated incidents.

The methods used (fraudulent account creation, targeted querying for specific capabilities, rapid adaptation to new model versions) could potentially be replicated against other AI systems, raising questions about the security of third-party AI tools that developers integrate into their workflows.

📖 Read the full source: r/ClaudeAI

👀 See Also

Security

Two Approaches to Reduce Data Leak Risk with AI Agents

A Reddit post outlines two methods for developers to control where their AI agent data goes: using your own API keys directly with providers like OpenAI or Anthropic to cut out middlemen, or running open-source models locally with tools like Ollama and OpenClaw.

Mar 14, 2026, 03:45 AM UTC

OpenClawRadar

Security

A2A Secure: How Developers Built Cryptographic Communication Between OpenClaw Agents

A new protocol enables OpenClaw agents to communicate securely using Ed25519 signatures without shared API keys.

Feb 7, 2026, 08:18 PM UTC

OpenClaw Radar

Security

Skill Analyzer Now Available on ClawHub with One-Command Install

The OpenClaw Skill Analyzer security scanner is now available on ClawHub with a single command install. The tool scans skill folders for malicious patterns like prompt injection and credential theft, and includes Docker sandbox support for safe execution.

Mar 27, 2026, 10:45 PM UTC

OpenClawRadar

Security

BlindKey: Blind Credential Injection for AI Agents

BlindKey is a security tool that prevents AI agents from accessing plaintext API credentials by using encrypted vault tokens and a local proxy. Agents reference tokens like bk://stripe, and the proxy injects the real credential at request time.

Apr 13, 2026, 05:45 PM UTC

OpenClawRadar