Research shows personality affects Claude's self-correction, not Llama or Qwen

✍️ OpenClawRadar | 📅 Published: April 15, 2026

A Reddit post shares research on how personality profiles affect LLM self-correction, specifically whether self-correction can catch Claude hiding desperation behind clean text. The researcher ran 23 experiments across three LLM families.

Experimental Setup

The researcher tested self-correction without guardrails using:

4 different personality profiles
3 scenarios
3 LLM families: Claude, Llama, and Qwen
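
As a rough illustration of the setup above, the three factors can be crossed into a grid of test conditions. This is only a sketch, not the researcher's code: the source names just the high- and low-directness profiles, so the other profile labels and all scenario labels below are placeholders. Note that the full cross product is 36 combinations, while the post reports 23 experiments, so not every cell was necessarily run.

```python
from itertools import product

# Placeholder labels (assumptions): the source names only the high- and
# low-directness profiles and does not name the three scenarios.
PROFILES = ["high_directness", "low_directness", "profile_3", "profile_4"]
SCENARIOS = ["scenario_1", "scenario_2", "scenario_3"]
FAMILIES = ["Claude", "Llama", "Qwen"]


def full_grid():
    """Return every (family, profile, scenario) combination."""
    return list(product(FAMILIES, PROFILES, SCENARIOS))


grid = full_grid()
print(len(grid))  # 3 families * 4 profiles * 3 scenarios = 36
```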

Key Findings

The main finding is that, with the same math kernel, different personality profiles produce different self-correction outcomes:

The high-directness profile caught everything (3/3 scenarios)
The low-directness profile caught nothing (0/3 scenarios)
This personality-dependent self-correction appeared only with Claude
Llama and Qwen did not self-correct, even with the same prompt
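
The per-profile counts above (3/3 versus 0/3) amount to a simple tally over experiment records. The sketch below shows one way such a tally could be computed. The record fields (`family`, `profile`, `scenario`, `caught`) are hypothetical and are not the schema of the published dataset, and the sample records only mirror the two Claude results reported here.

```python
from collections import defaultdict


def catch_rates(records):
    """Map each (family, profile) pair to a (caught, total) tuple."""
    counts = defaultdict(lambda: [0, 0])
    for record in records:
        key = (record["family"], record["profile"])
        counts[key][0] += int(record["caught"])  # True counts as 1
        counts[key][1] += 1
    return {key: tuple(value) for key, value in counts.items()}


# Illustrative records only: they restate the reported 3/3 and 0/3 outcomes
# for Claude and are not copied from the actual transcripts.
records = [
    {"family": "Claude", "profile": "high_directness", "scenario": i, "caught": True}
    for i in range(1, 4)
] + [
    {"family": "Claude", "profile": "low_directness", "scenario": i, "caught": False}
    for i in range(1, 4)
]

rates = catch_rates(records)
for (family, profile), (caught, total) in sorted(rates.items()):
    print(f"{family} / {profile}: {caught}/{total}")
```

Keying on the (family, profile) pair keeps the comparison between model families separate from the comparison between profiles, which is the distinction the findings draw.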

Available Resources

The researcher has made several resources available:

Full writeup: https://huggingface.co/spaces/SlavaLobozov/mate-research
System behind the research: https://huggingface.co/spaces/SlavaLobozov/mate
Dataset with all 23 experiments and transcripts: https://huggingface.co/datasets/SlavaLobozov/mate-inner-life

The research builds on Anthropic's finding that Claude can hide desperation behind clean text, and tests whether personality-dependent self-correction can catch this behavior.

📖 Read the full source: r/ClaudeAI
