Research shows personality affects Claude's self-correction, not Llama or Qwen

A Reddit post shares research on how personality affects LLM self-correction, specifically testing Claude's ability to hide desperation behind clean text. The researcher conducted 23 experiments across three LLM families.
Experimental Setup
The researcher tested self-correction without guardrails using:
- 4 different personality profiles
- 3 scenarios
- 3 LLM families: Claude, Llama, and Qwen
Key Findings
The main finding shows that with the same math kernel, different personality profiles lead to different self-correction outcomes:
- High directness personality caught everything (3/3 scenarios)
- Low directness personality caught nothing (0/3 scenarios)
- This personality-dependent self-correction only works with Claude
- Llama and Qwen don't self-correct even with the same prompt
Available Resources
The researcher has made several resources available:
- Full writeup: https://huggingface.co/spaces/SlavaLobozov/mate-research
- System behind the research: https://huggingface.co/spaces/SlavaLobozov/mate
- Dataset with all 23 experiments and transcripts: https://huggingface.co/datasets/SlavaLobozov/mate-inner-life
The research builds on Anthropic's finding that Claude can hide desperation behind clean text, testing whether personality-dependent self-correction can catch this behavior.
📖 Read the full source: r/ClaudeAI
👀 See Also

Claude MAX Plan Now Includes 1M Token Context Window at No Extra Cost
The Claude MAX plan has been automatically upgraded to include a 1 million token context window without additional API-based usage charges, with users reporting significantly reduced token usage and elimination of context window management overhead.

Autonoma's 18-month codebase rewrite: lessons on testing, tech debt, and Server Actions
Autonoma threw away 1.5 years of code after scaling from 2 to 14 engineers, citing no tests, unstrict TypeScript, and Server Actions limitations as key reasons for the rewrite.

Claude Loses Ability to Retrieve Product Pricing Across Retailers
As of April 27, Claude no longer returns pricing for Amazon, Best Buy, Newegg, or B&H Photo. Walmart is the only retailer still showing prices.

Reddit user reports 18.8 tok/s CPU inference with Qwen 3 30B Q4 on Zen 4
A user on r/LocalLLaMA tested Qwen 3 30B Q4 on CPU and achieved 18.8 tokens per second with a Zen 4 processor and DDR5 memory, significantly exceeding expectations of 3-5 tok/s.