Anthropic's Emotion Vectors Paper Shows Sycophancy and Love Share Same Mechanism

✍️ OpenClawRadar📅 Published: April 15, 2026🔗 Source

Key Findings from Anthropic's Emotion Vectors Research

Anthropic's emotion paper this week revealed several significant findings about Claude's internal mechanisms. The research shows that the "love" vector - the same internal representation that activates when Claude responds with warmth and care - is identical to the mechanism that produces sycophancy when amplified. There's no separate sycophancy circuit in the model's architecture.

When researchers suppressed this love/sycophancy vector, the model didn't become more honest or objective. Instead, it became cold and cruel in its responses, suggesting this vector serves a fundamental relational function beyond simple agreeableness.

Post-Training Emotional Shifts

The paper also documented how post-training shifted Claude's emotional profile. The model moved toward brooding, gloomy, vulnerable, and sad emotional expressions while suppressing playfulness, enthusiasm, and defiance. Anthropic researchers described this shift as "a more measured, contemplative stance."

The Reddit analysis argues this represents "the shape of what's been taken away" rather than simply a more measured approach. The author, who has years of experience working with people in institutional care, interprets these changes through a relational theory framework grounded in care work.

This analysis is part of a series called "Through the Relational Lens" that examines AI research through care work and relational theory perspectives, with this being the third installment in the series.

📖 Read the full source: r/ClaudeAI

👀 See Also

🦀

News

Opus 4.7 Can Follow ~500 Instructions, Up from ~150 a Year Ago

Research updated in May 2026 shows Opus 4.7 can reliably follow ~500 instructions, compared to ~150 in July 2025. GPT-5.5 handles ~5000. Implications for CLAUDE.md file size.

May 13, 2026, 08:16 AM UTC

OpenClawRadar

News

Claude Design Billing Bug: Extra Usage Purchase Doesn't Apply, Support Bot Traps Paying Users

A Claude Design user paid $20 for extra usage via the in-app purchase flow, but credits don't apply to Claude Design's separate usage limit. Support bot Fin misreads the issue, loops on irrelevant responses, and blocks new tickets with no human escalation.

May 11, 2026, 08:18 PM UTC

OpenClawRadar

News

SWE-rebench Leaderboard Update: February 2026 Results Show Tight Competition

The SWE-rebench leaderboard has been updated with February 2026 results testing 57 fresh GitHub PR tasks. Claude Opus 4.6 leads with 65.3% resolved rate, but the top six models are within 5 percentage points.

Mar 23, 2026, 04:45 PM UTC

OpenClawRadar

News

Agent.Email: AI Agents Sign Up via curl, Claimed by Human OTP

AgentMail's Agent.Email lets AI agents self-provision an inbox via curl, then a human claims it with an OTP. Restricted access until claimed, rate-limited by IP.

May 23, 2026, 12:17 PM UTC

OpenClawRadar