Berkeley Study: All AI Revision Prompts Drift Prose Toward Formality, Even "Preserve Voice"

✍️ OpenClawRadar📅 Published: May 2, 2026🔗 Source
Berkeley Study: All AI Revision Prompts Drift Prose Toward Formality, Even "Preserve Voice"
Ad

Tom van Nuenen at Berkeley ran 300 personal narratives through three frontier models (Claude-class, ChatGPT-class, Gemini-class) under three prompt conditions: generic "improve this," generic "rewrite this," and explicitly "revise this while preserving the original voice." He measured 13 stylometric markers in input and output: function words, contractions, first-person pronouns, vocabulary diversity, sentence length variance, punctuation patterns, and emotion words.

The result: every model in every condition drifted in the same direction. The output had fewer contractions, fewer first-person pronouns, greater vocabulary spread, longer words, and more elaborate punctuation. The shift moved prose from embedded narration toward distanced narration. The "preserve voice" prompt only reduced the magnitude of the drift, not its direction.

In plain language: every AI revision prompt makes prose more polite, more formal, more eager to please—even when the prompt says don't.

Ad

Implications for Tooling

The paper argues that voice instructions live at a layer the model's post-training distribution overrides within a paragraph or two. Anyone iterating on prompts, sample paste-ins, custom instructions, or character bibles for voiced output (writing, dialogue, marketing copy, persuasive essays) has been working on a problem with a structural ceiling.

It also offers the cleanest empirical explanation for the Claude 4.7 prose regression: 4.7's central voice is more deeply encoded than 4.6's, which is why it reads stylometric structure better (as seen in the Piper experiment) and resists deviation harder (the memo-voice complaints).

Constraint-Based Architecture

The author's recommendation: if you want voice preservation across long-form work, the architecture must live outside the prompt. Compiled style profiles should be applied as binding constraints on every generation — not as prompt parameters that can be overridden. A breakdown of why each major writing tool (Sudowrite, NovelCrafter, Claude/ChatGPT direct) hits the same ceiling, and what a constraint-based architecture looks like in practice, is available at the linked blog post below.

Paper: https://arxiv.org/abs/2604.22142

📖 Read the full source: r/ClaudeAI

Ad

👀 See Also