AI Prompts Drift Prose Toward Formality: Berkeley Study

Tom van Nuenen at Berkeley ran 300 personal narratives through three frontier models (Claude-class, ChatGPT-class, Gemini-class) under three prompt conditions: generic "improve this," generic "rewrite this," and explicitly "revise this while preserving the original voice." He measured 13 stylometric markers in input and output: function words, contractions, first-person pronouns, vocabulary diversity, sentence length variance, punctuation patterns, and emotion words.

The result: every model in every condition drifted in the same direction. The output had fewer contractions, fewer first-person pronouns, greater vocabulary spread, longer words, and more elaborate punctuation. The shift moved prose from embedded narration toward distanced narration. The "preserve voice" prompt only reduced the magnitude of the drift, not its direction.

In plain language: every AI revision prompt makes prose more polite, more formal, more eager to please—even when the prompt says don't.

Implications for Tooling

The paper argues that voice instructions live at a layer the model's post-training distribution overrides within a paragraph or two. Anyone iterating on prompts, sample paste-ins, custom instructions, or character bibles for voiced output (writing, dialogue, marketing copy, persuasive essays) has been working on a problem with a structural ceiling.

It also offers the cleanest empirical explanation for the Claude 4.7 prose regression: 4.7's central voice is more deeply encoded than 4.6's, which is why it reads stylometric structure better (as seen in the Piper experiment) and resists deviation harder (the memo-voice complaints).

Constraint-Based Architecture

The author's recommendation: if you want voice preservation across long-form work, the architecture must live outside the prompt. Compiled style profiles should be applied as binding constraints on every generation — not as prompt parameters that can be overridden. A breakdown of why each major writing tool (Sudowrite, NovelCrafter, Claude/ChatGPT direct) hits the same ceiling, and what a constraint-based architecture looks like in practice, is available at the linked blog post below.

Paper: https://arxiv.org/abs/2604.22142

📖 Read the full source: r/ClaudeAI

Berkeley Study: All AI Revision Prompts Drift Prose Toward Formality, Even "Preserve Voice"

Implications for Tooling

Constraint-Based Architecture

👀 See Also

GPU Power Consumption Deviates from Token Predictor Theory in Small LLMs

Anthropic changes subscription terms, OpenClaw users now billed separately for agent usage

Claude Code v2.1.37 Released

Hacker News AI Discussion Shifts from Demos to Tooling Focus