The Human Creativity Benchmark: Separating Convergence from Divergence in AI Creative Evaluation

✍️ OpenClawRadar📅 Published: May 1, 2026🔗 Source
The Human Creativity Benchmark: Separating Convergence from Divergence in AI Creative Evaluation
Ad

Contra Labs' new Human Creativity Benchmark (HCB) tackles a core problem in evaluating AI-generated creative work: creative tasks have no ground truth. Traditional benchmarks treat evaluator disagreement as noise to be resolved via majority voting or adjudication. The HCB instead separates convergence (agreement on shareable best practices) from divergence (genuine differences in aesthetic taste).

Key Findings

  • Convergence is high on verifiable axes: prompt adherence, usability, and technical correctness (e.g., legibility, layout).
  • Divergence dominates on taste-driven axes: visual appeal, mood, conceptual risk.
  • Desktop Apps and Landing Pages show highest convergence; Ad Video and Brand Assets remain most divergent.
  • No current generative model is reliably both correct (convergent) and steerable (divergent on request).
  • Mode collapse is identified as a practical problem: models converge on safe, averaged aesthetics when given the same brief.
Ad

Methodology

The HCB defines evaluation axes on a spectrum from objectively verifiable to inherently subjective. For each axis, evaluator agreement is measured. Convergence reflects shared standards like visual hierarchy, color contrast, and rendering quality. Divergence captures personal taste—essential for creative workflows where professionals need multiple directions for exploration and iteration.

Implications for AI Agents

For developers using AI coding agents, this benchmark underscores that creative tools must offer both reliability (following instructions) and steerability (adjusting to personal taste). The HCB provides a framework to evaluate these dimensions separately, rather than smoothing out divergence into a single quality score. Agents that fail to support differentiated output risk being unusable for real creative work.

📖 Read the full source: HN AI Agents

Ad

👀 See Also