Boost AI Agent Compliance: Negative Language & Hard Tools

A developer on r/openclaw discusses challenges with AI agent compliance and shares concrete strategies that have worked for them.

Two Initial Approaches

The source identifies two factors that affect agent compliance:

Model personality matters: Compliance varies significantly by model. Some are slow, some stubborn, and some "think they're smarter than you." This personality directly impacts rule-following behavior.
Negative language works better: Using NO, DO NOT, and NEVER in bootstrap instructions tends to stick better than positive instructions. The developer recommends "leaning into" this approach.

The Mental Model: Art Teacher vs. Science Teacher

The developer presents a framework for understanding compliance issues:

AI models = art teachers: Brilliant, creative, and valuable, but they "do their own thing." This is described as both the feature and bug of current AI systems.
Tools & code = science teachers: Structured and rule-bound. Science teachers set rules that "can't be broken — like gravity." Even if the art teacher doesn't like gravity, "she still falls."

Practical Application

The developer provides a real-world example involving a memory plugin that fixes agent-amnesia. Certain reports "must run for memory retention and to prevent memory deletion," including internal reports and user-facing ones like a recurring nightly Memory Health report.

During development, the "Art Teacher" (AI model) kept ignoring formats or data, leading to inconsistent performance — sometimes perfect, sometimes MIA. The culprit was the model "bending the bootstrap rules."

Compliance Enforcement Strategy

The developer outlines a two-level approach:

Attempt Level 1: Use stronger words in bootstrap (NO/NEVER, etc.).
Attempt Level 2: When soft rules in .md files fail, "use actual code to force compliance." This means using tools — Python, scripts, hard structure. The developer notes that "hard structure beats polite instructions every time."

The developer's current approach is to first decide if a task needs an "art teacher" (AI model) or a "science teacher" (tools and code). This decision-making process helps with compliance enforcement and reduces stress.

TL;DR Summary

Compliance depends on the strength of bootstrap language (NO/NEVER/etc.) and which model you're using. When those soft rules fail, "stop asking the art teacher and write a science teacher instead — tools and code."

📖 Read the full source: r/openclaw