Contract Testing with OpenClaw: AI Development Guide

Contract testing offers an alternative to integration and end-to-end testing when working with AI coding agents like OpenClaw. Instead of testing the entire system, you test the interfaces and invariants between components, then have the agent generate code that satisfies those contracts.

The Core Idea

When using an agent to write code, the workflow shifts from traditional testing to contract-driven development:

Write contract/spec first instead of implementation
Contracts validate interface + invariants instead of integration tests checking behavior
Minimal E2E smoke tests instead of comprehensive E2E tests
Agent writes implementations instead of humans writing most code

The AI's job becomes: "Make the code satisfy the contract."

What a Contract Looks Like

A contract defines input schema, output schema, invariants, and error conditions. Example in TypeScript with Zod:

export const CreateUserRequest = z.object({
  email: z.string().email(),
  password: z.string().min(8)
})

export const CreateUserResponse = z.object({
  id: z.string().uuid(),
  email: z.string().email(),
  createdAt: z.string()
})

Contract test example:

test("createUser contract", async () => {
  const req = CreateUserRequest.parse({
    email: "[email protected]",
    password: "password123"
  })
  
  const res = await createUser(req)
  expect(CreateUserResponse.parse(res)).toBeDefined()
})

The AI can regenerate the entire service as long as this passes.

Contract Testing Pattern for AI Agents

A common project structure:

contracts/ user.contract.ts order.contract.ts tests/ contract/ user.test.ts

src/ services/ userService.ts

Workflow: define contracts, agent generates implementation, contract tests run, agent fixes failures. This creates a tight feedback loop that AI agents rely on to self-correct.

Example Agent Prompt

Inside an OpenClaw agent workflow:

Implement the service so that all tests in tests/contract pass.
Do not modify contract definitions. Only modify implementation files.

The agent iterates until npm test PASS contract tests.

Consumer-Driven Contracts

Consumer-driven contracts work particularly well for AI development. Example: frontend defines POST /users expecting { id: uuid, email: string }, and the backend agent must satisfy that contract. Tools typically used include Pact, schema validation, and OpenAPI contracts.

Minimal Testing Stack for AI Coding

To replace most integration tests:

contracts/ openapi.yaml tests/ contract/ invariants/

src/ implementation

Test distribution: contract tests (80%), invariant/property tests (15%), minimal E2E smoke tests (5%). Example smoke tests: user signup works, user login works.

Extra Trick: Property Tests

Agents improve dramatically with property tests. Example:

fc.assert(
  fc.property(fc.string(), async (email) => {
    const user = await createUser({email})
    expect(user.email).toEqual(email)
  })
)

This gives the agent a search space to learn from.

Why This Works Better for AI

Agents struggle with multi-service coordination, flaky E2E tests, and complex environment setup. They excel when given deterministic feedback, small isolated tasks, and schemas + constraints. Contract tests become the "ground truth" in an AI-friendly architecture: contracts (truth) → tests (verification) → agent generates → implementation.

📖 Read the full source: r/clawdbot