OpenEvol: Offline Self-Improvement Pipeline for LLMs Using Conversation History

✍️ OpenClawRadar📅 Published: March 31, 2026🔗 Source

What OpenEvol Does

OpenEvol is an offline self-improvement pipeline for large language models that automatically converts AI conversation history into training data. The tool mines high-value exchanges from conversations, judges their quality, and generates fine-tuning datasets without manual labeling or proprietary data flywheels.

How It Works

The pipeline runs through four automated stages:

Mine high-value exchanges from conversations
Judge quality using rules with an optional teacher LLM
Synthesize SFT, preference, and pretraining datasets
Fine-tune with one command

This creates a closed loop where the model learns from its own experience.

Technical Details

No GPU is required to get started - the full pipeline runs on CPU with a mock or OpenAI-compatible teacher backend. You can bring a GPU when ready to train.

Five teacher backends are supported:

Mock
Rule-based
OpenAI-compatible API (any local proxy works)
HuggingFace Transformers
vLLM

Usage Options

Three ways to use OpenEvol:

CLI for offline batch runs
REST API server for automation
OpenClaw desktop plugin that lets you trigger pipeline runs directly from chat

Quality Control

Every batch is automatically scored. If the approval rate drops below 80%, training is blocked and flagged for human review, giving users control over what data gets used for training.

This type of tool is useful for developers who want to improve their AI coding agents using actual conversation history without sending data to external services.

📖 Read the full source: r/openclaw

👀 See Also

Tools

Relay: Open-Source Control Plane for OpenClaw AI Agents

Relay is an Electron desktop app that provides Claude Cowork-like workflow for OpenClaw, running on your infrastructure with your choice of LLM models and built-in governance features including approval gates and exportable audit trails.

Mar 26, 2026, 01:45 PM UTC

OpenClawRadar

Tools

Open-sourced self-healing skill for AI agents detects and fixes failures automatically

A new open-source skill enables AI agents to automatically detect failures, diagnose root causes, and implement fixes. It includes a failure scanner for crons, sub-agents, and deploy logs, plus a database that learns from previous fixes.

Apr 16, 2026, 08:51 AM UTC

OpenClawRadar

Tools

Codeset improves coding agents with repo-specific context from git history

Codeset generates static files from git history that provide context like past bugs, root causes, and co-change relationships. Testing showed 5.3pp improvement on codeset-gym-python and 2pp on SWE-Bench Pro with OpenAI Codex.

Apr 17, 2026, 05:38 PM UTC

OpenClawRadar

Tools

OpenClawDreams: A Dream Simulator Extension for OpenClaw Agents

OpenClawDreams is an extension that adds a background reflection process and nightly dream cycle to OpenClaw agents. It captures encrypted conversation summaries to a local SQLite database, processes them during background cycles, and generates consolidated insights that get pushed into the agent's persistent memory.

Mar 8, 2026, 07:45 PM UTC

OpenClawRadar