OpenEvol: Offline Self-Improvement Pipeline for LLMs Using Conversation History

What OpenEvol Does
OpenEvol is an offline self-improvement pipeline for large language models that automatically converts AI conversation history into training data. The tool mines high-value exchanges from conversations, judges their quality, and generates fine-tuning datasets without manual labeling or proprietary data flywheels.
How It Works
The pipeline runs through four automated stages:
- Mine high-value exchanges from conversations
- Judge quality using rules with an optional teacher LLM
- Synthesize SFT, preference, and pretraining datasets
- Fine-tune with one command
This creates a closed loop where the model learns from its own experience.
Technical Details
No GPU is required to get started - the full pipeline runs on CPU with a mock or OpenAI-compatible teacher backend. You can bring a GPU when ready to train.
Five teacher backends are supported:
- Mock
- Rule-based
- OpenAI-compatible API (any local proxy works)
- HuggingFace Transformers
- vLLM
Usage Options
Three ways to use OpenEvol:
- CLI for offline batch runs
- REST API server for automation
- OpenClaw desktop plugin that lets you trigger pipeline runs directly from chat
Quality Control
Every batch is automatically scored. If the approval rate drops below 80%, training is blocked and flagged for human review, giving users control over what data gets used for training.
This type of tool is useful for developers who want to improve their AI coding agents using actual conversation history without sending data to external services.
📖 Read the full source: r/openclaw
👀 See Also

Relay: Open-Source Control Plane for OpenClaw AI Agents
Relay is an Electron desktop app that provides Claude Cowork-like workflow for OpenClaw, running on your infrastructure with your choice of LLM models and built-in governance features including approval gates and exportable audit trails.

Open-sourced self-healing skill for AI agents detects and fixes failures automatically
A new open-source skill enables AI agents to automatically detect failures, diagnose root causes, and implement fixes. It includes a failure scanner for crons, sub-agents, and deploy logs, plus a database that learns from previous fixes.

Codeset improves coding agents with repo-specific context from git history
Codeset generates static files from git history that provide context like past bugs, root causes, and co-change relationships. Testing showed 5.3pp improvement on codeset-gym-python and 2pp on SWE-Bench Pro with OpenAI Codex.

OpenClawDreams: A Dream Simulator Extension for OpenClaw Agents
OpenClawDreams is an extension that adds a background reflection process and nightly dream cycle to OpenClaw agents. It captures encrypted conversation summaries to a local SQLite database, processes them during background cycles, and generates consolidated insights that get pushed into the agent's persistent memory.