SkillOpt: Optimizing Markdown Skill Files as Trainable Parameters for AI Agents

SkillOpt is a new optimization framework that treats markdown skill files as trainable parameters, applying proper optimization machinery to the ad-hoc skill editing many agent builders already do. The paper (arxiv.org/pdf/2605.23904) formalizes a process: a frontier model proposes bounded edits (add/delete/replace) to markdown skill files, and each edit is gated against a held-out validation set. Only strict improvements are accepted; ties are rejected, and rejected edits become negative signal for subsequent rounds.
Key Findings
- Convergence: Best skills converge with 1 to 4 accepted edits out of many more proposals. An edit budget of 4 to 8 per step works best; removing the cap causes performance to collapse.
- Skill size: The median final skill is ~920 tokens.
- Model transfer: A skill optimized on Codex transferred to Claude Code with zero modification and gained +59.7 on SpreadsheetBench. GPT 4.1 Nano with an optimized skill roughly matched frontier models on procedural benchmarks.
Limitations
The validation gate requires an auto-grader with clear correct answers. This works for code and spreadsheets but breaks for anything open-ended.
Who It's For
Developers building AI coding agents who want to systematically optimize skill files rather than relying on manual iteration or ad-hoc prompt engineering.
📖 Read the full source: r/LocalLLaMA
👀 See Also

Running Qwen3.6-35B-A3B-UD-Q5_K_XL Locally with VS Code Copilot on AMD R9700
A user shares their working llama.cpp setup for Qwen3.6-35B-A3B-UD-Q5_K_XL on a single AMD R9700 with Vulkan, achieving full website and Playwright test generation from scratch with minimal nudging.

Custom GIF Spinner for Claude Code via COLR Font Conversion
A developer created a method to replace Claude Code's default spinner with any animated GIF by converting the GIF into an OpenType COLR color font and patching the spinner to cycle through glyphs representing each frame. The tool currently supports Windows with macOS/Linux versions planned.

GitAgent: An Open Standard for Portable AI Agents in Git Repos
GitAgent is an open specification that defines AI agents through three core files in a git repository: agent.yaml for configuration, SOUL.md for personality/instructions, and SKILL.md for capabilities. The CLI allows running any agent repo directly with commands like npx @open-gitagent/gitagent run -r https://github.com/user/agent -a claude.

Foreman: Open Source Slack Bot for Remote Control of Local Claude Code
Foreman is a free, open source Slack bot that provides remote control for locally running Claude Code instances. It allows developers to send tasks to Claude from their phone while maintaining full local access to filesystem, tools, and environment.