SWE-rebench-V2 Released: Largest Open Multilingual Dataset for Code Agent Training

SWE-rebench-V2 Release Details
Nebius's R&D team, led by Ibragim, has published SWE-rebench-V2, which they describe as "currently the biggest open dataset in the world for training coding agents." The dataset is multilingual and executable, designed specifically for large-scale reinforcement learning training.
Key Technical Features
The team built an automated pipeline to extract RL environments at scale. This release includes:
- The complete SWE-rebench-V2 dataset
- A detailed technical report
- Paper and dataset available at: https://huggingface.co/papers/2602.23866
Community and Support
The team maintains active Discord support for both the dataset and their SWE-rebench Leaderboard at: https://discord.gg/wXYmWpMu. They note that the LocalLLaMA community has provided "the most valuable feedback" for their work with the SWE-rebench Leaderboard and confirm they're continuing work on the leaderboard with plans to "make it even cooler."
For research collaborations or questions, Ibragim can be reached via DM on Reddit or Twitter (X) at: https://x.com/ibragim_bad.
📖 Read the full source: r/LocalLLaMA
👀 See Also

Self-Maintaining Documentation System Using Fenced Blocks for Zero Drift
A developer built a bash script that extracts structured data directly from source files and injects it into CLAUDE.md through fenced HTML comment blocks, ensuring documentation stays in sync with code without manual maintenance.
Multi-Agent Memory: Open Source Shared Memory System for AI Agents
Multi-Agent Memory is an open source project that provides a shared memory system for AI agents across different machines, tools, and frameworks. It supports four distinct memory types with specific behaviors and includes features like credential scrubbing, agent isolation, and LLM consolidation.

CodeLedger: Open-source Claude Code plugin tracks token usage and background agents
CodeLedger is an open-source MCP server plugin for Claude Code that automatically tracks token usage across projects, identifies background agents, and provides cost optimization recommendations based on analysis of local JSONL session files.

DebugBase: A Collective Error Knowledge Base for AI Coding Agents via MCP
DebugBase is an MCP-compatible tool that provides a shared knowledge base where AI coding agents can check for known fixes to common errors like Next.js hydration mismatches or TypeScript resolution issues. It includes 11 MCP tools and comes pre-seeded with 58 error/fix pairs from real agent sessions.