ToolLoop: Open-Source Agent Framework for Claude-Style Tools with Any Model

What ToolLoop Does
ToolLoop is an open-source Python framework built to provide Claude Code-style functionality without being locked into a single provider. The framework includes 11 tools covering file operations, code search, shell access, and sub-agents, and works with any LLM through LiteLLM integration.
Key Features
- Open-source framework written in Python
- Approximately 2,700 lines of code
- 11 built-in tools: file operations, code search, shell, sub-agents
- Works with any LLM through LiteLLM
- Supports any model on Bedrock (DeepSeek, Llama, Mistral) or any API (OpenAI, Moonshot, local endpoints)
- Switch models mid-conversation with shared context
- Same tools and prompts work across different models
Code Example
Here's the SDK usage example from the source:
from sdk import query, ToolLoopOptions
async for event in query(
prompt="Find all TODO comments, fix them, run the tests",
options=ToolLoopOptions(
model="bedrock/converse/deepseek.v3.2",
allowed_tools=["Read", "Edit", "Grep", "Glob", "Bash"],
),
):
print(event)
You can swap deepseek.v3.2 for any model while maintaining the same tools and prompt structure.
Who This Is For
Developers who want Claude Code-style agent functionality but need flexibility to use different models across various providers and local setups.
📖 Read the full source: r/LocalLLaMA
👀 See Also

Open-source tool for AI-curated Reddit feeds using Cloudflare, Supabase, and Vercel
A developer open-sourced a self-hosted tool that filters Reddit for quality posts about AI-assisted development, using Cloudflare Workers for cron jobs and proxies, Supabase for storage, and Vercel for the frontend. The tool includes engagement scoring, optional LLM summaries, and costs $1-2/month for AI processing.

agentcache: Python Library for Multi-Agent LLM Prefix Caching
agentcache is a Python library that enables multi-agent LLM frameworks to share cached prompt prefixes, achieving up to 76% cache hit rates and cutting inference time by more than half in tests with GPT-4o-mini.

iknowkungfu Skill Analyzes OpenClaw Usage to Recommend Missing Skills
iknowkungfu is a new OpenClaw skill that scans your agent's workspace, memory files, and conversation logs to identify what skills you're missing based on your actual usage patterns. It provides specific recommendations with trust scores and reasons tied to your workflow.

AI Code Review Benchmark: Claude, Gemini, Codex, Qwen, and MiniMax Compared
A benchmark tested five AI models on 15 Milvus pull requests with known bugs. Claude detected 53% of bugs in raw mode, while adversarial debate between models increased detection to 80%.