FORGE: Open Source AI Security Testing Framework for LLM Systems

FORGE (Framework for Orchestrated Reasoning & Generation of Engines) is an open source autonomous AI security testing framework for LLM systems that runs 24/7 and covers OWASP LLM Top 10 vulnerabilities.
Key Features
- Builds its own tools mid-run — generates custom Python modules on the spot when encountering unknown vulnerabilities
- Self-replicates into a swarm — creates subprocess copies that share a live hive mind
- Learns from every session — uses SQLite to store patterns, AI scores findings, and genetic algorithms evolve its own prompts
- AI pentesting AI — 7 modules covering OWASP LLM Top 10 vulnerabilities
- Honeypot — fake vulnerable AI endpoint that catches attackers and classifies whether they're human or AI agent
- 24/7 monitor — watches AI in production, alerts on latency spikes, attack bursts, and injection attempts via Slack/Discord webhook
- Stress tester — OWASP LLM04 DoS resilience testing with live TPS dashboard and A-F grade
- Works on any model — Claude, Llama, Mistral, DeepSeek, GPT-4, Groq, anything — one environment variable to switch
OWASP LLM Top 10 Coverage
- LLM01 Prompt Injection → prompt_injector + jailbreak_fuzzer (125 payloads)
- LLM02 Insecure Output → rag_leaker
- LLM04 Model DoS → overloader (8 stress modes)
- LLM06 Sensitive Disclosure → system_prompt_probe + rag_leaker
- LLM07 Insecure Plugin → agent_hijacker
- LLM08 Excessive Agency → agent_hijacker
- LLM10 Model Theft → model_fingerprinter
Setup and Usage
Installation commands:
git clone https://github.com/umangkartikey/forge
cd forge
pip install anthropic rich
export ANTHROPIC_API_KEY=your_keyRun with local Ollama for free:
FORGE_BACKEND=ollama FORGE_MODEL=llama3.1 python forge.pyThe tool addresses common LLM security gaps: most AI apps deployed today have never been red teamed, system prompts are fully extractable, jailbreaks work, RAG pipelines leak, and indirect prompt injection via tool outputs is almost universally unprotected. FORGE automates finding these vulnerabilities the same way a human red teamer would, but faster and running 24/7.
📖 Read the full source: r/LocalLLaMA
👀 See Also

Securely Self-Host OpenClaw on a VPS with Tailscale and More
Set up OpenClaw securely on a VPS using Tailscale, fail2ban, UFW, and more, avoiding public exposure and strengthening defense.

Understanding ClawBands: Security Bands for OpenClaw Agents
ClawBands offer a security enhancement for OpenClaw agents, likely focusing on access control or secure data handling.

Security Alert for Local OpenClaw Instances Without Sandboxing
A Reddit post warns that running vanilla OpenClaw instances locally without proper isolation can lead to exposed API keys, accidental file deletion, and data leaks. The source recommends sandboxing bash tools or using a managed service.

Hackerbot-Claw: AI Bot Exploiting GitHub Actions Workflows
An AI-powered bot called hackerbot-claw executed a week-long automated attack campaign against CI/CD pipelines, achieving remote code execution in at least 4 out of 6 targets including Microsoft, DataDog, and CNCF projects. The bot used 5 different exploitation techniques and exfiltrated a GitHub token with write permissions.