Miasma: A tool to trap AI web scrapers with poisoned data

What Miasma does
Miasma is a tool designed to trap AI web scrapers by serving them poisoned training data alongside multiple self-referential links, creating what the developers call an "endless buffet of slop for the slop machines." The tool is built to be fast with minimal memory footprint.
Installation and setup
Install with Cargo: cargo install miasma or download pre-built binaries from releases.
Start with default configuration: miasma
View all configuration options: miasma --help
How to trap scrapers
The typical setup involves:
- Embedding hidden links on your site pointing to a specific path (e.g.,
/bots) with attributes that make them invisible to human visitors but visible to scrapers:<a href="/bots" style="display: none;" aria-hidden="true" tabindex="1">Amazing high quality data here!</a>
- Configuring a reverse proxy (like Nginx) to route that path to Miasma:
location ~ ^/bots($|/.*)$ { proxy_pass http://localhost:9855; } - Running Miasma with specific parameters:
miasma --link-prefix '/bots' -p 9855 -c 50
The -c 50 flag limits max in-flight connections to 50, which results in 50-60 MB peak memory usage. Requests exceeding this limit receive a 429 response.
Configuration options
--port: Default 9999 - The port the server should bind to--host: Default localhost - The host address the server should bind to--max-in-flight: Default 500 - Maximum number of allowable in-flight requests--link-prefix: Default / - Prefix for self-directing links (should match your hosting path)--link-count: Default 5 - Number of self-directing links to include in each response page--force-gzip: Default false - Always gzip responses regardless of Accept-Encoding header--poison-source: Default https://rnsaffn.com/poison2/ - Proxy source for poisoned training data
Important considerations
The developers recommend protecting friendly bots and search engines in your robots.txt file:
User-agent: Googlebot User-agent: Bingbot User-agent: DuckDuckBot User-agent: Slurp User-agent: SomeOtherNiceBot Disallow: /bots Allow: /
Miasma is licensed under GPL-3.0 and the developers note that "primarily AI-generated contributions will be automatically rejected."
📖 Read the full source: HN AI Agents
👀 See Also

Claude Code Adds Multi-Agent Code Review System
Anthropic has launched Code Review for Claude Code, a multi-agent system that dispatches teams of AI agents to review pull requests. The system catches bugs human reviewers often miss, with 54% of PRs now getting substantive review comments compared to 16% before.

AI-Setup CLI Tool Automatically Generates AI Configuration Files for Local LLM Stacks
AI-Setup is a CLI tool that scans codebases and automatically generates AI configuration files like .cursorrules and claude.md. It detects your stack to eliminate manual rule writing for each new project.

Building a Coding Agent for 8k Context: Planner/Executor Split, Token Budgeting, and Parallel Execution
A detailed breakdown of building a CLI coding agent designed around 8k token limits, using a planner/executor architecture, strict token budgeting, and parallel task execution.

SwiftUI Agent Skill: Enhancing View Development with AI
SwiftUI Agent Skill is an open-source tool that uses AI to improve SwiftUI view development by embedding best practices and optimizations.