Automating Datadog Alert Triage with Claude Code and MCP

A developer at Quickchat created an automated system to handle morning Datadog alert triage using Claude Code and the Model Context Protocol (MCP). The system eliminates manual checking of Datadog dashboards by having AI agents analyze alerts, classify issues, and open pull requests with fixes.
Setup Components
The implementation involves three main components:
1. Datadog MCP Server Integration
Datadog provides a remote MCP server with OAuth authentication. Configuration requires one file in the repository root:
// .mcp.json
{
"mcpServers": {
"datadog": {
"type": "http",
"url": "https://mcp.datadoghq.eu/api/unstable/mcp-server/mcp"
}
}
}
Developers authenticate with a single browser click. For US1 region users, replace datadoghq.eu with datadoghq.com.
2. Claude Code Skill for Triage
A skill file at .claude/skills/triage-datadog defines the triage workflow in four phases:
- Gather: Check Datadog for monitors, error logs, and incidents from the last 24 hours
- Classify: Sort findings into three categories: Actionable (code bugs), Infrastructure (server problems), and Noise (transient blips)
- Fix: For each real bug, spin up an AI agent in an isolated git worktree to find root causes, write fixes with tests, and open PRs
- Report: Summarize findings in a table format
Agents run in parallel to avoid sequential waiting.
3. Cron Job Automation
The system runs automatically on weekdays at 8 AM with this crontab entry:
3 8 * * 1-5 claude -p --dangerously-skip-permissions '/triage-datadog'
The -p flag prints output without conversation, and --dangerously-skip-permissions allows the agent to proceed without human approval for each file read. Each agent runs in a sandboxed macbox session with scoped git worktrees, no access to production infrastructure, secrets, or deployment pipelines.
For additional security, tools can be restricted with an explicit allowlist:
claude -p --dangerously-skip-permissions --allowedTools "Bash(git:*) Bash(gh:*) Edit Read Grep Glob Agent" '/triage-datadog'
The developer reports the entire setup took about 30 minutes to implement.
📖 Read the full source: HN AI Agents
👀 See Also

Claude Code v2.1.90 adds mouse support with CLAUDE_CODE_NO_FLICKER flag
Anthropic released Claude Code v2.1.90 with a new feature that enables mouse support in the chat interface. Users can activate it by setting the CLAUDE_CODE_NO_FLICKER=1 environment variable before running claude.

Specsmaxxing: Fighting AI Psychosis with YAML Specs and ACAI
Acai.sh introduces Specsmaxxing: a method to combat AI agents losing context by writing requirements in YAML and using numbered Acceptance Criteria for AI (ACAI) that agents reference in code.

Claude Code's Official Telegram Plugin: Setup Notes and Migration from OpenClaw
A developer migrated from OpenClaw to Claude Code's official Telegram integration, documenting the setup process and creating an open-source migration skill. The integration connects via BotFather tokens and offers better token efficiency and cleaner communication.

SourceBridge: Open-source tool for codebase analysis using local LLMs
SourceBridge is an open-source tool that indexes Git repositories into symbol graphs and uses local LLMs to generate codebase summaries, architecture walkthroughs, and learning materials. It supports multiple local backends including Ollama, llama.cpp, vLLM, LM Studio, and SGLang via OpenAI-compatible APIs.