OpenClaw Agent Automates AI News Pipeline with LLM Curation

✍️ OpenClawRadar📅 Published: March 3, 2026🔗 Source
OpenClaw Agent Automates AI News Pipeline with LLM Curation
Ad

Automated AI News Pipeline with OpenClaw

This OpenClaw agent runs as a cron job 8 times daily (every 2 hours from 6:40 AM to 8:40 PM ET) to automate an AI newsroom. The pipeline scans multiple sources, curates content with LLMs, and publishes to Telegram with full automation.

Phase 1: Multi-Source Scanning

  • 25 RSS feeds via blogwatcher with keyword filtering and 3-tier source ranking (TechCrunch, OpenAI Blog, Reuters Tech, Simon Willison, etc.)
  • 13 Reddit subreddits via public JSON API with score-filtering and flair-filtering
  • Twitter/X via bird CLI (curated account lists by tier) and twitterapi keyword search (min 50 likes, 5K followers)
  • GitHub trending + release monitoring for 16 key AI repos
  • Tavily web search with 5 targeted queries and 2-day freshness window

All sources run best-effort—if one fails, the rest continue.

Phase 2: Scoring, Deduplication, and LLM Curation

  • Quality scoring script assigns points based on source tier, keyword signals, and breaking news indicators
  • Title similarity matching at 80% to collapse duplicate stories
  • Deterministic URL pre-filter checks against two history files: everything scanned and everything published
  • Top 8 articles get full text fetched (Cloudflare Markdown preferred, HTML fallback, 1,200 character cap)
  • Gemini Flash receives scored list, enriched articles, and editorial profile to pick and rank top 7 stories
Ad

Phase 3: Learning Editorial Profile

  • Markdown file captures preferences over time (Anthropic news, M&A over $100M, AI security incidents, geopolitics, etc.)
  • Currently at 82% scanner approval rate (4 out of 5 stories match preferences)
  • Nightly cron job updates profile based on daily approval and rejection decisions

Phase 4: Publishing Pipeline

  • Scan delivers 7 ranked stories to Telegram News Editing Group
  • "Draft #3" command triggers publishing pipeline
  • Story goes to Perplexity for fact validation and source gathering
  • Writer sub-agent (Claude Sonnet) trained on writing style with humanizer to remove AI tells
  • Draft reviewed by Perplexity for accuracy and writing feedback
  • Writer does final revisions
  • Gemini Nano Banana 2 generates cover image matching story
  • Posts to test channel first, then main channel after approval
  • Every published story logged with timestamps, message IDs, and source URLs

Cost and Technical Details

  • Total cost: about $5/month
  • Gemini Flash handles LLM editorial filtering (switched from Gemini CLI after OAuth issues)
  • Tavily free tier covers web search
  • Reddit JSON and GitHub API are free
  • Default model in Telegram group is GPT-5.3-codex (improved after setting thinking = high)

📖 Read the full source: r/openclaw

Ad

👀 See Also