OpenClaw Agent Automates AI News With LLM Curation

Automated AI News Pipeline with OpenClaw

This OpenClaw agent runs as a cron job 8 times daily (every 2 hours from 6:40 AM to 8:40 PM ET) to automate an AI newsroom. The pipeline scans multiple sources, curates content with LLMs, and publishes to Telegram with full automation.

Phase 1: Multi-Source Scanning

25 RSS feeds via blogwatcher with keyword filtering and 3-tier source ranking (TechCrunch, OpenAI Blog, Reuters Tech, Simon Willison, etc.)
13 Reddit subreddits via public JSON API with score-filtering and flair-filtering
Twitter/X via bird CLI (curated account lists by tier) and twitterapi keyword search (min 50 likes, 5K followers)
GitHub trending + release monitoring for 16 key AI repos
Tavily web search with 5 targeted queries and 2-day freshness window

All sources run best-effort—if one fails, the rest continue.

Phase 2: Scoring, Deduplication, and LLM Curation

Quality scoring script assigns points based on source tier, keyword signals, and breaking news indicators
Title similarity matching at 80% to collapse duplicate stories
Deterministic URL pre-filter checks against two history files: everything scanned and everything published
Top 8 articles get full text fetched (Cloudflare Markdown preferred, HTML fallback, 1,200 character cap)
Gemini Flash receives scored list, enriched articles, and editorial profile to pick and rank top 7 stories

Phase 3: Learning Editorial Profile

Markdown file captures preferences over time (Anthropic news, M&A over $100M, AI security incidents, geopolitics, etc.)
Currently at 82% scanner approval rate (4 out of 5 stories match preferences)
Nightly cron job updates profile based on daily approval and rejection decisions

Phase 4: Publishing Pipeline

Scan delivers 7 ranked stories to Telegram News Editing Group
"Draft #3" command triggers publishing pipeline
Story goes to Perplexity for fact validation and source gathering
Writer sub-agent (Claude Sonnet) trained on writing style with humanizer to remove AI tells
Draft reviewed by Perplexity for accuracy and writing feedback
Writer does final revisions
Gemini Nano Banana 2 generates cover image matching story
Posts to test channel first, then main channel after approval
Every published story logged with timestamps, message IDs, and source URLs

Cost and Technical Details

Total cost: about $5/month
Gemini Flash handles LLM editorial filtering (switched from Gemini CLI after OAuth issues)
Tavily free tier covers web search
Reddit JSON and GitHub API are free
Default model in Telegram group is GPT-5.3-codex (improved after setting thinking = high)

📖 Read the full source: r/openclaw