OpenClaw Agent Automates AI News Pipeline with LLM Curation

Automated AI News Pipeline with OpenClaw
This OpenClaw agent runs as a cron job 8 times a day (every 2 hours from 6:40 AM to 8:40 PM ET) to automate an AI newsroom. The pipeline scans multiple sources, curates content with LLMs, and publishes to Telegram after an approval step.
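To make the schedule concrete, here is a minimal sketch that lists the run times implied by "every 2 hours from 6:40 AM to 8:40 PM". The cron expression and the `run_times` helper are illustrative assumptions, not the author's actual configuration, and how the ET timezone is applied depends on the scheduler.

```python
from datetime import time

# Assumed cron expression for the described schedule: minute 40 of every
# second hour from 06:00 through 20:00. Not taken from the original post.
CRON_EXPR = "40 6-20/2 * * *"


def run_times(start_hour=6, end_hour=20, step=2, minute=40):
    """Return the daily run times implied by the schedule (hypothetical helper)."""
    return [time(hour, minute) for hour in range(start_hour, end_hour + 1, step)]


if __name__ == "__main__":
    for t in run_times():
        print(t.strftime("%H:%M"))
```

Counting the entries gives the 8 daily runs mentioned in the text.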
Phase 1: Multi-Source Scanning
- 25 RSS feeds via blogwatcher with keyword filtering and 3-tier source ranking (TechCrunch, OpenAI Blog, Reuters Tech, Simon Willison, etc.)
- 13 Reddit subreddits via public JSON API with score-filtering and flair-filtering
- Twitter/X via bird CLI (curated account lists by tier) and twitterapi keyword search (min 50 likes, 5K followers)
- GitHub trending + release monitoring for 16 key AI repos
- Tavily web search with 5 targeted queries and 2-day freshness window
All sources run best-effort: if one fails, the rest continue.
Phase 2: Scoring, Deduplication, and LLM Curation
- Quality scoring script assigns points based on source tier, keyword signals, and breaking news indicators
- Title similarity matching at 80% to collapse duplicate stories
- Deterministic URL pre-filter checks against two history files: everything scanned and everything published
- Top 8 articles get full text fetched (Cloudflare Markdown preferred, HTML fallback, 1,200 character cap)
- Gemini Flash receives scored list, enriched articles, and editorial profile to pick and rank top 7 stories
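The deterministic steps before the LLM call (URL pre-filter, 80% title matching, top-8 selection, 1,200-character cap) could look roughly like the sketch below. It assumes `difflib.SequenceMatcher` as the similarity measure and a simple dict shape for articles (`title`, `url`, `score`); the post does not say which similarity algorithm or data structures the real pipeline uses.

```python
from difflib import SequenceMatcher

SIMILARITY_THRESHOLD = 0.80  # "title similarity matching at 80%"
TOP_N_ENRICH = 8             # articles that get full text fetched
FULL_TEXT_CAP = 1200         # character cap on fetched text


def is_duplicate(title_a, title_b, threshold=SIMILARITY_THRESHOLD):
    """Treat two titles as the same story when their similarity ratio is high."""
    ratio = SequenceMatcher(None, title_a.lower(), title_b.lower()).ratio()
    return ratio >= threshold


def select_candidates(articles, seen_urls):
    """Drop already-seen URLs, collapse near-duplicate titles, keep the top scorers."""
    fresh = [a for a in articles if a["url"] not in seen_urls]
    fresh.sort(key=lambda a: a["score"], reverse=True)
    kept = []
    for article in fresh:
        # The highest-scoring copy of a story is seen first and wins.
        if not any(is_duplicate(article["title"], k["title"]) for k in kept):
            kept.append(article)
    return kept[:TOP_N_ENRICH]


def cap_text(text, limit=FULL_TEXT_CAP):
    """Truncate fetched article text to the character cap."""
    return text[:limit]
```

Here `seen_urls` stands in for the union of the two history files (scanned and published). Sorting by score before deduplicating is a design choice of this sketch, so that the strongest version of a story survives.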
Phase 3: Learning Editorial Profile
- Markdown file captures preferences over time (Anthropic news, M&A over $100M, AI security incidents, geopolitics, etc.)
- Currently at 82% scanner approval rate (roughly 4 out of 5 stories match preferences)
- Nightly cron job updates profile based on daily approval and rejection decisions
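As a rough illustration of the approval-rate figure, the sketch below computes the share of approved stories from a day's decisions. The `"approved"`/`"rejected"` labels and the sample counts are made up for the example; the post does not describe how decisions are actually stored.

```python
def approval_rate(decisions):
    """Fraction of decisions marked 'approved'; 0.0 for an empty list."""
    if not decisions:
        return 0.0
    approved = sum(1 for d in decisions if d == "approved")
    return approved / len(decisions)


if __name__ == "__main__":
    # Hypothetical sample: 41 approvals out of 50 scanned stories.
    sample = ["approved"] * 41 + ["rejected"] * 9
    print(f"{approval_rate(sample):.0%}")
```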
Phase 4: Publishing Pipeline
- Scan delivers 7 ranked stories to Telegram News Editing Group
- "Draft #3" command triggers publishing pipeline
- Story goes to Perplexity for fact validation and source gathering
- Writer sub-agent (Claude Sonnet), trained on the writing style, drafts the story and applies a humanizer to remove AI tells
- Draft reviewed by Perplexity for accuracy and writing feedback
- Writer does final revisions
- Gemini Nano Banana 2 generates cover image matching story
- Posts to test channel first, then main channel after approval
- Every published story logged with timestamps, message IDs, and source URLs
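The final logging step could be sketched as an append-only JSON Lines file. The field names, the file format, and both helper functions are assumptions for illustration; the post only states that timestamps, message IDs, and source URLs are recorded.

```python
import json
from datetime import datetime, timezone


def make_log_entry(title, message_ids, source_urls, now=None):
    """Build one log record for a published story (hypothetical schema)."""
    now = now or datetime.now(timezone.utc)
    return {
        "published_at": now.isoformat(),
        "title": title,
        "message_ids": list(message_ids),
        "source_urls": list(source_urls),
    }


def append_log(path, entry):
    """Append the record as one JSON line so earlier entries are never rewritten."""
    with open(path, "a", encoding="utf-8") as fh:
        fh.write(json.dumps(entry) + "\n")
```

A log like this could also serve as the "everything published" history file that the URL pre-filter checks, though the post does not confirm that the two are the same file.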
Cost and Technical Details
- Total cost: about $5/month
- Gemini Flash handles LLM editorial filtering (switched from Gemini CLI after OAuth issues)
- Tavily free tier covers web search
- Reddit JSON and GitHub API are free
- Default model in Telegram group is GPT-5.3-codex (improved after setting thinking = high)
📖 Read the full source: r/openclaw