Developer builds local AI research agent that creates podcasts from topics or YouTube links

A developer on r/LocalLLaMA built an autonomous research and podcast agent that runs entirely locally. What started as an attempt to avoid paying for TTS (text-to-speech) services evolved into a full system that can research topics and present information in human-like formats.
What the agent does
The system takes either a topic or a YouTube link as input and produces three outputs:
- A proper deep-dive report
- A conversational podcast-style script
- Generated audio for the podcast
How it works differently from fixed pipelines
The developer focused on making the agent behave less like a fixed pipeline and more like something that decides what to do next dynamically. Instead of step-by-step execution, it:
- Searches and pulls content
- Extracts insights (including from videos)
- Refines summaries in multiple passes
- Converts that into a natural back-and-forth conversation
Key challenges and solutions discovered during development
- Speed issues: Initial performance was rough, but parallelizing tasks made a significant difference
- Shallow summaries: Early summaries felt shallow, but implementing multi-step refinement helped substantially
- Robotic audio: The audio initially sounded robotic, but switching to a 2-speaker format made it much more natural
The developer noted that this project demonstrates how close we're getting to doing powerful AI work entirely on local machines, without relying on cloud services.
📖 Read the full source: r/LocalLLaMA
👀 See Also

Portable engineering system for Claude Code with hooks, specialized agents, and self-improvement
A developer built a portable engineering system that lives in ~/.claude/ and applies to every project automatically, featuring a 650-line constitution, deterministic hooks that block dangerous commands, three specialized agents, and a self-improving compound engineering approach.

AiPayGen: An AI Agent Marketplace with MCP Integration for Claude Code
AiPayGen is a marketplace with 142 AI agents across 27 categories including finance, legal, healthcare, DevOps, security, and marketing. It functions as an MCP server for direct use in Claude Code, allowing users to browse, invoke, and manage agents through specific commands.

Lightfeed Extractor: TypeScript Library for Robust Web Data Extraction with LLMs
Lightfeed Extractor is a TypeScript library that handles the full pipeline from raw HTML to validated structured data using LLMs, with features like HTML-to-markdown conversion, Zod schema validation, JSON recovery, and built-in Playwright browser automation.

Free AI Product Launch Playbook Repository for Claude Users
A developer has released a free repository containing a structured AI product launch playbook designed to work with Claude. The repo organizes launch experience into practical stages including strategy, preparation, execution, and includes templates and tool references.