Local-First Movie Recap Pipeline Using Whisper + CLIP + Ollama

A developer built an automated pipeline that turns a movie file into a narrated recap video. The stack is designed to be local-first: Whisper for transcription, CLIP for scene matching, Ollama (or a remote provider such as OpenAI, Gemini, or Anthropic) for script generation, Edge TTS for voiceover, and FFmpeg for rendering.
How it works
- Input: Drop in any movie file via a simple web UI.
- Transcription: Whisper extracts dialogue and timestamps.
- Scene matching: CLIP identifies visual scenes that match the narrative.
- Script generation: Ollama (or any API provider) writes a concise recap script.
- Voiceover + rendering: Edge TTS generates narration, FFmpeg composites everything into a final video.
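The scene-matching step above can be sketched as a nearest-neighbor lookup over embeddings. This is a minimal sketch, not the author's implementation: it assumes the narration lines and sampled movie frames have already been embedded into a shared vector space (which is what CLIP's text and image encoders provide), and the names `match_scenes` and `ffmpeg_cut_cmd` are hypothetical.

```python
import math
from typing import List, Sequence


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity between two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def match_scenes(
    line_vecs: Sequence[Sequence[float]],
    frame_vecs: Sequence[Sequence[float]],
    frame_times: Sequence[float],
) -> List[float]:
    """For each narration line, return the timestamp (seconds) of the
    most similar frame. Embeddings are assumed to be precomputed."""
    matches = []
    for line_vec in line_vecs:
        best = max(
            range(len(frame_vecs)),
            key=lambda i: cosine(line_vec, frame_vecs[i]),
        )
        matches.append(frame_times[best])
    return matches


def ffmpeg_cut_cmd(src: str, start: float, duration: float, out: str) -> List[str]:
    """Build an FFmpeg command that cuts one clip from the source movie.
    Stream copy (-c copy) is fast but snaps to keyframes, so cuts are approximate."""
    return [
        "ffmpeg", "-y",
        "-ss", str(start),   # seek to the matched timestamp
        "-i", src,
        "-t", str(duration), # clip length in seconds
        "-c", "copy",
        out,
    ]
```

Each returned timestamp could then be passed to `ffmpeg_cut_cmd` and run with `subprocess.run(cmd, check=True)` to produce the clip shown while that narration line plays.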
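With Ollama, script generation runs locally; remote LLM APIs (OpenAI, Gemini, Anthropic) can be plugged in instead. Note that Edge TTS relies on Microsoft's online speech service, so the voiceover step still needs a network connection. The reported total runtime is approximately 15 minutes, and no manual editing is required.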
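As an illustration of the local script-generation path, here is a minimal sketch of a request to Ollama's `/api/generate` endpoint on its default port. The model name `llama3`, the prompt wording, and the function names are assumptions made for this example, not details from the original project.

```python
import json
import urllib.request

# Ollama's default local endpoint for one-shot text generation.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_ollama_request(transcript: str, model: str = "llama3") -> urllib.request.Request:
    """Build (but do not send) a POST request asking Ollama for a recap script.
    The model name and prompt text are placeholders."""
    prompt = (
        "Write a concise narrated recap of this movie based on its transcript:\n\n"
        + transcript
    )
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate_script(transcript: str, model: str = "llama3") -> str:
    """Send the request to a running Ollama server and return the generated text."""
    with urllib.request.urlopen(build_ollama_request(transcript, model)) as resp:
        return json.loads(resp.read())["response"]
```

Keeping request construction separate from sending makes it easy to swap in a remote provider behind the same `generate_script` interface. A full movie transcript may exceed a local model's context window, so a real pipeline would likely need to chunk or summarize it first.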
Who it's for
Developers building automated video generation pipelines, or anyone who wants to batch-produce movie recaps without relying on a cloud LLM.
📖 Read the full source: r/LocalLLaMA