civStation: Open-Source VLM Harness for Natural Language Control of Civilization VI

What civStation Does
civStation is an open-source, controllable computer-use stack and VLM harness built specifically for Civilization VI. Instead of treating the game as a low-level UI automation problem, the project focuses on strategy-level control. You can give natural language inputs like "expand to the east", "focus on economy this turn", or "aim for a science victory", and the system translates that intent into actual in-game actions.
Core Architecture and Loop
The system implements a complete loop: screen observation → strategy interpretation → action planning → execution → human override. This shifts the interface upward from direct execution to intent expression and controllable delegation. The goal wasn't just to make an agent play Civ6, but to build a loop where the model can observe the game screen, interpret high-level strategy, plan actions, execute them through mouse and keyboard, and be interrupted or guided live through human-in-the-loop (HitL) or MCP.
Current Features and Capabilities
- Live desktop observation
- Real UI interaction on the host machine
- Runtime control interface
- Human-in-the-loop control
- MCP/skill extensibility
- Natural language or voice-driven control
Research Questions and Motivation
The creator is exploring several questions: Where should the boundary be between strategy and execution? How controllable can a computer-use agent be before the loop becomes too slow or brittle? Does this approach make sense only for games, or also for broader desktop workflows?
The motivation stems from observing that most computer-use demos focus on "watch the model click," while civStation aims for something closer to a controllable runtime where you can operate at the level of strategy instead of raw UI interaction. Another motivation was testing whether voice and natural language, combined with computer-use, could open a different interaction layer where the player behaves more like a strategist giving directives rather than directly executing actions.
Repository and Availability
The project is available at: https://github.com/NomaDamas/civStation.git
📖 Read the full source: r/LocalLLaMA
👀 See Also

CC-Wiki: Turn Claude Code Sessions into a Shareable Quartz Knowledge Base
CC-Wiki converts your ~/.claude session history into a Quartz-based knowledge base. One command installs it; running /cc-wiki inside a Claude Code session packages the conversation.

Audio Engineer Builds Mix Analysis Tool with Claude Code
An audio engineer created a tool that analyzes audio mixes using the Web Audio API and Claude to provide specific feedback on issues like muddy low-mids, lack of headroom, and buried vocals. The tool offers a free tier for quick analysis and a paid pro report with detailed frequency notes and plugin suggestions.

Claude Banana: A Claude Code plugin for image generation with design system awareness
Claude Banana is a Claude Code plugin that generates images using Google's Gemini API with context-aware prompt crafting. It reads Tailwind configs, CSS variables, design tokens, and existing assets to understand project visual styles.

Ephemeral OpenClaw setups with network sandboxing and auto-teardown
A setup that boots OpenClaw inside an ephemeral VM with a network egress allowlist, injects API keys into RAM-backed storage, and includes a 2-hour auto-teardown. All LLM calls are recorded to SQLite for replay.