Claude Code AI Agent Controls Physical iPhone via Accessibility APIs

An AI agent is controlling a physical iPhone to write and post content to Reddit without human typing. The demonstration uses Claude Code (Anthropic's AI agent) running inside Blitz, a Mac app that connects AI agents to physical iPhones.
Technical Implementation
Blitz provides access to a real iPhone via WebDriverAgent, with navigation handled entirely through accessibility APIs. The developer hit one specific limitation: the normal tap command does not work on physical devices and instead returns a 404 "unknown command" error.
The workaround the developer found: a zero-distance swipe (same start and end coordinates) registers as a tap. Every button press in the demonstration uses this hack.
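A minimal sketch of the zero-distance swipe idea follows. The source does not show Blitz's actual tool interface, so the swipe callable and its (from_x, from_y, to_x, to_y) argument order are assumptions made for illustration, and a recording stub stands in for the real device call.

```python
from typing import Callable, List, Tuple

# Hypothetical signature: swipe(from_x, from_y, to_x, to_y).
# The real Blitz/WebDriverAgent call is not shown in the source.
Swipe = Callable[[int, int, int, int], None]


def tap_via_swipe(swipe: Swipe, x: int, y: int) -> None:
    """Emulate a tap by swiping from (x, y) to the very same point."""
    swipe(x, y, x, y)


# Recording stub used in place of a real device, so the sketch runs anywhere.
recorded: List[Tuple[int, int, int, int]] = []


def fake_swipe(x1: int, y1: int, x2: int, y2: int) -> None:
    recorded.append((x1, y1, x2, y2))


tap_via_swipe(fake_swipe, 120, 340)
print(recorded)
```

Passing the swipe function in as a parameter keeps the helper independent of any particular transport, so the same wrapper could sit in front of whatever swipe command the device bridge actually exposes.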
Step-by-Step Process
- Called describe_screen to find Reddit's icon coordinates on the home screen (scan_ui is simulator-only, so the accessibility tree was used instead)
- Zero-distance-swiped the icon to open Reddit
- Tapped the search button, saw r/ClaudeAI in recent searches, tapped it
- Hit Create, tapped the title field, typed the title
- Tapped the post body field and started writing
Every action follows the pattern: describe_screen → parse coordinates → swipe(x, y, x, y). The agent operates the phone "blind," reading the world through an accessibility JSON tree.
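The describe_screen → parse coordinates → swipe loop could look roughly like the sketch below. The JSON layout is an assumption: the source does not show the accessibility tree's schema, so the "label", "frame" and "children" keys, the helper names find_center and tap_label, and the sample tree are all hypothetical.

```python
import json
from typing import Callable, Optional, Tuple


def find_center(node: dict, label: str) -> Optional[Tuple[int, int]]:
    """Depth-first search for a node with the given label; return its frame center."""
    frame = node.get("frame")
    if node.get("label") == label and frame is not None:
        return (frame["x"] + frame["width"] // 2,
                frame["y"] + frame["height"] // 2)
    for child in node.get("children", []):
        hit = find_center(child, label)
        if hit is not None:
            return hit
    return None


def tap_label(describe_screen: Callable[[], str],
              swipe: Callable[[int, int, int, int], None],
              label: str) -> bool:
    """Read the screen, locate the element, and tap it with a zero-distance swipe."""
    tree = json.loads(describe_screen())
    center = find_center(tree, label)
    if center is None:
        return False  # element not on screen; the caller decides what to do next
    x, y = center
    swipe(x, y, x, y)
    return True


# Stubs standing in for the real tools (assumed schema, made-up coordinates).
SAMPLE_TREE = json.dumps({
    "label": "Home",
    "children": [
        {"label": "Reddit",
         "frame": {"x": 24, "y": 100, "width": 60, "height": 60}},
    ],
})
swipes = []
tapped = tap_label(lambda: SAMPLE_TREE,
                   lambda *args: swipes.append(args),
                   "Reddit")
print(tapped, swipes)
```

Because the agent never sees pixels, every step depends on the tree being fresh, which is why the sketch re-reads the screen on each call instead of caching coordinates.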
The developer notes they also posted to r/vibecoding about the same session, where they got feedback about their initial title choice.
📖 Read the full source: r/ClaudeAI