Coasty AI Agent Solves CAPTCHA Challenges Up to Level 6 Without Training

Coasty's Computer Using Agent Handles Real Desktop Challenges
Coasty's Computer Using Agent (CUA) has demonstrated the ability to solve CAPTCHA challenges up to Level 6 without being specifically trained for "I'm not a robot" tests. The agent achieved 82% on the OSWorld benchmark, which represents state-of-the-art performance for computer-use agents operating in real desktop environments.
The agent handles various web interface challenges that typically break other agents, including:
- CAPTCHA challenges up to Level 6
- Browser popups
- Cookie banners
According to the source, the developers did not teach the CUA to solve "I'm not a robot" challenges specifically, noting that "the irony is not lost on us." The agent's performance suggests it has developed generalized computer interaction capabilities rather than specialized solutions for individual challenge types.
A replay link is available for those interested in seeing the agent in action: https://coasty.ai/share/1cd404ae-3fcb-4d7f-b9d4-dac7aa26fc6d
📖 Read the full source: HN AI Agents
👀 See Also

Claude Opus 4.6 Memory Fails: Agent Forgets Everything Except File Rename
A developer documents Claude Opus 4.6's 228 log entries, 95 agent actions, and 38 code executions producing only 1 memory: the string 'Agent Zero Tune-Up'.

SwitchBot's AI Hub Set to Integrate OpenClaw for Enhanced Smart Home Automation
SwitchBot's AI Hub is about to get a significant upgrade with the integration of OpenClaw. This move promises enhanced automation and smarter home management capabilities.

Anthropic Removes Gmail Message Body Access from Claude Connector
Anthropic has removed the gmail_read_message and gmail_search_messages tools from the Gmail connector, replacing them with get_thread and search_threads that no longer return message bodies or attachment content.

Claude Code System Prompt Assembly and Structure Revealed
A source map leak in Claude Code's npm package exposed the system prompt assembly flow, showing static prefix sections followed by dynamic session-specific content, with three identity variants and detailed execution guidelines.