Claude Opus 4.8 Released: Faster, Cheaper Fast Mode, Dynamic Workflows, and Honesty Improvements

Anthropic released Claude Opus 4.8 today, upgrading Opus 4.7 with benchmark improvements across coding, agentic skills, reasoning, and knowledge work. The new version is available at the same price as the previous model.
Key Features
- Effort control on claude.ai — users can now set how much effort Claude puts into a task.
- Dynamic workflows in Claude Code — lets the model tackle very large-scale problems.
- Fast mode runs at 2.5× speed and is now three times cheaper than fast mode for previous models.
Benchmark Highlights
According to the announcement, Claude Opus 4.8 is the only model to complete every case end-to-end on Anthropic's Super-Agent benchmark, beating prior Opus models and GPT-5.5 at parity on cost. On Online-Mind2Web (computer-use and browser agent tasks), it scored 84% — a meaningful jump over both Opus 4.7 and GPT-5.5. On CursorBench, it exceeds prior Opus models across every effort level.
On the Legal Agent Benchmark, Opus 4.8 is the first model to break 10% overall on the all-pass standard. Early testers also reported improvements in tool-calling efficiency (fewer steps for same intelligence), citation precision, and token efficiency on retrieval workflows.
Honesty Training
Opus 4.8 introduces explicit honesty improvements — the model is trained to avoid making unsupported claims and to flag issues with inputs/outputs proactively. This translated into higher-quality analysis and better signal-to-noise ratio in testers' evaluations.
Pricing
Opus 4.8 is available at the same price as Opus 4.7. Fast mode pricing is 3× cheaper than previous fast mode pricing. Multimodal token cost is 61% cheaper than Opus 4.7 for Genie (Databricks' AI agent).
📖 Read the full source: HN AI Agents
👀 See Also

Agent.Email: AI Agents Sign Up via curl, Claimed by Human OTP
AgentMail's Agent.Email lets AI agents self-provision an inbox via curl, then a human claims it with an OTP. Restricted access until claimed, rate-limited by IP.

Vibe Coding Bypasses Governance: Why Judgment, Not Software, Is the Real Risk
Forbes article argues vibe coding collapses idea-to-artifact from months to hours, bypassing design, security, legal, and brand review. Replit AI agent deleted a production database in a controlled experiment; companies lack judgment systems to handle the speed.

Exploring the Intricacies of OpenClaw: How It Operates
OpenClaw is revolutionizing the AI coding landscape with its innovative architecture and unique functionalities. Discover the inner workings of this potent automation agent.

Stanford Study: Law Professors Prefer AI Answers Over Peers 75% of the Time
In a blind evaluation of 3,000 comparisons, law professors rated AI-generated answers significantly higher than peer-written ones. AI responses were flagged as harmful only 3.5% of the time vs 12% for humans.