4 min Applications

Anthropic releases Claude Opus 4.8, promising a more honest model

Anthropic releases Claude Opus 4.8, promising a more honest model

Anthropic has released Claude Opus 4.8, an incremental upgrade to Opus 4.7. The new model is more ‘honest’ in its self-assessments and roughly four times less likely to let coding errors pass unremarked. It launches at the same price as its predecessor, alongside new features including dynamic workflows and user-controlled effort levels. Despite assurances from Anthropic that it will eventually release a Mythos-level LLM, this doesn’t appear to be anywhere close to that presumed capability.

The improvements in supposed “honesty” are what sets Opus 4.8 apart from its direct predecessor, according to Anthropic. The word here simply means that Claude will produce words that more closely adhere to its actions and previously implicit assessments. Early testers reported the model is quicker to flag uncertainties and less likely to make unsupported claims about its own work. According to Anthropic’s alignment assessment, Opus 4.8 “reaches new highs on our measures of prosocial traits like supporting user autonomy and acting in the user’s best interest.”

In terms of practical benefits, Opus 4.8 reaches new heights in agentic coding, multidisciplinary reasoning, computer use, knowledge work and financial analysis. Improvements range from less than 1 percentage point to nearly 9 percent. The difference between 4.7 and 4.8 in these statistics suggest the informal, unmeasured day-to-day experience won’t be all that different at any given point, while still boosting overall results long-term. Time will tell if that is true.

That aforementioned alignment assessment showed rates of deception or cooperation with misuse being substantially lower than those of Opus 4.7. Notably, those rates are now comparable to Claude Mythos Preview, the powerful model Anthropic has kept under tight access restrictions due to its advanced cybersecurity capabilities. Pricing is unchanged for Opus, at any rate, at $5 per million input tokens and $25 per million output tokens.

Dynamic workflows and effort control

The launch comes bundled with several platform updates. The most significant for developers is dynamic workflows, currently available in research preview for Claude Code. It allows Claude to plan a task, then spin up hundreds of parallel subagents in a single session. Outputs are verified before being reported back. According to Anthropic, Claude Code with Opus 4.8 can now execute codebase-scale migrations across hundreds of thousands of lines of code from start to merge.

Users on claude.ai also get a new effort control slider. On higher settings Claude thinks more deeply; on lower settings it responds faster and uses rate limits more slowly. The control is available on all plans. Fast mode for Opus 4.8 is now three times cheaper than it was for previous models and runs at 2.5× the standard speed. Opus 4.7 had already introduced effort levels including an xhigh tier, and Opus 4.8 extends and refines that foundation. These are similar features that the likes of Google and OpenAI offer to some degree, although how these sliders and controls work out will differ per vendor.

The Messages API has also been updated to accept system entries inside the messages array. Developers can now update Claude’s instructions mid-task without breaking the prompt cache. Overall, Anthropic really seems to have focused on usability over a flashy new LLM scorecard. Nevertheless, despite numbers being a subjective bunch, we’re getting awfully close to the iterations of Claude 4.x becoming a bit overstretched. Either way, Anthropic has shuffled its choice of benchmarks around so much that no direct comparisons are yet possible from first-party results. Both benchmarks and users suggested particularly large gains made by Opus 4.6 compared to 4.5, meaning we can’t really rely on the numbering to establish Anthropic’s newly reached capabilities.

Mythos still on its way

Anthropic at least says Opus 4.8 is a “modest but tangible” improvement. Larger changes are still ahead. The company is working on cheaper models with similar Opus-class capabilities, and plans to bring Mythos-class models to all customers in the coming weeks. Claude Mythos, described as a step-change above Opus, has so far only been accessible to a small group of cybersecurity organizations as part of Project Glasswing. Anthropic says it is making swift progress on the safety safeguards required for a broader rollout.