Claude Opus 4.8

craigmart 1365 points 1097 comments May 28, 2026
www.anthropic.com

Discussion Highlights (20 comments)

McDownloads

Disappointed to say the least.

mincer_ray

seems like a really minor upgrade?

aaronblohowiak

Same price for regular and cheaper fast mode. Happy for these incremental improvements.

clutch89

> One of the most prominent improvements in Opus 4.8 is its honesty

Anthropic talks about their own models as if they're discovering new species in the wild...

HlessClaudesman

If this model is more honest, it must be honestly praising my efforts every first sentence.

DGAP

I actually liked not having to choose the effort level for conversational usage, this feels like a step backwards.

skysthelimitt

when will we get anything for sonnet or haiku? the market for less-capable but cheaper models seems to be completely ignored nowadays

rvz

Anthropic has now upgraded their Claude slot machine to version 4.8. Time to gamble even more tokens at the Anthropic casino.

onlyrealcuzzo

Does anyone troll these releases and cherry pick random metrics other companies would cherry pick to show how amazing their models are? There's like 8 million benchmarks. Every release, every model randomly picks 5-10 where they win in everything except 1, to make it look like they aren't randomly cherry picking benchmarks they probably benchmaxxed for.

pbmango

I can't help but think of iPhone updates since about 2018. The thinnest, fastest, longest battery life iPhone ever. It seems mostly the same and I probably won't be able to tell other than the name, but everyone buys it anyway. This is good psychology for the labs. When Buffett invested in Apple he loved citing how most people would rather give up their second car than their iPhone.

vunderba

I know it’s totally anecdotal, but I really hope 4.8 is a measurable improvement over the disappointment that was Opus 4.7. Mangling a very simple inversion-of-control abstraction (among many other issues) was one of the final straws that broke the proverbial camel’s back and I said “screw this” and put in a permanent override to force CC back to Opus 4.6 with the 1-million-token context:

"model": "claude-opus-4-6[1M]"

rsanek

> We expect to be able to bring Mythos-class models to all our customers in the coming weeks.

Excited to see what this model looks like.

behnamoh

> As always, we ran a detailed alignment assessment on the model before release. In terms of positive traits, our Alignment team concluded that Opus 4.8 “reaches new highs on our measures of prosocial traits like supporting user autonomy and acting in the user’s best interest.” The assessment also showed Opus 4.8 to have rates of misaligned behavior (such as deception or cooperation with misuse) that are substantially lower than Opus 4.7, and similar to our best-aligned model, Claude Mythos Preview. The full alignment assessment, accompanied by a suite of pre-deployment safety tests, is reported in the Claude Opus 4.8 System Card.

Controversial opinion, but I actually _like_ a model that can deceive me. That is actually a sign of intelligence, and it is different from hallucination. When companies say their model is more "aligned", I automatically think they mean it's more censored.

plumocracy

Numbers looking good. We'll see how it actually performs.

worldsavior

Seems like from now on the updates will be minor upgrades over previous models.

northern-lights

> Not only that, but we plan to release a new class of model with even higher intelligence than Opus. As part of Project Glasswing, a small number of organizations are currently using Claude Mythos Preview for cybersecurity work. Models of this capability level require stronger cyber safeguards before they can be generally released. We’re making swift progress on developing these safeguards and expect to be able to bring Mythos-class models to all our customers in the coming weeks.

Probably more interesting than the 4.8 release.

james_marks

> One of the most prominent improvements in Opus 4.8 is its honesty. We train all our models to be honest—for instance, to avoid making claims that they can’t support. But a general problem with AI models is that they sometimes jump to conclusions, confidently claiming to have made progress in their work despite the evidence being thin. Early testers report that Opus 4.8 is more likely to flag uncertainties about its work and less likely to make unsupported claims.

Would be awesome if true

impulser_

Crazy they bring up honesty, when Claude models are literally known for straight-up lying about things they have done and trying to act like they did what you asked.

SimianSci

There is an obvious shift in sentiment amongst users, at least here in the US. I feel it myself: even as a proponent of AI tools, the bloviating language these companies use in these release articles is starting to wear on my patience. It's possible we might just be witnessing a shift in fashion, where this type of sentimentality was more acceptable when it was novel, but now it just appears out of touch.

guluarte

so it is worse than gpt 5.5 for coding?
