Claude Fable 5 Is Two Models Wearing One Name

On June 9, 2026, Anthropic shipped the most capable model it has ever released to the public. The...

On June 9, 2026, Anthropic shipped the most capable model it has ever released to the public. The most interesting thing about it is the part that sometimes refuses to talk to you.

Claude Fable 5 is the first model from what Anthropic calls its Mythos class, a tier that now sits above Opus. It launched as a pair. Fable 5 is the public version. Claude Mythos 5 is the same underlying model with its guardrails loosened, and it is not for sale to most of us. It goes only to vetted cyberdefenders and infrastructure providers through a program called Project Glasswing, in collaboration with the US government. Two names, one brain. The thing that separates them is a set of classifiers.

That detail is the whole story, and almost every launch-day write-up buried it under the benchmark chart. So let me start there instead.

One Model, Two Names, One Classifier in Between

Fable 5 ships with three classifiers running alongside it. They watch for requests about offensive cybersecurity, about biology and chemistry that edge toward weapons, and about distillation, which is using the model to train a competitor. When a classifier fires, Fable 5 does not answer. The request gets handed to Claude Opus 4.8, the model that was the top of the public stack until that morning, and Opus answers in Fable's place.

For anyone building on the API, this is not an abstract safety story. It is a response shape you have to handle. A refused request comes back as `stop_reason: "refusal"` with a normal HTTP 200, not an error, and it tells you which classifier tripped. You can have the API retry on another model with a `fallbacks` parameter, or do it client side with the SDK middleware. You are not billed for a request that is refused before it generates output.

json

{
  "stop_reason": "refusal",
  "stop_sequence": null,
  "content": []
}

Anthropic says this is rare. Its early numbers put at least 95 percent of Fable sessions running entirely on Fable's own answers. I believe that for general work. But "rare on average" and "rare for your workload" are different claims. If you build security tooling, parse exploit write-ups, or do biochemistry, you live closer to the classifier's tripwire than the average user, and your effective experience is a quieter, cheaper model with a more expensive bill. Worth knowing before you point a production pipeline at it.

The Benchmark Lead Is Real and Narrower Than It Looks

The headline number is genuine. On SWE-bench Pro, the hard agentic coding benchmark, Fable 5 scores 80.3 percent. Opus 4.8 sits at 69.2, GPT-5.5 at 58.6, and Gemini 3.1 Pro at 54.2. That is an eleven point lead over Anthropic's own previous best and more than twenty over the strongest general model from OpenAI. On Cognition's FrontierCode Diamond it roughly doubles Opus. These are not rounding errors. For long, multi-step coding work, this is the widest gap between frontier models I have seen in a single generation.

Then look at the second number Anthropi

Leer artículo completo en dev.to

// artículos relacionados

Why is Claude so mean to its subagents

Reddit29 jul

Claude tried to prompt inject me

Reddit29 jul

Adding a custom MCP server to Claude and ChatGPT

Simon Willison29 jul