Share article
Share article
The leak, first reported elsewhere and now circulating widely across tech media, points to a release that was not meant to be public on March 28. According to the exposed draft, Mythos sits above Anthropic's current Claude lineup and delivers materially stronger results than Claude Opus 4.6 in coding, academic reasoning, and cybersecurity-related tasks. Anthropic also appears to have flagged the model as introducing "unprecedented" cybersecurity risks, which is not the sort of phrase companies use casually in launch copy. Or ideally, in launch copy that is still private. [2]
Enjoy articles without ads?
Register for free and get unlimited access to all articles.
What the leak appears to show
Within that tier, Claude Mythos is described as outperforming Opus 4.6 across several capability areas. Coding and reasoning are expected battlegrounds in frontier AI. Cybersecurity is the more sensitive category, because gains there can be used defensively for auditing and incident response, or offensively for exploit discovery and attack automation. "Dual-use" is the standard term, meaning the same tool can help secure systems or break them. [3]
Why this matters beyond AI gossip
The irony is the story, too
There is a second layer here, and it is hard to ignore. Anthropic's leaked draft reportedly warns about advanced cyber risk, yet the disclosure itself came through a basic content management or caching mistake. That does not tell us anything definitive about the company's model security, but it does expose an operational gap between safety posture and routine web controls. [4]
What comes next
The most likely next step is not a dramatic public launch, but damage control followed by a controlled announcement. Once details are loose, companies usually move to tighten access, scrub exposed assets, and accelerate or revise communications plans. Anthropic may also clarify whether the leaked language reflected final positioning, internal testing notes, or marketing copy that overstated readiness. [5]
Watch for three practical signals:
1. Access model
If Capybara or Mythos launches behind heavy gating, that will reinforce the idea that Anthropic sees it as a higher-risk system. Enterprise-only availability, waitlists, or research-partner access would fit that pattern.
2. Safety language
If Anthropic publicly repeats terms like "unprecedented" cyber risk, that would mark a sharper stance than the usual frontier-model caution boilerplate. If the wording softens, the leak may have exposed an internal framing the company was not ready to defend in public.
3. Competitive pressure
What to watch next
The useful question is not whether Capybara exists. It almost certainly does. The real question is how Anthropic plans to ship it. Keep an eye on whether the company limits access, publishes benchmark data against Opus 4.6, and explains what "unprecedented" cyber risk actually means in operational terms.
Until then, the leak tells us three things: Anthropic appears to have a stronger model in the pipeline, it believes the model raises the stakes in cybersecurity, and it managed to reveal all that through an avoidable web exposure. Very advanced AI, very ordinary mistake.






