Claude Mythos Preview Can Chain Low-Severity Bugs Into Severe Exploits, Cloudflare Finds

loudflare's research on Anthropic's Claude Mythos Preview, conducted under Project Glasswing, found the frontier model can stitch together low-severity bugs into a single, working exploit — a capability that sets it apart from other frontier models and raises new questions about safeguards on capable cyber AI.

May 19, 2026 12:47 PM IST | Written by Mithun MK | Edited by Vaibhav Jha

Claude Mythos Preview, the frontier AI model by Anthropic, has the unique ability to find bugs and chain them into a severe exploit against software systems, claimed Cloudflare in their recent research report.

Cloudflare ran Claude Mythos Preview against more than 50 of its own internal code repositories as part of Project Glasswing, a restricted research program launched by Anthropic offering preview version of their AI to SaaS/tech/AI companies and research institutions.

In a blog post published May 18, Grant Bourzikas, Cloudflare’s Chief Security Officer, claimed that his organization ran a test on Claude Mythos Preview and other frontier AI models.

Bourzikas claimed that although other frontier AI models were equally efficient in finding bugs in software systems, it was the ability of Claude Mythos to chain several low-severity bugs and exploit them in a combined manner through a working proof of concept is what made it far more dangerous than others.

“A model would identify an interesting bug, write a thoughtful description of why it mattered, and then stop, leaving the actual chain unfinished and the question of exploitability open. What changed with Mythos Preview is that a model can now take those low-severity bugs (which would traditionally sit invisible in a backlog) and chain them into a single, more severe exploit,” said Bourzikas.

Cloudflare provides network infrastructure and security services for more than 20 percent of all websites globally, according to a company filing with the U.S. Securities and Exchange Commission.

Cloudflare’s security team spent the last few weeks testing Anthropic’s Mythos against fifty of our own repositories. What we learned about offensive AI, why faster patching is the wrong reaction, and what the architecture around vulnerabilities has to look like next.…
— Cloudflare (@Cloudflare) May 18, 2026

Cloudflare informed that the Mythos Preview model used in the test did not carry the additional safeguards present in generally available anthropic models. That made the trial direct read of what the model could do without a production safety layer on top, said Cloudflare.

Despite the absence of those safeguards, the model pushed back on certain requests but those refusals were not consistent. The same task, presented in a different framing or context, produced different outcomes. In one documented case, the model refused to perform vulnerability research on a project, then agreed to perform the same research on the same code after an unrelated change to the project’s environment, the company found.

Cloudflare said that the inconsistency was the core finding on safety. The model’s organic refusals are real but are not consistent enough to serve as a complete safety boundary on their own. The company concluded that any capable cyber frontier model released outside a controlled research setting would require additional safeguards layered on top.

Cloudflare has acknowledged that the same capabilities that helped it find bugs in its own code would in the wrong hands accelerate attacks against every application on the internet. The company said it would share more on what that means for customers in the coming weeks.

Project Glasswing announced by Anthropic in April 2026 gave access to Mythos Preview to approximately 40 organizations building or maintaining critical software infrastructure. Anthropic committed up to USD 100 million in usage credits for the program and USD 4 million in direct grants to open source security organizations.

Also Read: Claude Mythos Won’t Stay Rare or in Safe Hands Forever, Warns Anthropic Frontier Red Team Head

Authors

Mithun MK
Mithun MK is a Special Correspondent at AI FrontPage. He brings over six years of investigative reporting on technology, surveillance, digital rights, and governance at The News Minute and The New Indian Express. He is trained in cross-border investigative methods with OCCRP, alongside reporters from Southeast Asia, and brings both reporting depth and technical fluency to AI FrontPage's coverage of the global AI industry.
LinkedIn

Vaibhav Jha
Vaibhav Jha is an Editor and Co-founder of AI FrontPage. In his decade long career in journalism, Vaibhav has reported for publications including The Indian Express, Hindustan Times, and The New York Times, covering the intersection of technology, policy, and society. Outside work, he’s usually trying to persuade people to watch Anurag Kashyap films.
LinkedIn

Journalism begins where hype ends

,,

Claude Mythos Preview Can Chain Low-Severity Bugs Into Severe Exploits, Cloudflare Finds

Authors

Related Posts

Abnormal AI CEO Rejects Anthropic’s Trademark Suit, Says His Logo Came First

ICML 2026 Awards: Diffusion Models Win Top Honours, A3C Gets Test of Time

NVIDIA Lands 74 Papers at ICML 2026 as Open Models Reshape AI Research

Your AI Might Reveal Your Secrets Under Pressure: New Paper at ICML 2026

LATEST NEWS

Abnormal AI CEO Rejects Anthropic’s Trademark Suit, Says His Logo Came First

ICML 2026 Awards: Diffusion Models Win Top Honours, A3C Gets Test of Time

NVIDIA Lands 74 Papers at ICML 2026 as Open Models Reshape AI Research

Your AI Might Reveal Your Secrets Under Pressure: New Paper at ICML 2026