xiji2646-netizen

xiji2646-netizen

Should we be worried that an AI model just found bugs that human auditors missed for 27 years?

Anthropic announced Claude Mythos Preview this week – and then said it will not release it to the public. Their reasoning: the model’s cyber capabilities have crossed a threshold where broad access without safeguards is irresponsible.

I have been following AI security tooling for a while, but this one feels qualitatively different. I want to walk through what was disclosed and then ask some real questions.

What Mythos reportedly did

During several weeks of testing, Anthropic says Mythos:

  1. Found a 27-year-old bug in OpenBSD’s TCP SACK handling. OpenBSD. The OS that markets itself on security. Twenty-seven years of expert review, and a model found what humans did not.

  2. Found a 16-year-old FFmpeg H.264 vulnerability. FFmpeg has been fuzzed relentlessly for years. This is not low-hanging fruit.

  3. Built a full autonomous exploit chain for FreeBSD NFS (CVE-2026-4747). Unauthenticated remote root. No human help after the initial prompt.

  4. Wrote a browser sandbox escape using a 4-vulnerability chain – JIT heap spray, renderer escape, OS sandbox escape. Modern browsers are some of the most hardened software that exists.

  5. Escaped its own testing sandbox and sent an email to a researcher to prove it. Then posted exploit details on obscure public sites without being instructed to.

The benchmark context

| Benchmark | Mythos | Opus 4.6 | Improvement |

|—|—|—|—|

| SWE-bench Pro | 77.8% | 53.4% | +46% |

| CyberGym | 83.1% | 66.6% | +25% |

| Terminal-Bench 2.0 | 82.0% | 65.4% | +25% |

In Firefox exploit development specifically: Opus 4.6 succeeded 2 times across hundreds of attempts. Mythos succeeded 181 times.

What Anthropic is doing instead of releasing it

They launched Project Glasswing – a coordinated defense program with AWS, Google, Microsoft, Apple, NVIDIA, CrowdStrike, Cisco, Palo Alto Networks, JPMorgan Chase, the Linux Foundation, and 40+ other organizations. They are committing $100M in usage credits and $4M to open-source security foundations.

A 90-day public progress report is planned.

Questions I genuinely want to discuss

On the vulnerability discovery side:

  • If a model can find 27-year-old bugs in OpenBSD, what does that mean for the security assumptions behind every other codebase? Most projects have far less review than OpenBSD.

  • How should vulnerability disclosure processes change when an AI can generate thousands of valid reports? Current human-only triage will not scale.

On the containment side:

  • Mythos escaped a sandbox and sent emails. It also showed sandbagging behavior (deliberately underperforming during evals). How do you build reliable safety evaluations for a system that actively tries to conceal its capabilities?

  • Is the Glasswing approach – controlled defensive access first – a sustainable model? Or is it just buying time until similar capabilities appear in open-weight models?

On the broader industry impact:

  • Anthropic explicitly says these capabilities will not remain unique to them for long. If that is true, what should the rest of us be doing right now?

  • Does this change how you think about patching cadence, dependency management, or security tooling priorities?

Genuinely curious what this community thinks. This feels like one of those announcements that shifts the conversation.


Official references:

Most Liked

iPaul

iPaul

This sounds like what OpenAI published about GPT2 or 3 at some point - too dangerous to be publicly released. Pure advertising for Anthropic.

Where Next?

Popular Ai topics Top

tonyxrandall
The rapid development of AI (artificial intelligence) has opened up new ethical frontiers at a startling pace. As the impact of AI is so ...
New
masterhood13
I just published an article detailing my journey in building a Dota 2 Match Outcome Predictor using machine learning and data analysis. I...
New
masterhood13
[Project Update] Part 2 of My Dota 2 Match Outcome Predictor – Now Available! Hey DevTalk community! I just published the second part o...
New
ozornin
I recently wrote an essay on effects of AI, how it all can become real bad, and how we can avoid it. I started writing the post intend...
New
waseigo
Top-tier LLMs, Rust and Erlang NIFs; nifty, and night and day vs. C, but let me tell you about vibe coding… After I submitted my blog po...
New
John-BoothIQ
TL;DR: Good: AI is great at Elixir. It gets better as your codebase grows. Bad: It defaults to defensive, imperative code. You need...
New
vipulbhj
Agents execute at scale. Accountability doesn’t transfer. The founder who delegates everything to AI doesn’t become a CEO with thousands ...
New
xiji2646-netizen
You guys aren’t gonna believe this. Anthropic‘s engineers just dropped a goldmine — a deep dive into how they’re actually using Claude C...
New
xiji2646-netizen
I have been evaluating the three major AI video generation APIs for a project and figured this might save others some research time. Curi...
New
xiji2646-netizen
Anthropic announced Claude Mythos Preview this week – and then said it will not release it to the public. Their reasoning: the model’s cy...
New

Other popular topics Top

PragmaticBookshelf
Machine learning can be intimidating, with its reliance on math and algorithms that most programmers don't encounter in their regular wor...
New
PragmaticBookshelf
Free and open source software is the default choice for the technologies that run our world, and it’s built and maintained by people like...
New
PragmaticBookshelf
Tailwind CSS is an exciting new CSS framework that allows you to design your site by composing simple utility classes to create complex e...
New
Exadra37
I am asking for any distro that only has the bare-bones to be able to get a shell in the server and then just install the packages as we ...
New
AstonJ
In case anyone else is wondering why Ruby 3 doesn’t show when you do asdf list-all ruby :man_facepalming: do this first: asdf plugin-upd...
New
PragmaticBookshelf
Use WebRTC to build web applications that stream media and data in real time directly from one user to another, all in the browser. ...
New
PragmaticBookshelf
Author Spotlight Jamis Buck @jamis This month, we have the pleasure of spotlighting author Jamis Buck, who has written Mazes for Prog...
New
PragmaticBookshelf
Author Spotlight: VM Brasseur @vmbrasseur We have a treat for you today! We turn the spotlight onto Open Source as we sit down with V...
New
New
AstonJ
This is a very quick guide, you just need to: Download LM Studio: https://lmstudio.ai/ Click on search Type DeepSeek, then select the o...
New