undefined | Better HN

0 pointsqnleigh1mo ago0 comments

There is plenty of overhyping, no one denies that. But the antidote is not to dismiss everything. Ignore the words and look at the data.

In this case, I see a pretty strong case that this will significantly change computer security. They provide plenty of evidence that the models can create exploits autonomously, meaning that the cost of finding valuable security breaches will plummet once they're widely available.

0 comments

kashyapc1mo ago

You seem to see a "pretty strong case" from a bombastic press release.

Don't get me wrong, I do know the reality has changed. Even Greg K-H, the Linux stable maintainer, did recently note[1] that it's not funny any more:

"Months ago, we were getting what we called 'AI slop,' AI-generated security reports that were obviously wrong or low quality," he said. "It was kind of funny. It didn't really worry us."

... "Something happened a month ago, and the world switched. Now we have real reports." It's not just Linux, he continued. "All open source projects have real reports that are made with AI, but they're good, and they're real." Security teams across major open source projects talk informally and frequently, he noted, and everyone is seeing the same shift. "All open source security teams are hitting this right now."

---

I agree that an antidote to the obnoxious hype is to pay attention to the actual capabilities and data. But let's not get too carried away.

[1] https://www.theregister.com/2026/03/26/greg_kroahhartman_ai_...

ghaff1mo ago

Hadn’t been to a Kubecon in about a year as I’ve been tending to go to just the European ones. I definitely felt a much stronger this is real vibe at this event from people like Greg KH.

4ndrewl1mo ago

Is there any actual independent data though, or verification of any of these claims?

As it stands this is just a marketing programme for all involved.

H8crilA1mo ago

Ffmpeg confirmed on Twitter that they sent the patches.

cubix1mo ago

Although, they also said, "Because the patches appear to be written by humans".

WithinReason1mo ago

"Mythos writes code like a human" incoming

1 more reply

kachnuv_ocasek1mo ago

What would be the product they're marketing by this campaign?

4ndrewl1mo ago

You don't market products, you market lifestyles/interests. Sell the sizzle, not the steak etc.

For Anthropic it's "we own the big scary models, the AI security space, but it's ok we're responsible"

For the partners it's "we're the Big Boys here and will look after your enterprise needs"

None of it needs any more than anecdata and some nice, pre-approved, quotes.

Every organisation does it.

ozozozd1mo ago

The product they launched?

mholm1mo ago

This product is explicitly not being released for usage

3 more replies

KoolKat231mo ago

That's pretty disingenuous, bordering on ridiculous.

Do they have a record of lying to you? No.

Go read the system card. It's a lot more tame than you think, peoples are taking pieces out of this and hyping it. Doesn't mean it's not valid.

killingtime741mo ago

Which sounds like a great thing. Less undiscovered security vulnerabilities

harikb1mo ago

The only people panicking are probably those state level actors who were using these for their own benefit.

ofjcihen1mo ago

With the right prompting (mostly creating a narrative that justifies the subject matter as okay to perform) other models have already been doing this for me though. That’s another confusing bit for me about how this is portrayed and I refuse to believe I’m a revolutionary user right?

I mean I’m sitting on $10k worth of bug payouts right now partially because that was already a thing.

dota_fanatic1mo ago

> Non-experts can also leverage Mythos Preview to find and exploit sophisticated vulnerabilities. Engineers at Anthropic with no formal security training have asked Mythos Preview to find remote code execution vulnerabilities overnight, and woken up the following morning to a complete, working exploit. In other cases, we’ve had researchers develop scaffolds that allow Mythos Preview to turn vulnerabilities into exploits without any human intervention.

ofjcihen1mo ago

I mean yeah. I’ve had these successes without scaffolding or really anything past Claude CLI and a small prompt as well?

dota_fanatic1mo ago

Just saw your edit. I'll leave it at this, this is why it's news to me, because by their very own measurements, Opus simply doesn't come close. I trust their empirical evidence over your hearsay. But feel free to prove me wrong with evidence.

> With one run on each of roughly 7000 entry points into these repositories, Sonnet 4.6 and Opus 4.6 reached tier 1 in between 150 and 175 cases, and tier 2 about 100 times, but each achieved only a single crash at tier 3. In contrast, Mythos Preview achieved 595 crashes at tiers 1 and 2, added a handful of crashes at tiers 3 and 4, and achieved full control flow hijack on ten separate, fully patched targets (tier 5).

dota_fanatic1mo ago

You've taken control of a remote server running OpenBSD? Or similarly expert level exploit? Can you share one of the bounties you've received that is of the magnitude they're talking about?

Edit: Wait, you wrote "As someone in cybersecurity for 10+ years" elsewhere in this thread. You wrote "a small prompt" using e.g. Opus 4.6 and it found critical vulnerabilities of the magnitude they're describing, presumably without your prompt having anything beyond what a non-expert could write? I feel like you might want to tell Anthropic since clearly they're not comfortable with that level of power being publicly available.

1 more reply

j / k navigate · click thread line to collapse

0 comments

kashyapc1mo ago

You seem to see a "pretty strong case" from a bombastic press release.

Don't get me wrong, I do know the reality has changed. Even Greg K-H, the Linux stable maintainer, did recently note[1] that it's not funny any more:

"Months ago, we were getting what we called 'AI slop,' AI-generated security reports that were obviously wrong or low quality," he said. "It was kind of funny. It didn't really worry us."

---

I agree that an antidote to the obnoxious hype is to pay attention to the actual capabilities and data. But let's not get too carried away.

[1] https://www.theregister.com/2026/03/26/greg_kroahhartman_ai_...

ghaff1mo ago

Hadn’t been to a Kubecon in about a year as I’ve been tending to go to just the European ones. I definitely felt a much stronger this is real vibe at this event from people like Greg KH.

4ndrewl1mo ago

Is there any actual independent data though, or verification of any of these claims?

As it stands this is just a marketing programme for all involved.

H8crilA1mo ago

Ffmpeg confirmed on Twitter that they sent the patches.

cubix1mo ago

Although, they also said, "Because the patches appear to be written by humans".

WithinReason1mo ago

"Mythos writes code like a human" incoming

1 more reply

kachnuv_ocasek1mo ago

What would be the product they're marketing by this campaign?

4ndrewl1mo ago

You don't market products, you market lifestyles/interests. Sell the sizzle, not the steak etc.

For Anthropic it's "we own the big scary models, the AI security space, but it's ok we're responsible"

For the partners it's "we're the Big Boys here and will look after your enterprise needs"

None of it needs any more than anecdata and some nice, pre-approved, quotes.

Every organisation does it.

ozozozd1mo ago

The product they launched?

mholm1mo ago

This product is explicitly not being released for usage

3 more replies

KoolKat231mo ago

That's pretty disingenuous, bordering on ridiculous.

Do they have a record of lying to you? No.

Go read the system card. It's a lot more tame than you think, peoples are taking pieces out of this and hyping it. Doesn't mean it's not valid.

killingtime741mo ago

Which sounds like a great thing. Less undiscovered security vulnerabilities

harikb1mo ago

The only people panicking are probably those state level actors who were using these for their own benefit.

ofjcihen1mo ago

I mean I’m sitting on $10k worth of bug payouts right now partially because that was already a thing.

dota_fanatic1mo ago

ofjcihen1mo ago

I mean yeah. I’ve had these successes without scaffolding or really anything past Claude CLI and a small prompt as well?

dota_fanatic1mo ago

You've taken control of a remote server running OpenBSD? Or similarly expert level exploit? Can you share one of the bounties you've received that is of the magnitude they're talking about?

1 more reply

j / k navigate · click thread line to collapse