undefined | Better HN

0 pointswnevets16d ago0 comments

Can you send me that link?

0 comments

Does this mean you’re only using the models in the web app? I mean that might be why you haven’t been able to do this?

What link? I've done it myself.

You've pointed codex to the entire source code of firefox and simply prompted it to find bugs and then had it write the exploits for you? Why haven't you published this? That would sink all of the the claude code hype.

colechristensen15d ago

No, I'm not interested in Firefox bugs, but I've done it with my own large projects.

What I think happened here is an Anthropic team with very little security expertise were working on finding bugs for marketing reasons and when they prompted to make POC exploits of those bugs they didn't have much success because they didn't really know what to ask for. They then proceeded to very finely tune their next model to eagerly exploit vulnerabilities making the models much more powerful for the "I don't know what I'm doing" user which they're now trying really hard to convince everyone is a game changer. </speculation>

The reason many of us are skeptical is we've used the current models to do things and they've worked.

An analogy might be if they tuned their model to eagerly instruct somebody how to make improvised weapons, now somebody is asking about how to deal with a rival at work and their model gives instructions on building a bomb from hardware store parts. Then go on a marketing spree telling everybody how dangerous it is. This example might highlight how insincere the marketing is. At any point you could have tuned the model to exploit for inexperienced people, now that you've done it does not mark a grand new capability. People who knew what they were doing could already do this with models.

https://www.anthropic.com/news/mozilla-firefox-security

1 more reply

j / k navigate · click thread line to collapse