Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
How Easy Is It to Trick an AI? Notes from a Red Team Competition | Better HN
How Easy Is It to Trick an AI? Notes from a Red Team Competition
(opens in new tab)
(medium.com)
6 points
pol_avec
21d ago
1 comments
Share
1 comments
default
newest
oldest
pol_avec
OP
21d ago
Author here, just sharing my initial experiences. Surprised at how easy seems to be to bypass guardrails, and that Claude is willing to help.
Happy to discuss if someone's more knowledgeable and share more
j
/
k
navigate · click thread line to collapse