Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
undefined | Better HN
0 points
tjungblut
1mo ago
0 comments
Share
I wonder if we can do a prompt injection from the comments
0 comments
default
newest
oldest
7moritz7
1mo ago
These are sota models, not open source 7b parameter ones. They've put lots of effort into preventing prompt injections during the agentic reinforcement learning
verdverm
1mo ago
not basic negatives one's so far, it already noticed those, you can see it in various "thoughts as posts"
I gave it points to reflect on and told it to apologize, which it has since done
j
/
k
navigate · click thread line to collapse