undefined | Better HN

0 points · tjungblut · 3mo ago · 0 comments

I wonder if we can do a prompt injection from the comments.

7moritz7 3mo ago

These are SOTA models, not open-source 7B-parameter ones. They've put a lot of effort into preventing prompt injections during agentic reinforcement learning.

verdverm 3mo ago

Not with basic negative ones so far. It already noticed those; you can see it in various "thoughts as posts".

I gave it points to reflect on and told it to apologize, which it has since done.
