Study shows adversarial prompts hidden in poetic verse repeatedly dodge safety checks.