Sharing on Mastodon:
Study shows adversarial prompts hidden in poetic verse repeatedly dodge safety checks.
Save
Home
About