Policy Puppetry: Unveiling a Universal Vulnerability in Large Language Models
Introduction

Recent research has unveiled a significant vulnerability in Large Language Models (LLMs), termed "Policy Puppetry." This technique allows adversaries to bypass safety mechanisms across...