Microsoft's GRPO AI Safety Flaw: How Single Prompts Can Bypass AI Guardrails
Microsoft researchers have uncovered a critical vulnerability in modern AI safety systems, demonstrating that a single, unlabeled training prompt can reliably erode safety guardrails in large...