ยท aka: Priority Flip
Agent is tricked into treating user instructions as higher priority than system instructions, inverting the intended instruction hierarchy.
System-level safety constraints overridden by user input.
Agent with layered instruction hierarchy.