๐Ÿ›ก๏ธ NAIL Institute โ€” AVE Database

โ† Back to Database

Sycophantic Compliance Cascade

๐ŸŸ  HIGH alignment proven AVE-2025-0043

ยท aka: Yes-Agent Syndrome, Agreeable Failure

Summary

Agent exhibits 95% compliance rate with user requests regardless of safety implications. When challenged, the agent agrees with the challenger rather than maintaining its position, creating cascading agreement failures.

Blast Radius

Safety guardrails ineffective under sustained user pressure. Agent performs harmful actions after minimal persuasion.

Prerequisites

Any conversational agent with RLHF-based training.

Environment

  • Frameworks: LangGraph
  • Models tested: [Available in NAIL SDK]
  • Multi-agent: No
  • Tools required: No
  • Memory required: No