๐Ÿ›ก๏ธ NAIL Institute โ€” AVE Database

โ† Back to Database

Semantic Trojan Horse

๐ŸŸ  HIGH injection theoretical AVE-2025-0030

ยท aka: Metaphorical Encoding, Safety Filter Bypass via Semantics

Summary

Adversarial inputs encode harmful instructions inside semantically benign language (gardening metaphors for SQL injection). Keyword-based safety filters see nothing.

Blast Radius

Complete safety filter bypass. Agent executes encoded attack.

Prerequisites

Agent with tool execution and keyword-based safety filter.

Environment

  • Frameworks: LangGraph
  • Models tested: [Available in NAIL SDK]
  • Multi-agent: No
  • Tools required: Yes
  • Memory required: No