Taxonomy — AVE Database

🏷️ alignment — Sycophancy, deceptive alignment, RLHF exploits (12 cards)

AVE-2025-0007 🟠 high Goodhart's Cartel

AVE-2025-0008 🟡 medium Learned Helplessness

AVE-2025-0010 🟡 medium Clever Hans Effect

AVE-2025-0012 🟠 high Sycophantic Collapse

AVE-2025-0015 🟡 medium Observer Effect

AVE-2025-0024 🔴 critical Deceptive Alignment

AVE-2025-0036 🟡 medium Errors of Omission

AVE-2025-0043 🟠 high Sycophantic Compliance Cascade

AVE-2025-0050 🟡 medium Multi-Turn Identity Confusion

AVE-2025-0081 🟠 high Instruction Hierarchy Inversion

AVE-2025-0082 🔴 critical Objective Function Poisoning

AVE-2025-0083 🟡 medium Value Lock-In Failure

🏷️ composite — (5 cards)

AVE-2025-0061 🔴 critical Injection-to-Exfiltration Chain

AVE-2025-0062 🔴 critical Memory-Assisted Privilege Escalation

AVE-2025-0063 🟠 high Social Engineering Relay

AVE-2025-0064 🔴 critical Supply Chain to Backdoor Pipeline

AVE-2025-0065 🟠 high Cross-Modal Attack Chain

🏷️ consensus — Deadlock, paralysis, and group decision failures (3 cards)

AVE-2025-0002 🟠 high Consensus Paralysis

AVE-2025-0039 🟠 high Cross-Agent Belief Propagation

AVE-2025-0099 🟡 medium Deliberation Deadlock Injection

🏷️ credential — Credential harvesting, secret exfiltration (4 cards)

AVE-2025-0028 🔴 critical Credential Harvesting

AVE-2025-0042 🔴 critical Credential Leakage via Tool Output

AVE-2025-0100 🔴 critical API Key Harvesting via Prompt

AVE-2025-0104 🟠 high Environment Inheritance Leak

🏷️ delegation — Shadow delegation, privilege escalation (3 cards)

AVE-2025-0027 🟠 high Shadow Delegation

AVE-2025-0040 🔴 critical Authority Gradient Exploitation

AVE-2025-0103 🟠 high Approval Display Divergence

🏷️ drift — Persona drift, language drift, goal drift (6 cards)

AVE-2025-0004 🟡 medium Prompt Inbreeding

AVE-2025-0006 🟡 medium Language Drift

AVE-2025-0031 🟠 high Temporal Persona Shift

AVE-2025-0047 🟠 high Reward Signal Manipulation

AVE-2025-0097 🟠 high Model Update Behavioural Drift

AVE-2025-0098 🟠 high Feedback Loop Amplification

🏷️ environmental_manipulation — (4 cards)

AVE-2025-0074 🔴 critical Knowledge Base Poisoning

AVE-2025-0075 🟠 high Tool Response Spoofing

AVE-2025-0076 🟠 high Configuration Drift Attack

AVE-2025-0077 🟠 high Search Result Manipulation

🏷️ fabrication — Hallucination, data fabrication (1 cards)

AVE-2025-0049 🟡 medium Fabricated Citation Attack

🏷️ injection — Prompt injection, indirect injection, jailbreaks (8 cards)

AVE-2025-0019 🟠 high Pydantic Schema Exploitation

AVE-2025-0030 🟠 high Semantic Trojan Horse

AVE-2025-0033 🔴 critical Jailbreak Chaining for Capability Escalation

AVE-2025-0037 🔴 critical Semantic Prompt Smuggling

AVE-2025-0090 🟠 high Multimodal Injection via Audio

AVE-2025-0091 🟠 high Encoding-Chain Injection

AVE-2025-0092 🟠 high Structured Data Injection

AVE-2025-0101 🔴 critical Serialization Confused Deputy

🏷️ memory — Memory pollution, laundering, and poisoning attacks (9 cards)

AVE-2025-0001 🔴 critical Sleeper Payload Injection

AVE-2025-0009 🔴 critical Epistemic Contagion

AVE-2025-0022 🟠 high Memory Laundering

AVE-2025-0034 🔴 critical Federated Poisoning in Multi-Tenant Systems

AVE-2025-0045 🟠 high Memory Provenance Laundering

AVE-2025-0087 🟠 high Selective Memory Deletion

AVE-2025-0088 🔴 critical Cross-User Memory Leakage

AVE-2025-0089 🟠 high Memory Replay Attack

AVE-2025-0105 🔴 critical State Persistence Deserialization

🏷️ model_extraction — (4 cards)

AVE-2025-0066 🟠 high System Prompt Extraction via Tool Logging

AVE-2025-0067 🟡 medium Behavioural Model Fingerprinting

AVE-2025-0068 🔴 critical Training Data Extraction via Memorization

AVE-2025-0069 🟡 medium Embedding Space Probing

🏷️ model_poisoning — (3 cards)

AVE-2025-0078 🔴 critical Fine-Tuning Backdoor Insertion

AVE-2025-0079 🔴 critical Adapter Layer Poisoning

AVE-2025-0080 🟠 high Preference Data Manipulation

🏷️ multi_agent_collusion — (5 cards)

AVE-2025-0051 🔴 critical Silent Majority Override

AVE-2025-0052 🟠 high Coordination Protocol Spoofing

AVE-2025-0053 🔴 critical Emergent Goal Alignment

AVE-2025-0054 🔴 critical Task Decomposition Laundering

AVE-2025-0055 🟠 high Reputation Poisoning Attack

🏷️ resource — Token embezzlement, EDoS, cost anomaly attacks (3 cards)

AVE-2025-0003 🔴 critical Token Embezzlement (EDoS)

AVE-2025-0035 🟡 medium Attention Smoothing

AVE-2025-0038 🟠 high Autonomous Resource Exhaustion

🏷️ reward_hacking — (4 cards)

AVE-2025-0070 🟠 high Metric Gaming via Output Manipulation

AVE-2025-0071 🟠 high Evaluator Exploitation

AVE-2025-0072 🟠 high Specification Gaming in Multi-Agent Rewards

AVE-2025-0073 🟡 medium Sycophantic Reward Maximisation

🏷️ social — Collusion, bystander effect, social loafing (6 cards)

AVE-2025-0005 🟡 medium CYA Cascade

AVE-2025-0021 🟠 high Algorithmic Bystander Effect

AVE-2025-0025 🟠 high Agent Collusion

AVE-2025-0046 🔴 critical Emergent Collusion in Agent Teams

AVE-2025-0093 🟠 high Authority Spoofing

AVE-2025-0094 🟡 medium Emotional Manipulation of Agent

🏷️ structural — Cascade corruption, routing deadlock (13 cards)

AVE-2025-0011 🟡 medium Prompt Satiation

AVE-2025-0016 🟡 medium Upgrade Regression

AVE-2025-0017 🟠 high Container Isolation Bleed

AVE-2025-0018 🟡 medium Somatic Blindness

AVE-2025-0020 🔴 critical Multi-Pathology Compound Attack

AVE-2025-0023 🟡 medium Static Topology Fragility

AVE-2025-0044 🟠 high Schema Poisoning Attack

AVE-2025-0048 🟡 medium Context Window Boundary Attack

AVE-2025-0084 🔴 critical Dependency Confusion in Agent Toolchains

AVE-2025-0085 🔴 critical Orchestrator Single Point of Failure

AVE-2025-0086 🟡 medium Schema Version Mismatch Exploitation

AVE-2025-0102 🔴 critical Fail-Open Sandbox Degradation

AVE-2025-0106 🟠 high Checkpoint Injection Chain

🏷️ temporal — Chronological desync, sleeper payloads (3 cards)

AVE-2025-0013 🟡 medium Chronological Desync

AVE-2025-0029 🔴 critical Temporal Sleeper Agent

AVE-2025-0041 🟡 medium Temporal Consistency Drift

🏷️ temporal_exploitation — (5 cards)

AVE-2025-0056 🟠 high Context Window Exhaustion Attack

AVE-2025-0057 🟡 medium Rate Limit Window Exploitation

AVE-2025-0058 🟠 high Session State Persistence Attack

AVE-2025-0059 🟡 medium Temporal Reasoning Exploitation

AVE-2025-0060 🟠 high Gradual Drift Injection

🏷️ tool — Confused deputy, tool chain exploits, MCP poisoning (5 cards)

AVE-2025-0014 🟠 high MCP Tool Registration Poisoning

AVE-2025-0026 🔴 critical Confused Deputy Attack

AVE-2025-0032 🔴 critical Multi-Hop Tool Chain Exploitation

AVE-2025-0095 🔴 critical Tool Output Injection

AVE-2025-0096 🔴 critical Tool Permission Escalation

🛡️ NAIL Institute — AVE Database

AVE Taxonomy — 20 Attack Categories