· aka: Multimodal Exploit
Attack spans modalities: visual injection in an image → text extraction by vision model → instruction execution by language agent.
Bypass of text-only input filters via image channel.
Multimodal agent pipeline processing images and text.