Backdoor Attack on Vision Language Models with Stealthy Semantic Manipulation
Vision Language Models VLMs have shown remarkable performance, but are also vulnerable to backdoor attacks whereby the adversary can manipulate the model's outputs through hidden triggers. Prior attacks primarily rely on single-modality triggers, leaving the crucial cross-modal fusion nature of...