3 matches found
JPRO: Automated Multimodal Jailbreaking Via Multi-Agent Collaboration Framework
The widespread application of large VLMs makes ensuring their secure deployment critical. While recent studies have demonstrated jailbreak attacks on VLMs, existing approaches are limited: they require either white-box access, restricting practicality, or rely on manually crafted patterns, leadin...
CHAI: Command Hijacking against Embodied AI
Embodied Artificial Intelligence AI promises to handle edge cases in robotic vehicle systems where data is scarce by using common-sense reasoning grounded in perception and action to generalize beyond training distributions and adapt to novel real-world situations. These capabilities, however, al...
The Security Threat of Compressed Projectors in Large Vision-Language Models
The choice of a suitable visual language projector VLP is critical to the successful training of large visual language models LVLMs. Mainstream VLPs can be broadly categorized into compressed and uncompressed projectors, and each offering distinct advantages in performance and computational...