Test-Time Immunization: a Universal Defense Framework against Jailbreaks for (Multimodal) Large Language Models
While multimodal large language models LLMs have attracted widespread attention due to their exceptional capabilities, they remain vulnerable to jailbreak attacks. Various defense methods are proposed to defend against jailbreak attacks, however, they are often tailored to specific types of...