SecurityLingua: Efficient Defense of LLM Jailbreak Attacks Via Security-Aware Prompt Compression
Large language models (LLMs) have achieved widespread adoption across numerous applications. However, many LLMs remain vulnerable to malicious attacks even after safety alignment. These attacks typically bypass LLMs' safety guardrails by wrapping the original malicious instructions inside adversarial...