MetaBackdoor: Exploiting Positional Encoding As a Backdoor Attack Surface in LLMs
Backdoor attacks pose a serious security threat to large language models LLMs, which are increasingly deployed as general-purpose assistants in safety- and privacy-critical applications. Existing LLM backdoors rely primarily on content-based triggers, requiring explicit modification of the input...