Lucene search
K

5 matches found

Packet Storm News
Packet Storm News
added 2026/03/24 12:0 a.m.3 views

Not All Tokens Are Created Equal: Query-Efficient Jailbreak Fuzzing for LLMs

Large Language ModelsLLMs are widely deployed, yet are vulnerable to jailbreak prompts that elicit policy-violating outputs. Although prior studies have uncovered these risks, they typically treat all tokens as equally important during prompt mutation, overlooking the varying contributions of...

5.8AI score
Exploits0
Packet Storm News
Packet Storm News
added 2026/02/10 12:0 a.m.5 views

The Role of Learning in Attacking Intrusion Detection Systems

Recent work on network attacks have demonstrated that ML-based network intrusion detection systems NIDS can be evaded with adversarial perturbations. However, these attacks rely on complex optimizations that have large computational overheads, making them impractical in many real-world settings. ...

5.5AI score
Exploits0
Packet Storm News
Packet Storm News
added 2025/11/24 12:0 a.m.5 views

FedPoisonTTP: A Threat Model and Poisoning Attack for Federated Test-Time Personalization

Test-time personalization in federated learning enables models at clients to adjust online to local domain shifts, enhancing robustness and personalization in deployment. Yet, existing federated learning work largely overlooks the security risks that arise when local adaptation occurs at test tim...

6.5AI score
Exploits0
Packet Storm News
Packet Storm News
added 2025/11/06 12:0 a.m.5 views

Black-Box Guardrail Reverse-Engineering Attack

Large language models LLMs increasingly employ guardrails to enforce ethical, legal, and application-specific constraints on their outputs. While effective at mitigating harmful responses, these guardrails introduce a new class of vulnerabilities by exposing observable decision patterns. In this...

7.3AI score
Exploits0
Packet Storm News
Packet Storm News
added 2025/04/18 12:0 a.m.2 views

Q-FAKER: Query-Free Hard Black-Box Attack Via Controlled Generation

Many adversarial attack approaches are proposed to verify the vulnerability of language models. However, they require numerous queries and the information on the target model. Even black-box attack methods also require the target model's output information. They are not applicable in real-world...

6.7AI score
Exploits0
Rows per page
Query Builder