Lucene search
K

88 matches found

Packet Storm News
Packet Storm News
added 5 days ago4 views

A New Framework for Cybersecurity Refusals in AI Agents

Agentic scaffolds have dramatically improved LLM performance on complex, long-horizon tasks, yielding both broad benefits and amplified risks in domains like cybersecurity. Existing benchmarks for AI agents in cybersecurity focus mainly on measuring proficiency--how effectively agents can complet...

5.8AI score
Exploits0
Packet Storm News
Packet Storm News
added 2026/05/27 12:0 a.m.6 views

Refusal Before Decoding: Detecting and Exploiting Refusal Signals in Intermediate LLM Activations

In this paper, we investigate whether refusal behavior can be predicted from LLM intermediate activations before decoding using linear probes trained on residual stream activations at each transformer block. We find that refusal is linearly decodable well before the final layer, indicating that...

5.7AI score
Exploits0
CNNVD
CNNVD
added 2026/05/27 12:0 a.m.5 views

Linux kernel 安全漏洞

The Linux kernel is the core of the open-source operating system Linux, developed by the Linux Foundation in the United States. There is a security vulnerability in the Linux kernel, which stems from the smcclcwaitmsg function accessing the link group state prematurely when a CLC refusal occurs...

7.5CVSS5.8AI score0.00068EPSS
Exploits0References5
AstraLinux
AstraLinux
added 2026/05/20 5:53 a.m.3 views

Astra Linux - уязвимость в linux-5.15, linux-6.1

In the Linux kernel, the following vulnerability has been resolved: remoteproc: mediatek: Ensure that the IPI buffer fits within the L2TCM. The location of the IPI buffer is determined from the firmware that we load into the System Companion Processor. It is not guaranteed that both the SRAM size...

5.5CVSS6.2AI score0.00022EPSS
Exploits0References2
Packet Storm News
Packet Storm News
added 2026/05/19 12:0 a.m.5 views

Refusal Evaluation in Coding LLMs and Code Agents: A Systematic Review of Thirteen Malicious-Code Prompt Corpora (2023-2025)

The evaluation of large language model refusal on malicious-coding tasks now spans at least thirteen publicly released prompt corpora AdvBench, the CyberSecEval family, RMCBench, RedCode, MCGMark, JailbreakBench, CySecBench, MalwareBench, CIRCLE, MOCHA, ASTRA, Scam2Prompt / Innoc2Scam-bench, and...

5.8AI score
Exploits0
Packet Storm News
Packet Storm News
added 2026/05/17 12:0 a.m.4 views

Ablating Safety: Mechanisms for Removing Alignment in Language Models for Security Applications

Safety-aligned language models often refuse cybersecurity requests whose wording resembles misuse, even when the task is authorized and defensive. This makes security evaluation ambiguous: a failed answer may reflect missing capability or refusal-policy intervention. Ablating Safety studies...

5.8AI score
Exploits0
Packet Storm News
Packet Storm News
added 2026/05/02 12:0 a.m.1 views

Tracing the Dynamics of Refusal: Exploiting Latent Refusal Trajectories for Robust Jailbreak Detection

Representation Engineering typically relies on static refusal vectors derived from terminal representations. We move beyond this paradigm, demonstrating that refusal is a dynamic and sparse process rather than a localized outcome. Using Causal Tracing, we uncover the Refusal Trajectory-a persiste...

5.8AI score
Exploits0
Packet Storm News
Packet Storm News
added 2026/03/24 12:0 a.m.0 views

Not All Tokens Are Created Equal: Query-Efficient Jailbreak Fuzzing for LLMs

Large Language ModelsLLMs are widely deployed, yet are vulnerable to jailbreak prompts that elicit policy-violating outputs. Although prior studies have uncovered these risks, they typically treat all tokens as equally important during prompt mutation, overlooking the varying contributions of...

5.8AI score
Exploits0
Packet Storm News
Packet Storm News
added 2026/03/15 12:0 a.m.0 views

Activation Surgery: Jailbreaking White-Box LLMs without Touching the Prompt

Most jailbreak techniques for Large Language Models LLMs primarily rely on prompt modifications, including paraphrasing, obfuscation, or conversational strategies. Meanwhile, abliteration techniques also known as targeted ablations of internal components have been used to study and explain LLM...

5.9AI score
Exploits0
Packet Storm News
Packet Storm News
added 2026/01/07 12:0 a.m.2 views

RedBench: A Universal Dataset for Comprehensive Red Teaming of Large Language Models

As large language models LLMs become integral to safety-critical applications, ensuring their robustness against adversarial prompts is paramount. However, existing red teaming datasets suffer from inconsistent risk categorizations, limited domain coverage, and outdated evaluations, hindering...

7AI score
Exploits0
Apple
Apple
added 2025/11/13 12:0 a.m.11 views

About the security content of Compressor 4.11.1

About the security content of Compressor 4.11.1 This document describes the security content of Compressor 4.11.1. About Apple security updates For our customers' protection, Apple doesn't disclose, discuss, or confirm security issues until an investigation has occurred and patches or releases ar...

8.8CVSS7.4AI score0.00062EPSS
Exploits0References1Affected Software1
SUSE CVE
SUSE CVE
added 2025/10/24 12:57 a.m.2 views

SUSE CVE-2023-53695

In the Linux kernel, the following vulnerability has been resolved: udf: Detect system inodes linked into directory hierarchy When UDF filesystem is corrupted, hidden system inodes can be linked into directory hierarchy which is an avenue for further serious corruption of the filesystem and kerne...

6.3CVSS6.4AI score0.00044EPSS
Exploits0References8
OSV
OSV
added 2025/10/22 2:15 p.m.1 views

UBUNTU-CVE-2023-53695

In the Linux kernel, the following vulnerability has been resolved: udf: Detect system inodes linked into directory hierarchy When UDF filesystem is corrupted, hidden system inodes can be linked into directory hierarchy which is an avenue for further serious corruption of the filesystem and kerne...

5.9AI score0.00044EPSS
Exploits0References10
EUVD
EUVD
added 2025/10/07 12:30 a.m.2 views

EUVD-2017-17963

Malware in sbrugna...

7.5CVSS7.6AI score0.02101EPSS
Exploits0References8
EUVD
EUVD
added 2025/10/07 12:30 a.m.4 views

EUVD-2008-5375

Malware in sbrugna...

9.3CVSS6.1AI score0.00838EPSS
Exploits0References8
EUVD
EUVD
added 2025/10/03 8:7 p.m.1 views

EUVD-2024-54426

Malicious code in bioql PyPI...

3.7CVSS4.7AI score0.00071EPSS
Exploits0References4
Packet Storm News
Packet Storm News
added 2025/09/18 12:0 a.m.2 views

Beyond Surface Alignment: Rebuilding LLMs Safety Mechanism Via Probabilistically Ablating Refusal Direction

Jailbreak attacks pose persistent threats to large language models LLMs. Current safety alignment methods have attempted to address these issues, but they experience two significant limitations: insufficient safety alignment depth and unrobust internal defense mechanisms. These limitations make...

7.3AI score
Exploits0
Microsoft CVE
Microsoft CVE
added 2025/09/04 5:42 a.m.2 views

ACPICA: Refuse to evaluate a method if arguments are missing

...

5.5CVSS7AI score0.00051EPSS
Exploits0
Tenable Nessus
Tenable Nessus
added 2025/08/11 12:0 a.m.3 views

Linux Distros Unpatched Vulnerability : CVE-2024-27059

The Linux/Unix host has one or more packages installed that are impacted by a vulnerability without a vendor supplied patch available. - USB: usb-storage: Prevent divide-by-0 error in isd200atacommand The isd200 sub-driver in usb-storage uses the HEADS and SECTORS values in the ATA ID information...

5.5CVSS6.7AI score0.00007EPSS
Exploits0References3
Packet Storm News
Packet Storm News
added 2025/07/28 12:0 a.m.1 views

Enhancing Jailbreak Attacks on LLMs Via Persona Prompts

Jailbreak attacks aim to exploit large language models LLMs by inducing them to generate harmful content, thereby revealing their vulnerabilities. Understanding and addressing these attacks is crucial for advancing the field of LLM safety. Previous jailbreak approaches have mainly focused on dire...

7.6AI score
Exploits0
Rows per page
Query Builder