Lucene search
K

62 matches found

Packet Storm News
Packet Storm News
added 2026/05/11 12:0 a.m.4 views

LITMUS: Benchmarking Behavioral Jailbreaks of LLM Agents in Real OS Environments

The rapid proliferation of LLM-based autonomous agents in real operating system environments introduces a new category of safety risk beyond content safety: behavior jailbreak, where an adversary induces an agent to execute dangerous OS-level operations with irreversible consequences. Existing...

5.9AI score
Exploits0
Packet Storm News
Packet Storm News
added 2026/04/09 12:0 a.m.2 views

ACIArena: Toward Unified Evaluation for Agent Cascading Injection

Collaboration and information sharing empower Multi-Agent Systems MAS but also introduce a critical security risk known as Agent Cascading Injection ACI. In such attacks, a compromised agent exploits inter-agent trust to propagate malicious instructions, causing cascading failures across the...

5.8AI score
Exploits0
Packet Storm News
Packet Storm News
added 2026/04/03 12:0 a.m.1 views

A Systematic Security Evaluation of OpenClaw and Its Variants

Tool-augmented AI agents substantially extend the practical capabilities of large language models, but they also introduce security risks that cannot be identified through model-only evaluation. In this paper, we present a systematic security assessment of six representative OpenClaw-series agent...

6AI score
Exploits0
Packet Storm News
Packet Storm News
added 2026/03/24 12:0 a.m.2 views

Towards Leveraging LLMs to Generate Abstract Penetration Test Cases from Software Architecture

Software architecture models capture early design decisions that strongly influence system quality attributes, including security. However, architecture-level security assessment and feedback are often absent in practice, allowing security weaknesses to propagate into later phases of the software...

5.7AI score
Exploits0
Packet Storm News
Packet Storm News
added 2026/02/23 12:0 a.m.3 views

CodeHacker: Automated Test Case Generation for Detecting Vulnerabilities in Competitive Programming Solutions

The evaluation of Large Language Models LLMs for code generation relies heavily on the quality and robustness of test cases. However, existing benchmarks often lack coverage for subtle corner cases, allowing incorrect solutions to pass. To bridge this gap, we propose CodeHacker, an automated agen...

6AI score
Exploits0
NVD
NVD
added 2026/02/04 5:16 p.m.4 views

CVE-2026-23077

In the Linux kernel, the following vulnerability has been resolved: mm/vma: fix anonvma UAF on mremap faulted, unfaulted merge Patch series "mm/vma: fix anonvma UAF on mremap faulted, unfaulted merge", v2. Commit 879bca0a2c4f "mm/vma: fix incorrectly disallowed anonymous VMA merges" introduced th...

7.8CVSS0.00019EPSS
Exploits0References2
Packet Storm News
Packet Storm News
added 2025/12/18 12:0 a.m.3 views

AutoDFBench 1.0: A Benchmarking Framework for Digital Forensic Tool Testing and Generated Code Evaluation

The National Institute of Standards and Technology NIST Computer Forensic Tool Testing CFTT programme has become the de facto standard for providing digital forensic tool testing and validation. However to date, no comprehensive framework exists to automate benchmarking across the diverse forensi...

7.3AI score
Exploits0
Packet Storm News
Packet Storm News
added 2025/12/01 12:0 a.m.9 views

BackportBench: A Multilingual Benchmark for Automated Backporting of Patches

Many modern software projects evolve rapidly to incorporate new features and security patches. It is important for users to update their dependencies to safer versions, but many still use older, vulnerable package versions because upgrading can be difficult and may break their existing codebase...

6.9AI score
Exploits0
Packet Storm News
Packet Storm News
added 2025/09/28 12:0 a.m.3 views

SafeSearch: Automated Red-Teaming for the Safety of LLM-Based Search Agents

Search agents connect LLMs to the Internet, enabling access to broader and more up-to-date information. However, unreliable search results may also pose safety threats to end users, establishing a new threat surface. In this work, we conduct two in-the-wild experiments to demonstrate both the...

7.2AI score
Exploits0
GithubExploit
GithubExploit
added 2025/09/26 4:41 a.m.186 views

XSS-CTFs

XSS-CTFs Contains hands-on XSS test cases from beginner...

6.4AI score
Exploits0
Packet Storm News
Packet Storm News
added 2025/09/24 12:0 a.m.4 views

STAF: Leveraging LLMs for Automated Attack Tree-Based Security Test Generation

In modern automotive development, security testing is critical for safeguarding systems against increasingly advanced threats. Attack trees are widely used to systematically represent potential attack vectors, but generating comprehensive test cases from these trees remains a labor-intensive,...

6.8AI score
Exploits0
Gitee
Gitee
added 2025/09/21 1:12 a.m.161 views

WAFTest

This repository is an offensive tool for testing web application firewalls WAFs. It contains a collection of test cases and scripts to evaluate the effectiveness of WAFs against various types of attacks. The tool includes test cases for common web application vulnerabilities such as: Command...

7.3AI score
Exploits0
SUSE CVE
SUSE CVE
added 2025/09/04 11:23 p.m.1 views

SUSE CVE-2025-38708

In the Linux kernel, the following vulnerability has been resolved: drbd: add missing krefget in handlewriteconflicts With two-primaries enabled, DRBD tries to detect "concurrent" writes and handle write conflicts, so that even if you write to the same sector simultaneously on both nodes, they en...

7.8CVSS6.3AI score0.00025EPSS
Exploits0References3
Gitee
Gitee
added 2025/07/06 2:38 a.m.754 views

Exploit for Out-of-bounds Write in F5 Nginx

Disclosures Zero-day and N-day security vulnerability notes, analysis, and proof-of-concepts URL: https://github.com/badd1e/Disclosures List CVE-2009-2629: nginx http module Buffer Underflow Remote Code Execution Vulnerability Patch analysis, testcase, notes CVE-2013-0007: Microsoft XML Core...

9.3CVSS8.4AI score0.781EPSS
Exploits8
Packet Storm News
Packet Storm News
added 2025/06/16 12:0 a.m.7 views

Specification and Evaluation of Multi-Agent LLM Systems -- Prototype and Cybersecurity Applications

Recent advancements in LLMs indicate potential for novel applications, e.g., through reasoning capabilities in the latest OpenAI and DeepSeek models. For applying these models in specific domains beyond text generation, LLM-based multi-agent approaches can be utilized that solve complex tasks by...

7.3AI score
Exploits0
Packet Storm News
Packet Storm News
added 2025/06/03 12:0 a.m.3 views

VPI-Bench: Visual Prompt Injection Attacks for Computer-Use Agents

Computer-Use Agents CUAs with full system access enable powerful task automation but pose significant security and privacy risks due to their ability to manipulate files, access user data, and execute arbitrary commands. While prior work has focused on browser-based agents and HTML-level attacks,...

7.8AI score
Exploits0
Positive Technologies
Positive Technologies
added 2025/01/01 12:0 a.m.1 views

PT-2026-4771

Name of the Vulnerable Software and Affected Versions eslint versions prior to 9.26.0 Description A stack overflow issue exists in eslint when serializing objects containing circular references within the eslint/lib/shared/serialization.js file. The issue is triggered through the RuleTester.run...

5.5CVSS5.4AI score0.0003EPSS
Exploits1References15
Tenable Nessus
Tenable Nessus
added 2024/06/07 12:0 a.m.45 views

Ollama < 0.1.34 Improper Input Validation

The version of Ollama installed on the remote host is prior to 0.1.34. It is, therefore, affected by an improper input validation vulnerability. Ollama before 0.1.34 does not validate the format of the digest sha256 with 64 hex digits when getting the model path, and thus mishandles the...

8.8CVSS7.2AI score0.93747EPSS
Exploits4References2
Veracode
Veracode
added 2024/06/05 6:7 a.m.32 views

Improper Input Validation

github.com/ollama/ollama is vulnerable to Improper Input Validation. The vulnerability is due to improper validation of the digest format sha256 with 64 hex digits when getting the model path, which results in the mishandling of the TestGetBlobsPath test cases with fewer than 64 hex digits, more...

8.8CVSS6.4AI score0.93747EPSS
Exploits4References6Affected Software1
Vulnrichment
Vulnrichment
added 2024/05/31 12:0 a.m.29 views

CVE-2024-37032

Ollama before 0.1.34 does not validate the format of the digest sha256 with 64 hex digits when getting the model path, and thus mishandles the TestGetBlobsPath test cases such as fewer than 64 hex digits, more than 64 hex digits, or an initial ../ substring...

7AI score0.93747EPSS
Exploits4References4
Rows per page
Query Builder