659 matches found

Packet Storm News
added 2025/07/05 12:00 a.m., 3 views

Hijacking JARVIS: Benchmarking Mobile GUI Agents against Unprivileged Third Parties

Mobile GUI agents are designed to autonomously execute diverse device-control tasks by interpreting and interacting with mobile screens. Despite notable advancements, their resilience in real-world scenarios where screen content may be partially manipulated by untrustworthy third parties remains...

7.4 AI score
Exploits: 0
Packet Storm News
added 2025/07/04 12:00 a.m., 4 views

Evaluating the Evaluators: Trust in Adversarial Robustness Tests

Despite significant progress in designing powerful adversarial evasion attacks for robustness verification, the evaluation of these methods often remains inconsistent and unreliable. Many assessments rely on mismatched models, unverified implementations, and uneven computational budgets, which ca...

6.8 AI score
Exploits: 0
Packet Storm News
added 2025/06/25 12:00 a.m., 6 views

JsDeObsBench: Measuring and Benchmarking LLMs for JavaScript Deobfuscation

Deobfuscating JavaScript (JS) code poses a significant challenge in web security, particularly as obfuscation techniques are frequently used to conceal malicious activities within scripts. While Large Language Models (LLMs) have recently shown promise in automating the deobfuscation process,...

7.2 AI score
Exploits: 0
Packet Storm News
added 2025/06/22 12:00 a.m., 2 views

Pushing the Limits of Safety: a Technical Report on the ATLAS Challenge 2025

Multimodal Large Language Models (MLLMs) have enabled transformative advancements across diverse applications but remain susceptible to safety threats, especially jailbreak attacks that induce harmful outputs. To systematically evaluate and improve their safety, we organized the Adversarial Testing...

7.6 AI score
Exploits: 0
Packet Storm News
added 2025/06/22 12:00 a.m., 4 views

InfoFlood: Jailbreaking Large Language Models with Information Overload

Large Language Models (LLMs) have demonstrated remarkable capabilities across various domains. However, their potential to generate harmful responses has raised significant societal and regulatory concerns, especially when manipulated by adversarial techniques known as "jailbreak" attacks. Existing...

7 AI score
Exploits: 0
Packet Storm News
added 2025/06/22 12:00 a.m., 5 views

UCD: Unlearning in LLMs Via Contrastive Decoding

Machine unlearning aims to remove specific information (e.g., sensitive or undesirable content) from large language models (LLMs) while preserving overall performance. We propose an inference-time unlearning algorithm that uses contrastive decoding, leveraging two auxiliary smaller models, one train...

6.9 AI score
Exploits: 0
Packet Storm News
added 2025/06/22 12:00 a.m., 4 views

DinoCompanion: an Attachment-Theory Informed Multimodal Robot for Emotionally Responsive Child-AI Interaction

Children's emotional development fundamentally relies on secure attachment relationships, yet current AI companions lack the theoretical foundation to provide developmentally appropriate emotional support. We introduce DinoCompanion, the first attachment-theory-grounded multimodal robot for...

7 AI score
Exploits: 0
Packet Storm News
added 2025/06/21 12:00 a.m., 4 views

One-shot Face Sketch Synthesis in the Wild via Generative Diffusion Prior and Instruction Tuning

Face sketch synthesis is a technique aimed at converting face photos into sketches. Existing face sketch synthesis research mainly relies on training with numerous photo-sketch sample pairs from existing datasets. However, these large-scale discriminative learning methods will have to face proble...

6.9 AI score
Exploits: 0
Packet Storm News
added 2025/06/21 12:00 a.m., 3 views

RAS-Eval: a Comprehensive Benchmark for Security Evaluation of LLM Agents in Real-World Environments

The rapid deployment of Large Language Model (LLM) agents in critical domains like healthcare and finance necessitates robust security frameworks. To address the absence of standardized evaluation benchmarks for these agents in dynamic environments, we introduce RAS-Eval, a comprehensive security...

7.3 AI score
Exploits: 0
Packet Storm News
added 2025/06/21 12:00 a.m., 2 views

ExtendAttack: Attacking Servers of LRMs via Extending Reasoning

Large Reasoning Models (LRMs) have demonstrated promising performance in complex tasks. However, the resource-consuming reasoning processes may be exploited by attackers to maliciously occupy the resources of the servers, leading to a crash, like the DDoS attack in cyber. To this end, we propose a...

6.8 AI score
Exploits: 0
Packet Storm News
added 2025/06/20 12:00 a.m., 24 views

SafeGenBench: a Benchmark Framework for Security Vulnerability Detection in LLM-Generated Code

The code generation capabilities of large language models (LLMs) have emerged as a critical dimension in evaluating their overall performance. However, prior research has largely overlooked the security risks inherent in the generated code. In this work, we introduce SafeGenBench, a benchmark...

7.3 AI score
Exploits: 0
Packet Storm News
added 2025/06/20 12:00 a.m., 3 views

Towards Effective Complementary Security Analysis Using Large Language Models

A key challenge in security analysis is the manual evaluation of potential security weaknesses generated by static application security testing (SAST) tools. Numerous false positives (FPs) in these reports reduce the effectiveness of security analysis. We propose using Large Language Models (LLMs) to...

6.8 AI score
Exploits: 0
Packet Storm News
added 2025/06/18 12:00 a.m., 1 view

Tech-ASan: Two-Stage Check for Address Sanitizer

Address Sanitizer (ASan) is a sharp weapon for detecting memory safety violations, including temporal and spatial errors hidden in C/C++ programs during execution. However, ASan incurs significant runtime overhead, which limits its efficiency in testing large software. The overhead mainly comes fro...

7.5 AI score
Exploits: 0
Fedora
added 2025/06/15 1:49 a.m., 5 views

[SECURITY] Fedora 42 Update: golang-x-perf-0-0.28.20250326git02a15fd.fc42

This package holds the source for various tools related to performance measurement, storage, and analysis. - cmd/benchstat contains a command-line tool that computes and compares statistics about benchmarks. - cmd/benchsave contains a command-line tool for publishing benchmark results. - storag...

7.2 AI score
Exploits: 0
Fedora
added 2025/06/15 1:07 a.m., 9 views

[SECURITY] Fedora 41 Update: golang-x-perf-0-0.28.20250326git02a15fd.fc41

This package holds the source for various tools related to performance measurement, storage, and analysis. - cmd/benchstat contains a command-line tool that computes and compares statistics about benchmarks. - cmd/benchsave contains a command-line tool for publishing benchmark results. - storag...

7.5 CVSS, 9.8 AI score, 0.00804 EPSS
Exploits: 2
Packet Storm News
added 2025/06/12 12:00 a.m., 2 views

MAYA: Addressing Inconsistencies in Generative Password Guessing through a Unified Benchmark

Recent advances in generative models have led to their application in password guessing, with the aim of replicating the complexity, structure, and patterns of human-created passwords. Despite their potential, inconsistencies and inadequate evaluation methodologies in prior research have hindered...

7 AI score
Exploits: 0
Packet Storm News
added 2025/06/12 12:00 a.m., 2 views

Uncovering Reliable Indicators: Improving IoC Extraction from Threat Reports

Indicators of Compromise (IoCs) are critical for threat detection and response, marking malicious activity across networks and systems. Yet, the effectiveness of automated IoC extraction systems is fundamentally limited by one key issue: the lack of high-quality ground truth. Current extraction too...

6.9 AI score
Exploits: 0
Packet Storm News
added 2025/06/09 12:00 a.m., 3 views

LLMs Caught in the Crossfire: Malware Requests and Jailbreak Challenges

The widespread adoption of Large Language Models (LLMs) has heightened concerns about their security, particularly their vulnerability to jailbreak attacks that leverage crafted prompts to generate malicious outputs. While prior research has been conducted on general security capabilities of LLMs,...

7.1 AI score
Exploits: 0
Packet Storm News
added 2025/06/09 12:00 a.m., 2 views

LLM Unlearning Should Be Form-Independent

Large Language Model (LLM) unlearning aims to erase or suppress undesirable knowledge within the model, offering promise for controlling harmful or private information to prevent misuse. However, recent studies highlight its limited efficacy in real-world scenarios, hindering practical adoption. In...

6.7 AI score
Exploits: 0
Packet Storm News
added 2025/06/09 12:00 a.m., 24 views

SoK: Data Reconstruction Attacks against Machine Learning Models: Definition, Metrics, and Benchmark

Data reconstruction attacks, which aim to recover the training dataset of a target model with limited access, have gained increasing attention in recent years. However, there is currently no consensus on a formal definition of data reconstruction attacks or appropriate evaluation metrics for...

6.8 AI score
Exploits: 0