Lucene search
K

5 matches found

Packet Storm News
Packet Storm News
added 2026/05/10 12:0 a.m.33 views

AgentShield: Deception-Based Compromise Detection for Tool-Using LLM Agents

Defenses against indirect prompt injection IPI in tool-using LLM agents share two structural weaknesses. First, they all attempt to prevent attacks rather than detect the compromises that slip through. Second, they have only been evaluated in English, leaving users of low-resource languages such ...

5.8AI score
Exploits0
Microsoft Secure
Microsoft Secure
added 2026/01/21 2:33 p.m.6 views

A new era of agents, a new era of posture

The rise of AI Agents marks one of the most exciting shifts in technology today. Unlike traditional applications or cloud resources, these agents are not passive components- they reason, make decisions, invoke tools, and interact with other agents and systems on behalf of users. This autonomy...

5.8AI score
Exploits0
Packet Storm News
Packet Storm News
added 2025/06/11 12:0 a.m.3 views

Prompt Attacks Reveal Superficial Knowledge Removal in Unlearning Methods

In this work, we show that some machine unlearning methods may fail when subjected to straightforward prompt attacks. We systematically evaluate eight unlearning techniques across three model families, and employ output-based, logit-based, and probe analysis to determine to what extent supposedly...

6.9AI score
Exploits0
Packet Storm News
Packet Storm News
added 2025/04/16 12:0 a.m.9 views

InjectLab: a Tactical Framework for Adversarial Threat Modeling against Large Language Models

Large Language Models LLMs are changing the way people interact with technology. Tools like ChatGPT and Claude AI are now common in business, research, and everyday life. But with that growth comes new risks, especially prompt-based attacks that exploit how these models process language. InjectLa...

7AI score
Exploits0
Trend Micro Simply Security
Trend Micro Simply Security
added 2025/03/04 12:0 a.m.10 views

Exploiting DeepSeek-R1: Breaking Down Chain of Thought Security

This entry explores how the Chain of Thought reasoning in the DeepSeek-R1 AI model can be susceptible to prompt attacks, insecure output generation, and sensitive data theft...

7.1AI score
Exploits0
Rows per page
Query Builder