Lucene search
K

6 matches found

Packet Storm News
Packet Storm News
added 2025/08/05 12:0 a.m.7 views

When Good Sounds Go Adversarial: Jailbreaking Audio-Language Models with Benign Inputs

As large language models become increasingly integrated into daily life, audio has emerged as a key interface for human-AI interaction. However, this convenience also introduces new vulnerabilities, making audio a potential attack surface for adversaries. Our research introduces WhisperInject, a...

7.3AI score
Exploits0
Packet Storm News
Packet Storm News
added 2025/05/28 12:0 a.m.2 views

Private Rate-Constrained Optimization with Applications to Fair Learning

Many problems in trustworthy ML can be formulated as minimization of the model error under constraints on the prediction rates of the model for suitably-chosen marginals, including most group fairness constraints demographic parity, equality of odds, etc.. In this work, we study such constrained...

6.9AI score
Exploits0
Packet Storm News
Packet Storm News
added 2025/05/18 12:0 a.m.2 views

Private Statistical Estimation Via Truncation

We introduce a novel framework for differentially private DP statistical estimation via data truncation, addressing a key challenge in DP estimation when the data support is unbounded. Traditional approaches rely on problem-specific sensitivity analysis, limiting their applicability. By leveragin...

6.9AI score
Exploits0
Packet Storm News
Packet Storm News
added 2025/05/14 12:0 a.m.2 views

Adversarial Attack on Large Language Models Using Exponentiated Gradient Descent

As Large Language Models LLMs are widely used, understanding them systematically is key to improving their safety and realizing their full potential. Although many models are aligned using techniques such as reinforcement learning from human feedback RLHF, they are still vulnerable to jailbreakin...

7.4AI score
Exploits0
Schneier on Security
Schneier on Security
added 2022/05/25 3:30 p.m.16 views

Manipulating Machine-Learning Systems through the Order of the Training Data

Yet another adversarial ML attack: Most deep neural networks are trained by stochastic gradient descent. Now “stochastic” is a fancy Greek word for “random”; it means that the training data are fed into the model in random order. So what happens if the bad guys can cause the order to be not rando...

1.1AI score
Exploits0
CERT
CERT
added 2020/03/19 12:0 a.m.66 views

Machine learning classifiers trained via gradient descent are vulnerable to arbitrary misclassification attack

Overview Machine learning models trained using gradient descent can be forced to make arbitrary misclassifications by an attacker that can influence the items to be classified. The impact of a misclassification varies widely depending on the ML model's purpose and of what systems it is a part...

6.6AI score
Exploits0References11
Rows per page
Query Builder