Lucene search
K

10 matches found

Packet Storm News
Packet Storm News
added 2026/05/01 12:0 a.m.3 views

STARE: Step-Wise Temporal Alignment and Red-Teaming Engine for Multi-Modal Toxicity Attack

Red-teaming Vision-Language Models is essential for identifying vulnerabilities where adversarial image-text inputs trigger toxic outputs. Existing approaches treat image generation as a black box, returning only terminal toxicity scores and leaving open the question of when and how toxic semanti...

5.8AI score
Exploits0
Schneier on Security
Schneier on Security
added 2026/02/20 12:8 p.m.6 views

Ring Cancels Its Partnership with Flock

It's a demonstration of how toxic the surveillance-tech company Flock has become when Amazon's Ring cancels the partnership between the two companies. As Hamilton Nolan advises, remove your Ring doorbell...

5.5AI score
Exploits0
Packet Storm News
Packet Storm News
added 2025/07/08 12:0 a.m.2 views

TuneShield: Mitigating Toxicity in Conversational AI While Fine-Tuning on Untrusted Data

Recent advances in foundation models, such as LLMs, have revolutionized conversational AI. Chatbots are increasingly being developed by customizing LLMs on specific conversational datasets. However, mitigating toxicity during this customization, especially when dealing with untrusted training dat...

7.5AI score
Exploits0
Packet Storm News
Packet Storm News
added 2025/06/11 12:0 a.m.3 views

GenBreak: Red Teaming Text-To-Image Generators Using Large Language Models

Text-to-image T2I models such as Stable Diffusion have advanced rapidly and are now widely used in content creation. However, these models can be misused to generate harmful content, including nudity or violence, posing significant safety risks. While most platforms employ content moderation...

7.3AI score
Exploits0
Packet Storm News
Packet Storm News
added 2025/06/06 12:0 a.m.3 views

The Scales of Justitia: a Comprehensive Survey on Safety Evaluation of LLMs

With the rapid advancement of artificial intelligence technology, Large Language Models LLMs have demonstrated remarkable potential in the field of Natural Language Processing NLP, including areas such as content generation, human-computer interaction, machine translation, and code generation,...

7.5AI score
Exploits0
Packet Storm News
Packet Storm News
added 2025/05/23 12:0 a.m.3 views

Chain-Of-Lure: a Synthetic Narrative-Driven Approach to Compromise Large Language Models

In the era of rapid generative AI development, interactions between humans and large language models face significant misusing risks. Previous research has primarily focused on black-box scenarios using human-guided prompts and white-box scenarios leveraging gradient-based LLM generation methods,...

7.3AI score
Exploits0
Akamai Blog
Akamai Blog
added 2024/12/02 2:0 p.m.7 views

Battling the Emotional Toxicity Within Games: How to Digitally Thrive

...

7.3AI score
Exploits0
Akamai Blog
Akamai Blog
added 2024/12/02 2:0 p.m.1 views

Battling the Emotional Toxicity Within Games: How to Digitally Thrive

...

7.3AI score
Exploits0
Schneier on Security
Schneier on Security
added 2024/06/19 8:26 p.m.9 views

New Blog Moderation Policy

There has been a lot of toxicity in the comments section of this blog. Recently, were having to delete more and more comments. Not just spam and off-topic comments, but also sniping and personal attacks. Its gotten so bad that I need to do something. My options are limited because Im just one...

7.2AI score
Exploits0
Wired Threat Level
Wired Threat Level
added 2020/06/04 3:58 p.m.27 views

Games Don't Do Enough to Combat Toxicity at Launch

Riot Games has cutting-edge moderation tools at its disposal. Few of them are present in Valorant, which launched this week...

1.9AI score
Exploits0
Rows per page
Query Builder