3 matches found
MAD-Spear: a Conformity-Driven Prompt Injection Attack on Multi-Agent Debate Systems
Multi-agent debate MAD systems leverage collaborative interactions among large language models LLMs agents to improve reasoning capabilities. While recent studies have focused on increasing the accuracy and scalability of MAD systems, their security vulnerabilities have received limited attention...
PhishDebate: an LLM-Based Multi-Agent Framework for Phishing Website Detection
Phishing websites continue to pose a significant cybersecurity threat, often leveraging deceptive structures, brand impersonation, and social engineering tactics to evade detection. While recent advances in large language models LLMs have enabled improved phishing detection through contextual...
Amplified Vulnerabilities: Structured Jailbreak Attacks on LLM-Based Multi-Agent Debate
Multi-Agent Debate MAD, leveraging collaborative interactions among Large Language Models LLMs, aim to enhance reasoning capabilities in complex tasks. However, the security implications of their iterative dialogues and role-playing characteristics, particularly susceptibility to jailbreak attack...