BlindGuard: Safeguarding LLM-Based Multi-Agent Systems under Unknown Attacks
The security of LLM-based multi-agent systems (MAS) is critically threatened by propagation vulnerability, where malicious agents can distort collective decision-making through inter-agent message interactions. While existing supervised defense methods demonstrate promising performance, they may be...