3 matches found
Benchmarking-Agent-Architectures
Benchmarking Agent Architectures for LLM-Based Exploit Gener...
AI in Cybersecurity Education -- Scalable Agentic CTF Design Principles and Educational Outcomes
Large language models are rapidly changing how learners acquire and demonstrate cybersecurity skills. However, when human--AI collaboration is allowed, educators still lack validated competition designs and evaluation practices that remain fair and evidence-based. This paper presents a...
Recursive Language Models for Jailbreak Detection: A Procedural Defense for Tool-Augmented Agents
Jailbreak prompts are a practical and evolving threat to large language models LLMs, particularly in agentic systems that execute tools over untrusted content. Many attacks exploit long-context hiding, semantic camouflage, and lightweight obfuscations that can evade single-pass guardrails. We...