5 matches found
How Dangerous Is Anthropic’s Mythos AI?
Last month, Anthropic made a remarkable announcement about its new model, Claude Mythos Preview: it was so good at finding security vulnerabilities in software that the company would not release it to the general public. Instead, it would only be available to a select group of companies to scan a...
PT-2026-31562
Open AI models detect vulnerabilities by analyzing code, configs, and system behavior step-by-step—like spotting the FreeBSD zero-day CVE-2024-47467 in the Mythos showcase. Even a 3B open model nailed it, matching bigger ones. The chart shows rankings flip across tasks: OWASP easy stuff vs. gnarl...
Before You Hand over the Wheel: Evaluating LLMs for Security Incident Analysis
Security incident analysis SIA poses a major challenge for security operations centers, which must manage overwhelming alert volumes, large and diverse data sources, complex toolchains, and limited analyst expertise. These difficulties intensify because incidents evolve dynamically and require...
Predicting Tail-Risk Escalation in IDS Alert Time Series
Network defenders face a steady stream of attacks, observed as raw Intrusion Detection System IDS alerts. The sheer volume of alerts demands prioritization, typically based on high-level risk classifications. This work expands the scope of risk measurement by examining alerts not only through the...
Dark LLMs: the Growing Threat of Unaligned AI Models
Large Language Models LLMs rapidly reshape modern life, advancing fields from healthcare to education and beyond. However, alongside their remarkable capabilities lies a significant threat: the susceptibility of these models to jailbreaking. The fundamental vulnerability of LLMs to jailbreak...