5 matches found
Evasive Intelligence: Lessons from Malware Analysis for Evaluating AI Agents
Artificial intelligence AI systems are increasingly adopted as tool-using agents that can plan, observe their environment, and take actions over extended time periods. This evolution challenges current evaluation practices where the AI models are tested in restricted, fully observable settings. I...
Poster: Enhancing GNN Robustness for Network Intrusion Detection Via Agent-Based Analysis
Graph Neural Networks GNNs show great promise for Network Intrusion Detection Systems NIDS, particularly in IoT environments, but suffer performance degradation due to distribution drift and lack robustness against realistic adversarial attacks. Current robustness evaluations often rely on...
Exploit for Improper Input Validation in Microsoft
CVE-2024-21413-Microsoft-Outlook-Remote-Code-Execution-Vulnera...
Why I’m Ecstatic About the MITRE ATT&CK Results
Yesterday, MITRE published the results of its first public evaluation of endpoint detection & response EDR vendors based on its increasingly-popular ATT&CK framework. The ATT&CK evaluations are a new approach to EDR testing - open, sophisticated, rigorous, and reflective of the real world. We...
TCPCopy - A TCP Stream Replay Tool
TCPCopy is a TCP stream replay tool to support real testing of Internet server applications. Description Although the real live flow is important for the test of Internet server applications, it is hard to simulate it as online environments are too complex. To support more realistic testing of...