Beyond Classification: Evaluating LLMs for Fine-Grained Automatic Malware Behavior Auditing
Automated malware classification has achieved strong detection performance. Malware behavior auditing, by contrast, seeks causal and verifiable explanations of malicious activities, which are essential not only to reveal what malware does but also to substantiate such claims with evidence. This task is challengin...