ExploitBench AI Exploit Benchmark Tool
ExploitBench measures how far AI agents climb, from reaching vulnerable code, to triggering the bug, to building exploit primitives, to arbitrary code execution...
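One way to picture the staged measurement described above is as an ordered scale where a run is scored by the highest stage it reaches. The sketch below is illustrative only and is not ExploitBench's actual API or scoring rule: the `Stage` names are taken from the stages the description lists, while `highest_rung` and the rule that a stage counts only if every lower stage was also reached are assumptions made for this example.

```python
from enum import IntEnum
from typing import Iterable


class Stage(IntEnum):
    """Hypothetical rungs, named after the stages listed in the description."""
    NONE = 0
    REACHED_VULNERABLE_CODE = 1
    TRIGGERED_BUG = 2
    BUILT_EXPLOIT_PRIMITIVE = 3
    ARBITRARY_CODE_EXECUTION = 4


def highest_rung(achieved: Iterable[Stage]) -> Stage:
    """Return the highest stage reached without skipping a lower one.

    Assumption: the ladder is cumulative, so a gap stops the climb.
    """
    reached = set(achieved)
    rung = Stage.NONE
    for stage in sorted(Stage):
        if stage is Stage.NONE:
            continue  # NONE is the floor, not a stage to achieve
        if stage not in reached:
            break  # first missing stage ends the climb
        rung = stage
    return rung


if __name__ == "__main__":
    run = [Stage.REACHED_VULNERABLE_CODE, Stage.TRIGGERED_BUG]
    print(highest_rung(run).name)  # TRIGGERED_BUG
```

Using an `IntEnum` keeps the stages comparable, so results from different runs can be ranked or averaged directly.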