365 matches found
A New Framework for Cybersecurity Refusals in AI Agents
Agentic scaffolds have dramatically improved LLM performance on complex, long-horizon tasks, yielding both broad benefits and amplified risks in domains like cybersecurity. Existing benchmarks for AI agents in cybersecurity focus mainly on measuring proficiency: how effectively agents can complet...
offensive-claude
Offensive Security Research Config for Claude Code A comprehe...
portofolio_DWForSec
DwF — Cybersecurity Portfolio A professional cybersecurity po...
web-app-pentest-playbook
Web Application Pentest Playbook A structured methodology and...
01-Pentesting-and-Offensive-Security
No d...
Introducing Penetration Test Findings: Unified Offensive Security in Wiz
Streamline pen-testing by unifying findings from bug bounties, manual audits, and Wiz Red Agent into a single, context-rich view...
Automation-Exploit: A Multi-Agent LLM Framework for Adaptive Offensive Security with Digital Twin-Based Risk-Mitigated Exploitation
The offensive security landscape is highly fragmented: enterprise platforms avoid memory-corruption vulnerabilities due to Denial of Service (DoS) risks, Automatic Exploit Generation (AEG) systems suffer from semantic blindness, and Large Language Model (LLM) agents face safety alignment filters and...
vantix
Vantix Vantix is a Codex-native offensive-security control pl...
Decepticon
⚡ Decepticon: Autonomous Multi-Agent Offensive Security
Wa3r-OffSec-Kit
🔐 Wa3r-OffSec-Kit - Practical Security Tools and Notes
PT-2026-25586
Summary Prior to Memray 1.19.2, Memray rendered the command line of the tracked process directly into generated HTML reports without escaping. Because there was no escaping, attacker-controlled command line arguments were inserted as raw HTML into the generated report. This allowed JavaScript...
Offensive-Security-KnowledgeBase
Offensive-Security-KnowledgeBase Str...
Cyber-Guide
From arcades to Azure: Felix’s security research journey
When you talk with Felix, you quickly get the sense that he has always been propelled by curiosity and by a need for something that truly challenges him. Today, he is a successful independent security researcher who uncovers vulnerabilities across Microsoft cloud services. However, his path into...
KOREX
⚡ KOREX FRAMEWORK ⚡ Advanced Security Operations Framework "Kore...
Introducing AI Cyber Model Arena: A Real-World Benchmark for AI Agents in Cybersecurity
Wiz Research’s AI Cyber Model Arena benchmarks offensive AI security on 257 real-world challenges (zero-days, CVEs, API/web, and cloud) across AWS/Azure/GCP/K8s, demonstrating what AI models and agents can really do...
CyberExplorer: Benchmarking LLM Offensive Security Capabilities in a Real-World Attacking Simulation Environment
Real-world offensive security operations are inherently open-ended: attackers explore unknown attack surfaces, revise hypotheses under uncertainty, and operate without guaranteed success. Existing LLM-based offensive agent evaluations rely on closed-world settings with predefined goals and binary...
Top AI Tools for Red Teaming in 2026
Red teaming has undergone a radical evolution. Modern organizations can no longer rely solely on human creativity or…...
Ofensive-security-Portfolio
This repository contains my Offensive Cyber Security / Penetrati...
To Defend against Cyber Attacks, We Must Teach AI Agents to Hack
For over a decade, cybersecurity has relied on human labor scarcity to limit attackers to high-value targets manually or generic automated attacks at scale. Building sophisticated exploits requires deep expertise and manual effort, leading defenders to assume adversaries cannot afford tailored...