Hack The Box Launches AI Range Platform to Benchmark Autonomous Security Agents Against Human Cyber Teams in Realistic Threat Environments

December 15, 2025

|

Cybersecurity training provider Hack The Box has launched HTB AI Range, a simulation platform enabling enterprises to test autonomous AI security agents alongside human defenders under realistic operational conditions. The platform addresses the urgent need to continuously validate AI systems in realistic operational contexts where stakes are high and human oversight remains vital Cryptopolitan, as organizations prepare for AI-powered threat environments where attackers deploy automated reconnaissance and exploitation at unprecedented scale.

In a recent AI versus human capture the flag exercise, autonomous AI agents solved 19 out of 20 basic challenges, but in multi-step challenges in more complex environments, human teams significantly outperformed the AI agents Cryptopolitan. AI teams achieved a 95% success rate on easy-tier tasks but faced substantial limitations on final multi-step challenges where humans far outperformed AI capabilities OpenAI.

The AI Range simulates enterprise complexity with thousands of offensive and defensive targets that are continuously updated, supporting mapping to established cyber frameworks including MITRE ATT&CK, NIST/NICE guidelines, and OWASP Top 10 Cryptopolitan. Attackers are already using AI to scale activity to send thousands of automated requests, often multiple per second, targeting large tech, financial, manufacturing and government institutions OpenAI.

Vulnerabilities in AI models add to those already present in traditional IT infrastructure, so before agentic or AI-based cybersecurity tools can be deployed operationally, testing environments where AI agents and human defenders can work together under realistic pressure become essential Cryptopolitan. The platform represents a strategic shift from static security audits toward continuous threat exposure management models.

In a separate 10-day AI red teaming CTF run by HTB and HackerOne, only 43% of registrants completed a single challenge, signaling a significant skills gap OpenAI in the workforce's ability to understand and defend against AI-enabled threats. Adversaries have already demonstrated the ability to perform attacks at 10 times previous speeds, reinforcing existing ransomware and social engineering tactics Tekedia.

The launch coincides with HTB's announcement of an AI Red Teamer Certification available in Q1 2026, developed with Google to align with Google's Secure AI Framework, establishing the first industry credential for end-to-end AI system security assessment.

Haris Pylarinos, CEO and founder of Hack The Box, stated: "For over two years, we've been advancing AI-driven learning paths, labs, and research where machines and humans compete, collaborate, and co-evolve. With HTB AI Range, we're not reacting to AI's rise in cyber; we're defining how defence evolves alongside it" Cryptopolitan.

Dawn-Marie Vaughan, Global Offering Lead for Cybersecurity at DXC, commented: "AI is fundamentally reshaping the threat landscape. Early research is already showing how AI can automate reconnaissance and link potential exploit paths in ways that were extremely difficult just a year ago. As these capabilities mature, defenders will need teams trained to operate under more dynamic, real-world conditions" OpenAI.

The company suggests AI struggles with complexity and multi-stage operations, pointing to the continuing value of human expertise, especially in high-stakes or complex work

Enterprises can use the AI Range to validate whether existing security measures work under AI-powered attacks, give their cybersecurity teams experience of AI-powered threats, and develop more resilient cybersecurity tools based on agentic AI Cryptopolitan. Such exercises could be used to justify cybersecurity investment to financial decision-makers Cryptopolitan, translating technical readiness into business risk metrics.

Continuous testing and validation of cybersecurity defences proves more effective long-term than static audits or penetration testing exercises, aligning closer to Continuous Threat Exposure Management models Cryptopolitan. Organizations deploying AI security agents must establish governance frameworks determining which defensive operations can be fully autonomous, which require human oversight, and which must remain entirely human-controlled before adversaries exploit these systems' inherent limitations in complex, multi-stage attack scenarios.

As AI matures and frameworks like MITRE ATLAS gain traction, tools like HTB's AI Range may become standard components in enterprise security programmes Cryptopolitan. Decision-makers should monitor whether hybrid human-AI defensive teams demonstrate measurable improvements in mean time to detect and respond compared to purely human or fully autonomous approaches. The platform's ability to benchmark AI agent performance against human expertise will likely inform regulatory frameworks governing autonomous security tool deployment, particularly as threat actors weaponize similar AI capabilities for offensive operations at unprecedented velocity and scale.

Source & Date

Source: Artificial Intelligence News, Hack The Box, Business Wire, SiliconANGLE, Morningstar
Date: December 3, 2025

Featured tools

Upscayl AI

Free

Upscayl AI is a free, open-source AI-powered tool that enhances and upscales images to higher resolutions. It transforms blurry or low-quality visuals into sharp, detailed versions with ease.

#

Productivity

Learn more

Symphony Ayasdi AI

Free

SymphonyAI Sensa is an AI-powered surveillance and financial crime detection platform that surfaces hidden risk behavior through explainable, AI-driven analytics.

#

Finance

Learn more

Learn more about future of AI

Join 80,000+ Ai enthusiast getting weekly updates on exciting AI tools.

Thank you! Your submission has been received!

Oops! Something went wrong while submitting the form.

Promote Your Tool

Copy Embed Code

Similar Blogs

May 14, 2026

|

ChatGPT Outage Highlights AI Infrastructure Reliance

Users of ChatGPT experienced temporary errors indicating content loading failures, interrupting access to AI-generated responses.

May 14, 2026

|

Chrome On Device AI Raises Gemini Storage Risk

Reports indicate that Google Chrome may install or cache a sizable AI model, Gemini Nano, on user devices to enable faster on-device AI processing.

May 14, 2026

|

Amazon AI Commerce Shift Alexa Rufus Role

Amazon’s decision to position Alexa as the primary AI shopping interface marks a strategic restructuring of its retail AI stack.

May 14, 2026

|

Google Books Gains AI Magic Pointer

Google’s new Magic Pointer functionality allows users to interact with digital books in a more dynamic way, enabling contextual insights, instant explanations, and intelligent navigation across text.

May 14, 2026

|

Microsoft Edge Gains AI Cross-Tab Intelligence

Microsoft’s updated Edge Copilot now enables users to leverage AI that can extract, compare, and summarize content across multiple open tabs simultaneously.

May 14, 2026

|

Robotics Market Expands with Unitree Mecha Launch

Unitree’s latest offering is a large-scale, transformable robotic “mecha” system priced at approximately $650,000, positioning it in the ultra-premium robotics segment.

View Blogs