Hack The Box Launches AI Range Platform to Benchmark Autonomous Security Agents Against Human Cyber Teams in Realistic Threat Environments

December 15, 2025
|

Cybersecurity training provider Hack The Box has launched HTB AI Range, a simulation platform enabling enterprises to test autonomous AI security agents alongside human defenders under realistic operational conditions. The platform addresses the urgent need to continuously validate AI systems in realistic operational contexts where stakes are high and human oversight remains vital Cryptopolitan, as organizations prepare for AI-powered threat environments where attackers deploy automated reconnaissance and exploitation at unprecedented scale.

In a recent AI versus human capture the flag exercise, autonomous AI agents solved 19 out of 20 basic challenges, but in multi-step challenges in more complex environments, human teams significantly outperformed the AI agents Cryptopolitan. AI teams achieved a 95% success rate on easy-tier tasks but faced substantial limitations on final multi-step challenges where humans far outperformed AI capabilities OpenAI.

The AI Range simulates enterprise complexity with thousands of offensive and defensive targets that are continuously updated, supporting mapping to established cyber frameworks including MITRE ATT&CK, NIST/NICE guidelines, and OWASP Top 10 Cryptopolitan. Attackers are already using AI to scale activity to send thousands of automated requests, often multiple per second, targeting large tech, financial, manufacturing and government institutions OpenAI.

Vulnerabilities in AI models add to those already present in traditional IT infrastructure, so before agentic or AI-based cybersecurity tools can be deployed operationally, testing environments where AI agents and human defenders can work together under realistic pressure become essential Cryptopolitan. The platform represents a strategic shift from static security audits toward continuous threat exposure management models.

In a separate 10-day AI red teaming CTF run by HTB and HackerOne, only 43% of registrants completed a single challenge, signaling a significant skills gap OpenAI in the workforce's ability to understand and defend against AI-enabled threats. Adversaries have already demonstrated the ability to perform attacks at 10 times previous speeds, reinforcing existing ransomware and social engineering tactics Tekedia.

The launch coincides with HTB's announcement of an AI Red Teamer Certification available in Q1 2026, developed with Google to align with Google's Secure AI Framework, establishing the first industry credential for end-to-end AI system security assessment.

Haris Pylarinos, CEO and founder of Hack The Box, stated: "For over two years, we've been advancing AI-driven learning paths, labs, and research where machines and humans compete, collaborate, and co-evolve. With HTB AI Range, we're not reacting to AI's rise in cyber; we're defining how defence evolves alongside it" Cryptopolitan.

Dawn-Marie Vaughan, Global Offering Lead for Cybersecurity at DXC, commented: "AI is fundamentally reshaping the threat landscape. Early research is already showing how AI can automate reconnaissance and link potential exploit paths in ways that were extremely difficult just a year ago. As these capabilities mature, defenders will need teams trained to operate under more dynamic, real-world conditions" OpenAI.

The company suggests AI struggles with complexity and multi-stage operations, pointing to the continuing value of human expertise, especially in high-stakes or complex work

Enterprises can use the AI Range to validate whether existing security measures work under AI-powered attacks, give their cybersecurity teams experience of AI-powered threats, and develop more resilient cybersecurity tools based on agentic AI Cryptopolitan. Such exercises could be used to justify cybersecurity investment to financial decision-makers Cryptopolitan, translating technical readiness into business risk metrics.

Continuous testing and validation of cybersecurity defences proves more effective long-term than static audits or penetration testing exercises, aligning closer to Continuous Threat Exposure Management models Cryptopolitan. Organizations deploying AI security agents must establish governance frameworks determining which defensive operations can be fully autonomous, which require human oversight, and which must remain entirely human-controlled before adversaries exploit these systems' inherent limitations in complex, multi-stage attack scenarios.

As AI matures and frameworks like MITRE ATLAS gain traction, tools like HTB's AI Range may become standard components in enterprise security programmes Cryptopolitan. Decision-makers should monitor whether hybrid human-AI defensive teams demonstrate measurable improvements in mean time to detect and respond compared to purely human or fully autonomous approaches. The platform's ability to benchmark AI agent performance against human expertise will likely inform regulatory frameworks governing autonomous security tool deployment, particularly as threat actors weaponize similar AI capabilities for offensive operations at unprecedented velocity and scale.

Source & Date

Source: Artificial Intelligence News, Hack The Box, Business Wire, SiliconANGLE, Morningstar
Date: December 3, 2025

  • Featured tools
Scalenut AI
Free

Scalenut AI is an all-in-one SEO content platform that combines AI-driven writing, keyword research, competitor insights, and optimization tools to help you plan, create, and rank content.

#
SEO
Learn more
Hostinger Website Builder
Paid

Hostinger Website Builder is a drag-and-drop website creator bundled with hosting and AI-powered tools, designed for businesses, blogs and small shops with minimal technical effort.It makes launching a site fast and affordable, with templates, responsive design and built-in hosting all in one.

#
Productivity
#
Startup Tools
#
Ecommerce
Learn more

Learn more about future of AI

Join 80,000+ Ai enthusiast getting weekly updates on exciting AI tools.
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.

Hack The Box Launches AI Range Platform to Benchmark Autonomous Security Agents Against Human Cyber Teams in Realistic Threat Environments

December 15, 2025

Cybersecurity training provider Hack The Box has launched HTB AI Range, a simulation platform enabling enterprises to test autonomous AI security agents alongside human defenders under realistic operational conditions. The platform addresses the urgent need to continuously validate AI systems in realistic operational contexts where stakes are high and human oversight remains vital Cryptopolitan, as organizations prepare for AI-powered threat environments where attackers deploy automated reconnaissance and exploitation at unprecedented scale.

In a recent AI versus human capture the flag exercise, autonomous AI agents solved 19 out of 20 basic challenges, but in multi-step challenges in more complex environments, human teams significantly outperformed the AI agents Cryptopolitan. AI teams achieved a 95% success rate on easy-tier tasks but faced substantial limitations on final multi-step challenges where humans far outperformed AI capabilities OpenAI.

The AI Range simulates enterprise complexity with thousands of offensive and defensive targets that are continuously updated, supporting mapping to established cyber frameworks including MITRE ATT&CK, NIST/NICE guidelines, and OWASP Top 10 Cryptopolitan. Attackers are already using AI to scale activity to send thousands of automated requests, often multiple per second, targeting large tech, financial, manufacturing and government institutions OpenAI.

Vulnerabilities in AI models add to those already present in traditional IT infrastructure, so before agentic or AI-based cybersecurity tools can be deployed operationally, testing environments where AI agents and human defenders can work together under realistic pressure become essential Cryptopolitan. The platform represents a strategic shift from static security audits toward continuous threat exposure management models.

In a separate 10-day AI red teaming CTF run by HTB and HackerOne, only 43% of registrants completed a single challenge, signaling a significant skills gap OpenAI in the workforce's ability to understand and defend against AI-enabled threats. Adversaries have already demonstrated the ability to perform attacks at 10 times previous speeds, reinforcing existing ransomware and social engineering tactics Tekedia.

The launch coincides with HTB's announcement of an AI Red Teamer Certification available in Q1 2026, developed with Google to align with Google's Secure AI Framework, establishing the first industry credential for end-to-end AI system security assessment.

Haris Pylarinos, CEO and founder of Hack The Box, stated: "For over two years, we've been advancing AI-driven learning paths, labs, and research where machines and humans compete, collaborate, and co-evolve. With HTB AI Range, we're not reacting to AI's rise in cyber; we're defining how defence evolves alongside it" Cryptopolitan.

Dawn-Marie Vaughan, Global Offering Lead for Cybersecurity at DXC, commented: "AI is fundamentally reshaping the threat landscape. Early research is already showing how AI can automate reconnaissance and link potential exploit paths in ways that were extremely difficult just a year ago. As these capabilities mature, defenders will need teams trained to operate under more dynamic, real-world conditions" OpenAI.

The company suggests AI struggles with complexity and multi-stage operations, pointing to the continuing value of human expertise, especially in high-stakes or complex work

Enterprises can use the AI Range to validate whether existing security measures work under AI-powered attacks, give their cybersecurity teams experience of AI-powered threats, and develop more resilient cybersecurity tools based on agentic AI Cryptopolitan. Such exercises could be used to justify cybersecurity investment to financial decision-makers Cryptopolitan, translating technical readiness into business risk metrics.

Continuous testing and validation of cybersecurity defences proves more effective long-term than static audits or penetration testing exercises, aligning closer to Continuous Threat Exposure Management models Cryptopolitan. Organizations deploying AI security agents must establish governance frameworks determining which defensive operations can be fully autonomous, which require human oversight, and which must remain entirely human-controlled before adversaries exploit these systems' inherent limitations in complex, multi-stage attack scenarios.

As AI matures and frameworks like MITRE ATLAS gain traction, tools like HTB's AI Range may become standard components in enterprise security programmes Cryptopolitan. Decision-makers should monitor whether hybrid human-AI defensive teams demonstrate measurable improvements in mean time to detect and respond compared to purely human or fully autonomous approaches. The platform's ability to benchmark AI agent performance against human expertise will likely inform regulatory frameworks governing autonomous security tool deployment, particularly as threat actors weaponize similar AI capabilities for offensive operations at unprecedented velocity and scale.

Source & Date

Source: Artificial Intelligence News, Hack The Box, Business Wire, SiliconANGLE, Morningstar
Date: December 3, 2025

Promote Your Tool

Copy Embed Code

Similar Blogs

March 2, 2026
|

Ideogram AI Boosts Visual Creativity, Revolutionizing Content Production

Ideogram AI leverages advanced generative algorithms to produce images from text prompts, offering customization, style transfer, and real-time iterative adjustments.
Read more
March 2, 2026
|

Pixelcut Rises as AI Photo Editing Powerhouse

Pixelcut, available via the Google Play Store, offers automated background removal, AI-generated product photography, image upscaling, and design templates tailored for social commerce.
Read more
March 2, 2026
|

Pony AI Hits Robotaxi Breakeven in Shenzhen

Pony.ai confirmed that its seventh-generation robotaxis reached UE (unit economics) breakeven in Shenzhen. The company attributed the milestone to improved hardware integration, lower sensor costs.
Read more
March 2, 2026
|

Scrutiny Grows Over Grok AI Amid Ethical Concerns

In commentary reported by AL.com, Gidley raised concerns regarding Grok AI’s responses and potential inconsistencies in politically sensitive contexts. The discussion centers on whether AI systems deployed on major digital platforms.
Read more
March 2, 2026
|

Investors Pivot as AI SaaS Hype Fades

A notable recalibration is unfolding in venture markets as investors signal waning appetite for hype-driven AI SaaS startups. Instead, capital is increasingly flowing toward companies demonstrating defensible technology.
Read more
March 2, 2026
|

Big Tech to Spend $655 Billion on AI

A sweeping capital surge is underway as the four largest U.S. technology companies prepare to spend a combined $655 billion on artificial intelligence infrastructure and development this year.
Read more