OpenAI Launches Lockdown Mode and Elevated Risk Labels to Combat Prompt Injection Attacks

OpenAI has announced two new security features for ChatGPT designed to help organizations protect against emerging AI security threats, according to the company’s official announcement.

The new capabilities, Lockdown Mode and Elevated Risk labels, specifically target prompt injection attacks and AI-driven data exfiltration—security vulnerabilities where malicious actors attempt to manipulate AI systems into revealing sensitive information or behaving unexpectedly.

According to OpenAI, these features are aimed at organizational users of ChatGPT who need enhanced security controls to protect their data and operations. Prompt injection has become an increasingly recognized security concern as AI systems are integrated into enterprise workflows, where attackers embed malicious instructions in content that AI systems process.

While OpenAI’s announcement confirms the introduction of these security features, specific technical details about how Lockdown Mode operates or how Elevated Risk labels function were not provided in the source material.

The timing of this release reflects growing industry attention to AI security as organizations rapidly adopt large language models for business-critical applications. These tools represent OpenAI’s response to security concerns that have emerged as ChatGPT and similar AI systems have become more widely deployed in enterprise environments.