OpenAI’s New Enterprise Security Mode Locks Down ChatGPT Against Prompt Injection

OpenAI’s New Enterprise Security Mode Locks Down ChatGPT Against Prompt Injection
好的，我现在需要帮用户总结一篇文章的内容，控制在100字以内。首先，我得仔细阅读这篇文章，了解其主要内容和重点。文章主要讲的是OpenAI推出了两个新的安全功能，Lockdown Mode和Elevated Risk标签，目的是防范提示注入攻击。提示注入攻击是指攻击者通过恶意指令操控AI系统，窃取敏感数据。这两个新功能通过基础设施级别的限制来防止数据外泄，而不仅仅是依赖模型训练。 Lockdown Mode是一个可选的安全设置，针对高风险用户，如企业高管和安全团队。它限制了ChatGPT与外部系统的互动，只允许访问缓存内容，禁止实时网络请求。此外，还禁用了无法保证数据安全的功能，如生成图片、Deep Research等。 Elevated Risk标签则提供透明度，提醒用户某些功能可能存在未解决的安全漏洞。当用户启用网络相关功能时，标签会显示潜在威胁，并建议何时使用这些功能合适。文章还提到这些安全措施是现有保护机制的补充，包括沙盒、URL数据外泄防护等。OpenAI认为单一的安全控制点不够，因此采用了多层防御策略。总结起来，文章的核心是OpenAI推出的新安全措施如何防范提示注入攻击，并通过基础设施级别的限制来增强安全性。我需要将这些关键点浓缩到100字以内。接下来，我要确保语言简洁明了，不使用复杂的术语。同时要注意不要遗漏主要信息：Lockdown Mode、Elevated Risk标签、防范提示注入攻击的方法以及它们的作用。最后检查字数是否符合要求，并确保内容准确传达文章的核心信息。 </think> OpenAI推出Lockdown Mode和Elevated Risk标签两项安全功能,防范提示注入攻击。Lockdown Mode限制ChatGPT与外部系统交互,仅允许访问缓存内容,禁用高风险功能;Elevated Risk标签提醒用户启用网络功能可能带来的安全威胁。这些措施通过基础设施级别控制,物理阻止数据外泄,增强AI系统安全性。 2026-2-18 10:5:6 Author: thecyberexpress.com(查看原文) 阅读量:13 收藏

OpenAI deployed two security features targeting prompt injection attacks that exploit AI systems’ growing connectivity to external networks and applications. Lockdown Mode and Elevated Risk labels, announced last week, represent a shift from relying solely on model training to implementing deterministic infrastructure controls that physically prevent data exfiltration regardless of prompt manipulation.

What You Need to Know About the Lockdown Mode

Lockdown Mode is an optional security setting designed for high-risk users including executives and security teams at prominent organizations who require protection against advanced threats. The feature tightly constrains how ChatGPT interacts with external systems through deterministic restrictions that eliminate attack surfaces prompt injection exploits.

The mode’s core protection mechanism limits web browsing to cached content only. No live network requests leave OpenAI’s controlled network, preventing attackers from tricking ChatGPT into sending sensitive conversation data to external servers. This addresses scenarios where malicious websites contain hidden instructions designed to manipulate AI into exfiltrating confidential information through browsing activity.

Additional restrictions disable capabilities that cannot provide “strong deterministic guarantees of data safety.” ChatGPT responses cannot include images, Deep Research and Agent Mode are disabled, users cannot approve Canvas-generated code for network access, and the system cannot download files for data analysis, though manually uploaded files remain usable.

Workspace administrators on ChatGPT Enterprise, Edu, Healthcare and Teachers plans activate Lockdown Mode by creating specialized roles through Workspace Settings. Admins retain granular control over which apps and specific actions remain available even when Lockdown Mode is engaged. The Compliance API Logs Platform provides visibility into app usage, shared data and connected sources.

OpenAI said Lockdown Mode is not necessary for most users. The feature targets a small subset of highly security-conscious individuals who face elevated targeting risk and handle exceptionally sensitive organizational data. The company plans consumer availability in coming months following the enterprise rollout.

Also read: OpenAI Launches Trusted Access for Cyber to Expand AI-Driven Defense While Managing Risk

All About Elevated Risk Labels

Complementing Lockdown Mode’s preventative approach, Elevated Risk labels provide transparency about features introducing unresolved security vulnerabilities. The standardized labeling system appears across ChatGPT, ChatGPT Atlas and Codex whenever users enable network-related capabilities that may increase exposure.

In Codex, OpenAI’s coding assistant, developers can grant network access so the system can look up documentation or interact with websites. The settings screen now displays an Elevated Risk label explaining what changes when network access is enabled, what threats it introduces and when that access is appropriate. The labels represent educational signals rather than prohibitions, empowering users to make informed decisions about risk acceptance.

OpenAI stated it will remove Elevated Risk labels as security advances mitigate identified threats, and will continue updating which features carry labels to best communicate risk. The dynamic labeling approach acknowledges that some network-related capabilities introduce risks current industry mitigations do not fully address.

The security enhancements build on existing protections including sandboxing, protections against URL-based data exfiltration, monitoring and enforcement systems, and enterprise controls like role-based access and audit logs. The layered approach reflects recognition that as AI systems become more capable and connected, single-point security controls prove insufficient.

An Effort to Negate Prompt Injection Attacks

Prompt injection attacks manipulate AI systems by embedding malicious instructions in external content that conversational models process. When ChatGPT accesses web pages, reads documents or interacts with third-party applications, attackers can hide commands within that content designed to override the system’s intended behavior. Successful attacks extract conversation history, connected app data or sensitive organizational information without user awareness.

The vulnerability stems from language models’ inability to reliably distinguish between legitimate instructions from system prompts and malicious instructions embedded in user-supplied or externally sourced content. Traditional security measures focusing on input validation and output filtering have proven inadequate because sophisticated prompt injection techniques can bypass content-level filters.

OpenAI’s infrastructure-level restrictions in Lockdown Mode sidestep this challenge by physically preventing the actions attackers attempt to trigger. Rather than trusting the model to refuse malicious requests, the system architecture makes those requests impossible to execute regardless of prompt manipulation sophistication.

文章来源: https://thecyberexpress.com/openai-new-lockdown-mode/
如有侵权请联系:admin#unsafe.sh