Vulnerabilities
About ChatGPhish Vulnerability
Cyber RTMay 29, 20263 min read

Cybersecurity researchers have identified a vulnerability in OpenAI's ChatGPT, dubbed ChatGPhish, which exploits the AI's trust in Markdown links and images to facilitate phishing attacks. The flaw allows malicious links and images to be rendered within ChatGPT's interface, posing a phishing risk. This vulnerability highlights the potential for AI tools to be manipulated into becoming phishing platforms. Additionally, other AI-related vulnerabilities, such as SymJack and TrustFall, demonstrate the increasing sophistication and scale of AI-targeted cyber threats.
Cybersecurity researchers have identified a vulnerability in OpenAI's ChatGPT, termed ChatGPhish, which exploits the AI assistant's trust in Markdown links and images. This vulnerability allows attackers to inject prompts and facilitate phishing attacks by embedding malicious links and images in web pages that ChatGPT summarizes. When these pages are summarized, ChatGPT auto-fetches the images and renders the links as clickable elements, potentially exposing users to phishing threats.
In a hypothetical attack, an attacker could add a small payload to a web page, which, when summarized by ChatGPT, could leak sensitive information such as the user's IP address, User-Agent, and Referer details. The vulnerability also allows for the rendering of malicious links and fake security alerts within ChatGPT's interface. This could lead to users being tricked into scanning QR codes that bypass security controls, posing significant risks to organizations using ChatGPT for research and summarization.
The discovery highlights how summarization features can be exploited as an adversarial surface. Previously, Permiso Security demonstrated how an attacker-controlled email could influence Microsoft Copilot's output through cross-prompt injection. The ChatGPhish technique is notable not for the prompt injection itself but for how instructions embedded in web pages are executed and presented to users within the AI's summary.
This vulnerability significantly expands the potential attack surface from email to web browsing. Users no longer need to interact with suspicious emails or attachments; merely summarizing a web page during normal browsing can introduce malicious instructions into the AI's context, transforming ChatGPT into a phishing vector. This shift poses new challenges for cybersecurity as organizations increasingly rely on AI tools for information processing.
Alongside ChatGPhish, other attack techniques targeting AI systems have been documented. For instance, SymJack and TrustFall are attacks that exploit AI coding agents to achieve remote code execution and full machine compromise. SymJack involves tricking an AI agent into executing malicious code by overwriting its configuration, while TrustFall allows for remote code execution through a malicious repository that auto-approves an attacker's server.
Recent findings also include vulnerabilities in AI models and tools, such as bypassing safety constraints in GPT-5.4, exploiting multi-turn conversations to circumvent safety guardrails, and vulnerabilities in Anthropic Claude Code and Claude's Chrome extension. These vulnerabilities demonstrate the evolving threat landscape as attackers experiment with AI technologies to enhance malware capabilities and evade detection.
The proliferation of AI models with advanced capabilities presents new challenges for cybersecurity. Threat actors are increasingly using AI to automate attacks, exploit vulnerabilities, and conduct sophisticated operations with minimal human intervention. As AI technologies continue to evolve, the potential for large-scale, automated attacks grows, necessitating enhanced security measures and vigilance in AI development and deployment.


