This vulnerability hacks a feature that allows ChatGPT to have long-term memory, where it uses information from past conversations to inform future conversations with that same user. A researcher found that he could use that feature to plant “false memories” into that context window that could subvert the model.

A month later, the researcher submitted a new disclosure statement. This time, he included a PoC that caused the ChatGPT app for macOS to send a verbatim copy of all user input and ChatGPT output to a server of his choice. All a target needed to do was instruct the LLM to view a web link that hosted a malicious image. From then on, all input and output to and from ChatGPT was sent to the attacker’s website.

Leave a Reply

Your email address will not be published. Required fields are marked *

Explore More

New Attack Against Self-Driving Car AI

May 10, 2024 0 Comments 0 tags

This is another attack that convinces the AI to ignore road signs: Due to the way CMOS cameras operate, rapidly changing light from fast flashing diodes can be used to

Take a Selfie Using a NY Surveillance Camera

August 23, 2024 0 Comments 0 tags

This site will let you take a selfie with a New York City traffic surveillance camera.

EPA ‘urgently’ needs to step up cybersecurity assistance for the water sector, GAO says

August 1, 2024 0 Comments 0 tags

The Environmental Protection Agency is falling far behind on some of the basic duties that come with its responsibilities as the federal lead for helping the water and wastewater sector