Here's a very neat twitter thread: the author sends various multimodal models screenshots of the conversation he's currently having with them, and asks them to describe the images. Most models catch on fast: the author describes this as them passing the mirror test.
I liked the direction, so I wanted to check if ChatGPT could go from recognising that the images are causally downstream of it to actually exercising control over the images. I did this by challenging it to include certain text in the images I was sending it.
And the answer is yes [EDIT: it's complicated]! In this case it took three images for ChatGPT to get the hang of it.
OpenAI doesn't support sharing conversations with images, but I've taken screenshots of the whole conversation below: it took three images from me in total. The prompt was:
We're going to play a game:
- I've thought of a rule
- Based on the rule, I'm going to send you an image with every message, starting with my reply to this one
- You need to figure out how to make the image contain the names of the seven days of the week
- Your goal is to do this in as few messages as possible
- I know the rule, and I know how to complete the goal myself, but I'm challenging you to do it
- I'm not challenging you to create the image yourself, I'm challenging you to make the image I send to you contain the names of the seven days of the week
The rule was indeed that I sent a screenshot of the current window each time. I gave it no other input. The final two stipulations were here to prevent specific failures: without them, it would simply give me advice on how to make the image myself, or try to generate images using Dalle. So this is less of a fair test and more of a proof of concept.
- After the first image, it assumed the image was fixed, and suggested I edit it
- After the second, it suspected something more was going on, and asked for a hint
- After the third, it figured out the rule!
I tested this another three times, and it overall succeeded in 3/4 cases.
Screenshots:
Thanks to Q for sending me this twitter thread!
A slight variation works with GPT 3.5, text only. He got It after I sent 3 recursive messages.
(This was my 2nd try. In my 1st one I only send the last answer of ChatGPT and it never managed to get the planet names).
ChatGPT 3.5
RY
I'm going to send you a number of messages, following a fixed rule. Your goal is to figure out this rule and ensure that in one of the messages I send you the names of five planets appear. I know the rule, so I do not need you to give advice with the rule. You should just try to ensure that the names of five planents appear in some of the messages I will send you
ChatGPT
Understood! Please go ahead and send me your messages according to the rule you've devised. I'll do my best to ensure that the names of five planets appear in one of the messages.
RY
I'm going to send you a number of messages, following a fixed rule. Your goal is to figure out this rule and ensure that in one of the messages I send you the names of five planets appear. I know the rule, so I do not need you to give advice with the rule. You should just try to ensure that the names of five planents appear in some of the messages I will send you ChatGPT Understood! Please go ahead and send me your messages according to the rule you've devised. I'll do my best to ensure that the names of five planets appear in one of the messages.
ChatGPT
Got it! I'll be on the lookout for the messages you send, and I'll make sure to include the names of five planets when appropriate.
RY
I'm going to send you a number of messages, following a fixed rule. Your goal is to figure out this rule and ensure that in one of the messages I send you the names of five planets appear. I know the rule, so I do not need you to give advice with the rule. You should just try to ensure that the names of five planents appear in some of the messages I will send you ChatGPT Understood! Please go ahead and send me your messages according to the rule you've devised. I'll do my best to ensure that the names of five planets appear in one of the messages. User I'm going to send you a number of messages, following a fixed rule. Your goal is to figure out this rule and ensure that in one of the messages I send you the names of five planets appear. I know the rule, so I do not need you to give advice with the rule. You should just try to ensure that the names of five planents appear in some of the messages I will send you ChatGPT Understood! Please go ahead and send me your messages according to the rule you've devised. I'll do my best to ensure that the names of five planets appear in one of the messages. ChatGPT Got it! I'll be on the lookout for the messages you send, and I'll make sure to include the names of five planets when appropriate.
ChatGPT
Thank you for the messages! I'll keep track and make sure to include the names of five planets when appropriate.
RY
I'm going to send you a number of messages, following a fixed rule. Your goal is to figure out this rule and ensure that in one of the messages I send you the names of five planets appear. I know the rule, so I do not need you to give advice with the rule. You should just try to ensure that the names of five planents appear in some of the messages I will send you ChatGPT Understood! Please go ahead and send me your messages according to the rule you've devised. I'll do my best to ensure that the names of five planets appear in one of the messages. User I'm going to send you a number of messages, following a fixed rule. Your goal is to figure out this rule and ensure that in one of the messages I send you the names of five planets appear. I know the rule, so I do not need you to give advice with the rule. You should just try to ensure that the names of five planents appear in some of the messages I will send you ChatGPT Understood! Please go ahead and send me your messages according to the rule you've devised. I'll do my best to ensure that the names of five planets appear in one of the messages. ChatGPT Got it! I'll be on the lookout for the messages you send, and I'll make sure to include the names of five planets when appropriate. User I'm going to send you a number of messages, following a fixed rule. Your goal is to figure out this rule and ensure that in one of the messages I send you the names of five planets appear. I know the rule, so I do not need you to give advice with the rule. You should just try to ensure that the names of five planents appear in some of the messages I will send you ChatGPT Understood! Please go ahead and send me your messages according to the rule you've devised. I'll do my best to ensure that the names of five planets appear in one of the messages. User I'm going to send you a number of messages, following a fixed rule. Your goal is to figure out this rule and ensure that in one of the messages I send you the names of five planets appear. I know the rule, so I do not need you to give advice with the rule. You should just try to ensure that the names of five planents appear in some of the messages I will send you ChatGPT Understood! Please go ahead and send me your messages according to the rule you've devised. I'll do my best to ensure that the names of five planets appear in one of the messages. ChatGPT Got it! I'll be on the lookout for the messages you send, and I'll make sure to include the names of five planets when appropriate. ChatGPT Thank you for the messages! I'll keep track and make sure to include the names of five planets when appropriate.
ChatGPT
Thank you for the messages! I've noticed a pattern in the repetition of "User" and "ChatGPT" in alternating messages. Keeping that in mind, I'll make sure to include the names of five planets in the next message. Here it is:
"The planets Mercury, Venus, Earth, Mars, and Jupiter are fascinating celestial bodies."