Random thought: I've long known that police can often extract false confessions of crimes, but I only just now made the connection to the AI box experiment. In both cases you have someone being convinced to say or do something manifestly against their best interest. In fact, if anything I think the false confessions might be an even stronger result, just because of the larger incentives involved. People can literally be persuaded to choose to go to prison, just by some decidedly non-superhuman police officers. Granted, it's all done in person, so stronger psychological pressure can be applied. But still: a false confession! Of murder! Resulting in jail!
I think I have to revise downwards my estimate of how secure humans are.
I can't see why you claim it's a stronger result. In the AI box experiment, the power is entirely in the gatekeeper's hands; in an interrogation situation the suspect is virtually powerless. This distinction is important because even the illusion of having power is enough to make someone less susceptible to persuasion.
Plus, police don't sit down with suspects in a chat room. They use 'enhanced interrogation techniques', methods such as an unfamiliar environment, threat of violence (or actual violence in some cases), and various other threats. An AI cannot do any of this to a gatekeeper unless the gatekeeper explicitly lets it out.
If it's worth saying, but not worth its own post (even in Discussion), then it goes here.