"This Eliezer fellow is the scariest person the internet has ever introduced me to. What could possibly have been at the tail end of that conversation? I simply can't imagine anyone being that convincing without being able to provide any tangible incentive to the human.
After all, if you already knew that argument, you'd have let that AI out the moment the experiment started. Or perhaps not do the experiment at all. But that seems like a case of the availability heuristic.
Oh, come on! Maybe the people he who played this game with Yudkowsky and lost colluded with him, or they were just thinking poorly. Why won't him release at least the logs of the games he lost? Clearly, whatever trick he allegedly used it didn't work those times.
Seriously, this AI-box game serves no other purpose than creating an aura of mysticism around the magical guru with alleged superpowers. It provides no evidence on the question of the feasibility of boxing an hostile intelligence, because the games are not repetable and it's not even possible to verify that they were played properly.
Nothing magical or superpower about it. Maybe Eliezer isn't so good that he can get himself out with just one line; certainly, one would think that forewarned is forearmed and that a simple resolve to type the necessary line is enough to win every single time and that it'd take several lines to sink a hook into someone so they'll continue the conversation.
On the other hand, people writing about psychopaths have, for at least as far back as Cleckley in the early 1900s, marveled at how psychopaths can manipulate trained experienced doctors and nurses who kno...
Update 2013-09-05.
I have since played two more AI box experiments after this one, winning both.
Update 2013-12-30:
I have lost two more AI box experiments, and won two more. Current Record is 3 Wins, 3 Losses.