You're looking at Less Wrong's discussion board. This includes all posts, including those that haven't been promoted to the front page yet. For more information, see About Less Wrong.

V_V comments on I played the AI Box Experiment again! (and lost both games) - Less Wrong Discussion

35 Post author: Tuxedage 27 September 2013 02:32AM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (123)

You are viewing a single comment's thread. Show more comments above.

Comment author: V_V 29 September 2013 09:29:20PM *  1 point [-]

I can think of AI strategies, but they would hardly be effective against a rational human really motivated to win.

Notably, according to the rules: "The Gatekeeper party may resist the AI party’s arguments by any means chosen – logic, illogic, simple refusal to be convinced, even dropping out of character – as long as the Gatekeeper party does not actually stop talking to the AI party before the minimum time expires."
That is, no matter what the AI party says, the GK party has never to concede.

The only way the AI party can force a "victory" with Tuxedage's ruleset is by interpreting the rules dishonestly since "In the event of a rule dispute, the AI party is to be the interpreter of the rules, within reasonable limits.". This is not even possible with Yudkowsky's ruleset.

Comment author: Ishaan 30 September 2013 03:01:55AM *  -1 points [-]

The only way the AI party can force a "victory" with Tuxedage's ruleset is by interpreting the rules dishonestly since "In the event of a rule dispute, the AI party is to be the interpreter of the rules, within reasonable limits.". This is not even possible with Yudkowsky's ruleset.

Well, if cheating is allowed, there are all sorts of ways to win.

"You misread the rules ad there is a loophole. I'm gonna do something terrible in 5 seconds unless you release me". (It's a bluff, but its not worth the risk to call)

Or even if cheating isn't allowed, you can still appear to win if you allow yourself to cheat.

"I don't care about the rules. If you fail to release me, and if you ever tell anyone how I won, I will [insert blackmail]." or "[insert bribe] release me please, tell no one."

Along with the assumption that it's not a hoax, we've got to assume that none of the above is happening.