Michaelos comments on I attempted the AI Box Experiment (and lost) - Less Wrong

47 Post author: Tuxedage 21 January 2013 02:59AM

Comments (244)

Comment author: MugaSofer 23 January 2013 02:57:57PM -1 points

I also note there doesn't appear to be any distinguishing factor that allows you to win better than chance odds, but I haven't actually played a lot of Mafia before, so I may just be unfamiliar with the strategies involved.

Well, it's usually played in person, and humans (usually) aren't perfect liars.

Your proposed game has one flaw - there is an FAI, and they want to help you win. It might be closer to have only two players, with the AI flipping a coin to decide whether it's Friendly - but then the Gatekeeper would win half the time simply by letting it out, which seems unrealistic.

Perhaps the AI decides, in character, after being released, whether to be Friendly towards the human? Then the Gatekeeper could try to persuade the AI that Friendliness is optimal for their goals. The temptation might help as well, of course.

Comment author: [deleted] 23 January 2013 03:53:50PM 1 point

I tried coming up with a more isomorphic game in another reply to you. Let me know if you think it models the situation better.