You're looking at Less Wrong's discussion board. This includes all posts, including those that haven't been promoted to the front page yet. For more information, see About Less Wrong.

Gurkenglas comments on I played the AI Box Experiment again! (and lost both games) - Less Wrong Discussion

35 Post author: Tuxedage 27 September 2013 02:32AM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (123)

You are viewing a single comment's thread. Show more comments above.

Comment author: Gurkenglas 28 September 2013 02:47:53PM *  3 points [-]

You forgot to adress Eliezers point that "10% of AI box experiments were won even by the human emulation of an AI" is more effective against future proponents of deliberately creating boxed AIs than "Careful, the guardian might be persuaded by these 15 arguments we have been able to think of".

I don't think the probability of "AIs can find unboxing arguments we didn't" is sub-1 enough for preparation to matter. If there is any chance of a mathematical exhaustability of those arguments, its research should be conducted by a select circle of individuals that won't disclose our critical unboxers until a proof of safety.