Locke comments on AI Box Role Plays - Less Wrong

5 Post author: lessdazed 22 January 2012 07:11PM

Comments (49)

Comment author: Locke 22 January 2012 08:54:46PM * 5 points

I'm always intrigued by these experiments. If the box AI is not confirmed to be friendly, everything it says and promises is absolutely unreliable. I don't see how the arguments of such an entity could be at all convincing.

Comment author: Jonathan_Graehl 22 January 2012 09:52:11PM 1 point

Good point.

But if you knew anything about the process leading up to the development of successful AI, you'd have some beliefs about how likely the AI is to perpetrate a ruse for the purpose of escaping.

But I get the difficulty: how well do you have to understand a being's nature before you feel confident in predicting its motivations/values?

Comment author: Locke 22 January 2012 11:50:26PM 0 points

So the key to containing an AI is to have a technologically ignorant rationalist babysit it?

Comment author: cata 22 January 2012 09:27:30PM * 0 points

No more unreliable than the things humans say, which nonetheless convince you.

Comment author: Jonathan_Graehl 22 January 2012 09:53:03PM 4 points

Important difference: we can assume that other humans are probably like us.