ciphergoth comments on Goal retention discussion with Eliezer - Less Wrong Discussion

56 Post author: MaxTegmark 04 September 2014 10:23PM

Comments (26)

Comment author: [deleted] 05 September 2014 09:37:09AM 0 points

It always seemed to me that this strategy had the fatal flaw that we would not be able to tell if the AI was really already superintelligent and was just playing dumb and telling us what we wanted to hear so that we would let it loose, or if the AI really was just learning.

You could, you know, look inside the machine and see what makes it tick. It's not a black box.

Comment author: ciphergoth 06 September 2014 12:42:48PM 3 points

That seems desirable and perhaps possible, but extremely difficult, especially when you have a superintelligent mind anticipating that you'll do it and trying to work out how to ensure you come away with the wrong impression.