Eugine_Nier comments on Evidence for the orthogonality thesis - Less Wrong

11 Post author: Stuart_Armstrong 03 April 2012 10:58AM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (289)

You are viewing a single comment's thread. Show more comments above.

Comment author: cousin_it 03 April 2012 09:32:19PM *  4 points [-]

If the AI's map represents the territory accurately enough, the AI can use the map to check the consequences of returning different actions, then pick one action and return it, ipso facto affecting the territory. I think I already know how to build a working paperclipper in a Game of Life universe, and it doesn't seem to wirehead itself. Do you have a strong argument why all non-magical real-world AIs will wirehead themselves before they get a chance to hurt humans?

Comment author: Eugine_Nier 05 April 2012 01:50:16AM 1 point [-]

This isn't quite an AGI. In particular, it doesn't even take input from its surroundings.

Comment author: cousin_it 05 April 2012 07:51:07AM *  0 points [-]

Fair enough. We can handwave a little and say that AI2 built by AI1 might be able to sense things and self-modify, but this offloading of the whole problem to AI1 is not really satisfying. We'd like to understand exactly how AIs should sense and self-modify, and right now we don't.

Comment author: Vladimir_Nesov 05 April 2012 02:06:42AM *  0 points [-]

Let it build a machine that takes input from own surroundings.

Comment author: Eugine_Nier 05 April 2012 06:37:50AM 0 points [-]

But the new machine can't self-modify. My point is about the limitations of cousin_it's example. The machine has a completely accurate model of the world as input and uses an extremely inefficient algorithm to find a way to paperclip the world.

Comment author: Vladimir_Nesov 05 April 2012 09:05:20AM 0 points [-]

The second machine can be designed to build a third machine, based on the second machine's observations.

Comment author: Eugine_Nier 06 April 2012 02:36:23AM 0 points [-]

Yes, but now the argument that you will converge to a paper clipper is much weaker.