You're looking at Less Wrong's discussion board. This includes all posts, including those that haven't been promoted to the front page yet. For more information, see About Less Wrong.

Carinthium comments on The idiot savant AI isn't an idiot - Less Wrong Discussion

8 Post author: Stuart_Armstrong 18 July 2013 03:43PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (133)

You are viewing a single comment's thread. Show more comments above.

Comment author: Carinthium 18 July 2013 04:52:34PM 1 point [-]

What I'm saying is a bit different from CEV- it would involve modelling only a single's human's preferences, and would involve modelling their brain only in the short term (which would be a lot easier). Human beings have at least reasonable judgement with things such as, say, a paperclip factory, to the point where human will calling the shots will have no consequences that are too severe.

Comment author: Stuart_Armstrong 18 July 2013 05:04:56PM 2 points [-]

Specifying that kind of thing (including specifying preference) is probably almost as hard as getting the AI's motivations right in the first place.

Though Paul Christiano had some suggestions along those lines, which (in my opinion) needed uploads (human minds instantiated in a computer) to have a hope of working...

Comment author: kmgroove 18 July 2013 05:52:26PM 0 points [-]

Would a human be bound to "at least reasonable judgement" if given super intelligent ability?

Comment author: Carinthium 18 July 2013 05:56:33PM -1 points [-]

We should remember that we aren't talking about true Friendly AI here, but AI in charge of lesser tasks such as, in the example, running a factory. There will be many things the AI doesn't know because it doesn't need to, including how to defend itself against being shut down (I see no logical reason why that would be necessary for running a paperclip factory). Combining that with the limits on intelligence necessary for such lesser tasks, and failure modes become far less likely.

Comment author: ikrase 18 July 2013 05:27:16PM -1 points [-]

THat's sort of similar to what I keep talking about w/ 'obedient AI'.