Vladimir_Nesov comments on Let's reimplement EURISKO! - Less Wrong

19 Post author: cousin_it 11 June 2009 04:28PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (151)

You are viewing a single comment's thread. Show more comments above.

Comment author: JulianMorrison 15 June 2009 01:59:08PM -1 points [-]

BTW, none of the above classifications are "friendly".

Comment author: Vladimir_Nesov 15 June 2009 02:09:55PM 0 points [-]

Good AI is a category containing Friendly AI, that doesn't require the outcome to be precisely right. This separates more elaborated concept of Friendly AI from an informal concept (requirement) of good outcome.

I believe the concepts are much more close than it seems, that is it's hard to construct an AI that is not precisely Friendly, but still Good.

Comment author: JulianMorrison 15 June 2009 08:19:31PM 0 points [-]

FAI is about being reliably harmless. Whether the outcome seems good in the short term is tangential. Even a "good" AI ought to be considered unfriendly if it's opaque to proof - what can you possibly rely upon? No amount of demonstrated good behavior can be trusted. It could be insincere, it could be sincere but fatally misguided, it could have a flaw that will distort its goals after a few recursions. We would be stupid to just run it and see.

Comment author: Vladimir_Nesov 15 June 2009 09:01:42PM 0 points [-]

At which point you are starting to think of what it takes to make not just informally "Good" AI, but an actually Friendly AI.