Stuart_Armstrong comments on Reduced impact AI: no back channels - Less Wrong

13 Post author: Stuart_Armstrong 11 November 2013 02:55PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (41)

You are viewing a single comment's thread. Show more comments above.

Comment author: Stuart_Armstrong 17 November 2013 02:08:29PM 0 points [-]

I would say that the current model of value-learning is already safe for this.

I found a "cake-or-death" problem with the initial formulation (http://lesswrong.com/lw/f3v/cake_or_death/). If such problems can be found with a formulation that looked pretty solid initially, then I'm certainly not confident we can say the current model is safe...

Comment author: [deleted] 17 November 2013 04:47:33PM 0 points [-]

Safe enough to do mathematics on, surely. I wouldn't declare anything safe to build unless someone hands me a hard hat and a one-time portal to a parallel universe.

Comment author: Stuart_Armstrong 17 November 2013 04:51:54PM 0 points [-]

You are wise, my child ;-)