
interstice comments on Help needed: nice AIs and presidential deaths - Less Wrong Discussion

1 Post author: Stuart_Armstrong 08 June 2015 04:47PM


Comments (24)


Comment author: interstice 08 June 2015 09:42:02PM 1 point

How about asking the AI: "If you were to ask a counterfactual version of yourself, one that lives in a world where the president died, what would it advise you to do?" This counterfactual AI is motivated to take nice actions, so it would advise the real AI to take nice actions as well, right?

Comment author: Stuart_Armstrong 09 June 2015 09:20:22AM 0 points

This counterfactual AI is motivated to take nice actions in worlds where the president died. It might not even know what "nice" means in other worlds.

Comment author: rikisola 08 July 2015 09:08:52AM 0 points

And even if it knew the correct answer to that question, how could you be sure it wouldn't lie to you instead, in order to achieve its real goals? You can't really trust the AI unless you are sure it is nice, or at least indifferent...