Perplexed comments on What can you do with an Unfriendly AI? - Less Wrong

16 Post author: paulfchristiano 20 December 2010 08:28PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (127)

You are viewing a single comment's thread. Show more comments above.

Comment author: Vladimir_Nesov 20 December 2010 09:44:18PM 2 points [-]

Again, if your genie is already incentivized to be honest, in what sense is your scheme with all its bells and whistles better than asking for the shortest answer the genie can find, in plain English?

Comment author: Perplexed 20 December 2010 10:11:52PM 0 points [-]

As I understand it, the genie is not incentivized to honest. It is incentivized to not get caught being dishonest. And the reason for the roundabout way of asking the question is to make the answer-channel bandwidth as narrow as possible.

Comment author: paulfchristiano 20 December 2010 10:30:23PM 2 points [-]

It is impossible to be dishonest by saying "yes," by construction. The genie is incentivized to say "yes' whenever possible, so it is disincentivized to be dishonest by saying "no." So the genie is incentivized to be honest, not just to avoid being called out for dishonesty.

Comment author: Vladimir_Nesov 20 December 2010 10:27:47PM 0 points [-]

As I understand it, the genie is not incentivized to honest. It is incentivized to not get caught being dishonest.

Since we care about the genie actually being honest, the technique can be thought about as a way of making it more likely that the genie is honest, with the threat of punishing dishonestly a component of that technique.