paulfchristiano comments on Utility vs Probability: idea synthesis - Less Wrong

4 Post author: Stuart_Armstrong 27 March 2015 12:30PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (15)

You are viewing a single comment's thread. Show more comments above.

Comment author: paulfchristiano 28 October 2015 07:13:43PM 2 points [-]

Again, can you describe any case where the two proposals do anything different?

Given that they do the same thing in every case, it seems highly unlikely that one is safe and the other is dangerous! At best you are obfuscating the problem.

For example, if this really dodges any failures associated with giving the AI false beliefs, that should give you an example where the two proposals do something different.

Comment author: Stuart_Armstrong 29 October 2015 06:57:26PM 0 points [-]

Now that I've rested a bit, let me think about this properly. One reason I was wary of changing probability was because of all the related other probabilities - conditional probabilities, AND and OR expressions, and so on. Changing one probability would have to keep the rest consistent, while changing utility had consistency built in.

It feels like changing a prior might be equivalent. I'm not sure that there is any difference between changing a prior and changing the utility. But, again, there might be some consistency worries to think about - eg how do we change priors over correlations between events, and so on? It still seems that changing probability involves many choices while changing the utility doesn't (it seems equivalent with finding a Bayes factor that provides evidence for that specific event?)

I will think more.

Comment author: paulfchristiano 30 October 2015 03:03:12PM 2 points [-]

The normal way of modifying a probability distribution to make X more likely is to increase the probability of each world where X is true, e.g. by doubling it. This is equivalent to observing evidence for X. It's also equivalent to your procedure for modifying utility functions.