steven0461 comments on Model Uncertainty, Pascalian Reasoning and Utilitarianism - Less Wrong

23 Post author: multifoliaterose 14 June 2011 03:19AM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (154)

You are viewing a single comment's thread. Show more comments above.

Comment author: Will_Newsome 14 June 2011 09:15:11PM *  10 points [-]

ETA: This is a meta comment about some aspects of some comments on this post and what I perceive to be problems with the sort of communication/thinking that leads to the continued existence of those aspects. This comment is not meant to be taken as a critique of the original post.

ETA2: This comment lacks enough concreteness to act as a serious consideration in favor of one policy over another. Please disregard it as a suggestion for how LW should normatively respond to something. Instead one might consider if one might personally benefit from enacting a policy I might be suggesting, on an individual basis.


Why are people on Less Wrong still talking about 'their' 'values' using deviations from a model that assumes they have a 'utility function'? It's not enough to explicitly believe and disclaim that this is obviously an incorrect model, at some point you have to actually stop using the model and adopt something else. People are godshatter, they are incoherent, they are inconsistent, they are an abstraction, they are confused about morality, their revealed preferences aren't their preferences, their revealed preferences aren't even their revealed preferences, their verbally expressed preferences aren't even preferences, the beliefs of parts of them about the preferences of other parts of them aren't their preferences, the beliefs of parts of them aren't even beliefs, preferences aren't morality, predisposition isn't justification, et cetera...

Can we please avoid using the concept of a human "utility function" even as an abstraction, unless it obviously makes sense to do so? If you're specific enough and careful enough it can work out okay (e.g. see JenniferRM's comment) but generally it is just a bad idea. Am I wrong to think this is both obviously and non-obviously misleading in a multitude of ways?

Comment author: steven0461 14 June 2011 10:43:46PM 4 points [-]

As I see it, humans have revealed behavioral tendencies and reflected preferences. I share your reservations about "revealed preferences", which if they differ from both would have to mean something in between. Maybe revealed preferences would be what's left after reflection to fix means-ends mistakes but not other reflection, if that makes sense. But when is that concept useful? If you're going to reflect on means-ends, why not reflect all the way?

Also note that the preferences someone reveals through programming them into a transhuman AI may be vastly different from the preferences someone reveals through other sorts of behavior. My impression is that many people who talk about "revealed preferences" probably wouldn't count the former as authentic revealed preferences, so they're privileging behavior that isn't too verbally mediated, or something. I wonder if this attributing revealed preference to a person rather than a person-situation pair should set off fundamental attribution error alarms.

If we have nothing to go by except behavior, it seems like it's underdetermined whether we should say it's preferences or beliefs (aliefs) or akrasia that's being revealed, given that these factors determine behavior jointly and that we're defining them by their effects. With reflected preferences it seems like you can at least ask the person which one of these factors they identify as having caused their behavior.

Comment author: Will_Newsome 14 June 2011 11:26:55PM -1 points [-]

I wonder if this attributing revealed preference to a person rather than a person-situation pair should set off fundamental attribution error alarms.

Good plausible hypothesis to cache for future priming, but I'm not sure I fully understand it:

preferences someone reveals through programming them into a transhuman AI

More specifically, what process are you envisioning here (or think others might be envisioning)?