Friendly AI ideas needed: how would you ban porn? — LessWrong

They have preferences like ambiguity aversion, e.g. being willing to pay to find out, during a holiday, whether they were accepted for a job, while knowing that they can't make any relevant decisions with that early knowledge. This is not compatible with following a standard utility function.

I don't know what you mean by "standard" utility function. I don't even know what you mean by "following". We want to find out because uncertainty makes us nervous, being nervous is unpleasant, and pleasure is a terminal value. That is entirely consistent with having a utility function, and with my formalism in particular.

Humans are not ideal rational optimizers of their respective utility functions.

Then why claim that they have one? If humans have intransitive preferences (A>B>C>A), as I often do, then why claim that actually their preferences are secretly transitive but they fail to act on them properly?

In what epistemology are you asking this question? That is, what is the criterion according to which the validity of an answer would be determined?

If you don't think human preferences are "secretly transitive", then why do you suggest the following:

Whenever revealed preferences are non-transitive or non-independent, use the person's stated meta-preferences to remove the issue. The AI thus calculates what the person would say if asked to resolve the transitivity or independence (for people who don't know about the importance of resolving them, the AI would present them with a set of transitive and independent preferences, derived from their revealed preferences, and have them choose among them).

What is the meaning of asking a person to resolve intransitivities if there are no transitive preferences underneath?
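To make the quoted procedure concrete, here is a minimal Python sketch. It is an illustration only, not code from either commenter. The function names `find_cycle` and `transitive_candidates` are hypothetical, and the rule "keep the total orders that contradict the fewest revealed pairs" is an assumption standing in for "derived from their revealed preferences", which the proposal does not specify.

```python
from itertools import permutations


def find_cycle(prefers):
    """Return one preference cycle as a list of options, or None.

    prefers: set of (a, b) pairs meaning "a is strictly preferred to b".
    """
    graph = {}
    for a, b in prefers:
        graph.setdefault(a, set()).add(b)
        graph.setdefault(b, set())

    WHITE, GREY, BLACK = 0, 1, 2  # unvisited, on current path, finished
    color = {node: WHITE for node in graph}
    path = []

    def visit(node):
        color[node] = GREY
        path.append(node)
        for nxt in sorted(graph[node]):
            if color[nxt] == GREY:
                # Back edge: the current path loops back onto itself.
                return path[path.index(nxt):] + [nxt]
            if color[nxt] == WHITE:
                found = visit(nxt)
                if found:
                    return found
        path.pop()
        color[node] = BLACK
        return None

    for node in sorted(graph):
        if color[node] == WHITE:
            found = visit(node)
            if found:
                return found
    return None


def transitive_candidates(prefers):
    """Strict total orders (best option first) that contradict the fewest
    revealed pairs. The minimal-violation rule is an assumption of this sketch.
    """
    options = sorted({x for pair in prefers for x in pair})
    scored = []
    for order in permutations(options):
        rank = {option: i for i, option in enumerate(order)}
        violations = sum(1 for a, b in prefers if rank[a] > rank[b])
        scored.append((violations, order))
    best = min(v for v, _ in scored)
    return [order for v, order in scored if v == best]


revealed = {("A", "B"), ("B", "C"), ("C", "A")}  # the A>B>C>A example
print(find_cycle(revealed))
print(transitive_candidates(revealed))
```

For the A>B>C>A example, three different orders tie at one violation each, so under this assumption the revealed data alone do not single out a transitive ordering; something further, such as the person's own choice, has to break the tie.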

Stuart_Armstrong, 12y

I don't even know what you mean by "following".

That is, what is the criterion according to which the validity of an answer would be determined?

Those are questions for you, not for me. You're claiming that humans have a hidden utility function. What do you mean by that, and what evidence do you have for your position?