Is friendly AI "trivial" if the AI cannot rewire human values?

Alerus

Is friendly AI "trivial" if the AI cannot rewire human values? — LessWrong

Assuming humans don't want the AI to make new people who are simply easier to maximize, if it created a new person, all people on Earth would view this negatively and their well-being would drop.

I'm not sure how common it is, but I at least consider total well-being to be important. The more people the better. The easier it is to make these people happy, the better.

Indeed, it's difficult to say precisely; that's why I used what we can do now as an analogy. I can't really rewire a person's values at all except through persuasion or other such methods.

An AI is much better at persuasion than you are. It would pretty much be able to convince you of whatever it wants.

Even our best neuroscientists can't do that, unless I'm ignorant of some profound advances.

Our best neuroscientists are still mere mortals. Also, even among mere mortals, making small changes to someone's values is not difficult, and I don't think significant changes are impossible. For example, the consumer diamond industry would be virtually nonexistent if De Beers hadn't convinced people to want diamonds.

ArisKatsaris (14y)

The more people the better.

The more people in what? At any particular moment in time? Over the complete timeline of any given Everett branch? Across the whole multiverse?

Between an Everett branch of 10 billion people, and ten Everett branches of 1 billion people each, which do you prefer?

Between 10 billion people that live in the same century, and one billion people per century over a span of ten centuries, which do you prefer?

Alerus (14y)

You must also consider that well-being need not be defined as a positive function. Even if it were, if the gain from adding a person was less than the drop in well-being of others, it wouldn't be beneficial unless the AI were able, unimpeded, to create many more such people. [...] I'm sure it'd be better than me (unless I'm also heavily augmented by technology, but we can avoid that issue for now). On what grounds can you say that it'd be able to persuade me of anything it wants? Intelligence doesn't mean you can do anything, and I think this needs to be justified. [...] I know they're mere mortals. We're operating under the assumption that the AI's methods of value manipulation are limited to what we can do ourselves, in which case rewiring is not something we can do with any great effect. The point of the assumption is to ask what the AI could do without more direct manipulation. To that end, only persuasion has been offered and, as I've stated, I'm not seeing a compelling argument for why an AI could persuade anyone of anything.
