From the last thread:
From Costanza's original thread (entire text):
"This is for anyone in the LessWrong community who has made at least some effort to read the sequences and follow along, but is still confused on some point, and is perhaps feeling a bit embarrassed. Here, newbies and not-so-newbies are free to ask very basic but still relevant questions with the understanding that the answers are probably somewhere in the sequences. Similarly, LessWrong tends to presume a rather high threshold for understanding science and technology. Relevant questions in those areas are welcome as well. Anyone who chooses to respond should respectfully guide the questioner to a helpful resource, and questioners should be appropriately grateful. Good faith should be presumed on both sides, unless and until it is shown to be absent. If a questioner is not sure whether a question is relevant, ask it, and also ask if it's relevant."
Meta:
- How often should these be made? I think one every three months is the correct frequency.
- Costanza made the original thread, but I am OpenThreadGuy. I am therefore not only entitled but required to post this in his stead. But I got his permission anyway.
Meta:
- I still haven't figured out a satisfactory answer to the previous meta question of how often these should be made. It was requested that I make a new one, so I did.
- I promise I won't quote the entire previous threads from now on. Blockquoting in articles only goes one level deep, anyway.
I didn't say that it was. Rather, I pointed out the difference between morality and Friendliness.
For an AI to be Friendly towards everyone requires not moral realism, but "Friendliness realism" - the idea that a single behavior of the AI can satisfy everyone. This is clearly false if "everyone" means "all intelligences, including aliens, other AIs, etc." It may be true if we restrict ourselves to "all humans" (and stop humans from diversifying too much, and don't include hypothetical or far-past humans).
Personally, I believe the burden of proof is on those who think this is possible. My prior for "all humans" says they are a very diverse and selfish bunch, not going to be satisfied by any one arrangement of the universe.
Regardless, moral realism and friendliness realism are different. If you built an objectively moral but unFriendly AI, that's the scenario I discussed in my previous comment - and people would be unhappy. OTOH, if you think a Friendly AI is by logical necessity a moral one (under moral realism), that's a very strong claim about objective morals - a claim that people would perceive an AI implementing objective morals as Friendly. This is a far stronger claim than that people who are sufficiently educated and exposed to the right knowledge will come to agree with certain universal objective morals. A Friendly AI means one that is Friendly to people as they really are, here and now. (As I said, to me it seems very likely that an AI cannot in fact be Friendly to everyone at once.)
I think we are simply having a definitional dispute. As the term is used generally, moral realism doesn't mean that each agent has a morality, but that there are facts about morality that are external to the agent (i.e. objective). Now, "objective" is not identical to "universal," but in practice, objective facts tend to cause convergence of beliefs. So I think what I am calling "moral realism" is something like what you are calling "Friendliness realism."
Lengthening the inferential distance further is that realism i...