I'm glad to see a post on alignment asking about the definition of human values. I propose the following conundrum. Let's suppose that humans, if ask, say they value a peaceful, stable society. I accept the assumption the human mind contains one or more utility optimizers. I point out that the utility optimizers are likely to operate at individual, family, or local group levels, while the stated "value" has to do with society at large. So humans are not likely "optimizing" on the same scope as they "value".
This leads to game t...
Thank you - the best of many good lesswrong posts. I am currently trying to figure out what to tell my 9-year old son. But your letter could "almost" have been written to myself. I'm not in whichever bay area (Seattle? SanFran?). I worked for NASA and it is also called the bay area here. Very much success is defined by others. Deviating from that produces accolades at first, even research dollars, but finally the "big machine" moves in a different direction way over your head and its for naught.
My son asked point ...
I do research in cooperation and game theory, including some work on altruism, and also some hard science work. Everyone looks at the Rorschach blot of human behavior and sees something different. Most of the disagreements have never been settled. Even experiment does not completely settle them.
My experience from having children and observing them in the first few months of life is more definitive. They come with values and personal traits that are not very malleable, and not directly traceable to parents. Sometimes gra...
A related phenomenon, which I have encountered in life but not in systematic research, is that an exceptionally valuable turn is treated as a last turn, and someone will defect. This was evident in at least two states during the tobacco lawsuits. In Texas, the attorney general went to jail for cheating. In Mississippi, where some relatives of mine were on the legal team, one of the lawyers tried to claim all the credit, to the extent they got involved in a separate lawsuit against each other, and felt more animosity than against the tobac...
Thank you for your clear and utterly honest comment on the idea of "alignment with human values". If truly executed, we should not expect anything but an extension of human rights and wrongs, perhaps on an accelerated scale.
Any other alignment must be considered speculative, since we have no reasonable facsimile of society upon which to test. That does not invalidate simulations, but just suggests they be held in skepticism until proven in society, which could be costly. Before I ever started discussions with AIs that might lead to sentie... (read more)