JamesAndrix comments on Fusing AI with Superstition - Less Wrong
You are viewing a comment permalink. View the original post to see all comments and the full post content.
You are viewing a comment permalink. View the original post to see all comments and the full post content.
Comments (75)
Unfriendly AI is only vastly more plausible if you're not doing it right. Out of the space of all possible preferences, human friendly preferences are a tiny sliver. If you picked at random you would surely get something as bad as a paperclipper.
As optimizers, we can try to aim at the space of human friendly preferences, but we're stupid optimizers in this domain and compared to the complexity of this problem. A program could better target this space, and we are much much more likely to be smart enough to write that program, than to survive the success of an AI based on hand coded goals and killswitches.
This is like going to the moon: Let the computer steer.