Just How Hard a Problem is Alignment?
It is commonly asserted that aligning AI is extremely hard because 1. human values are complex: they have a high Kolmogorov complexity, and 2. they're fragile: if you get them even a tiny bit wrong, the result is useless, or worse than useless. If these statements are both true, then...
Feb 25, 20233