I understood Omohundro's Basic AI Drives as applying only to successful (although not necessarily Friendly) GAI. If a recursively self-improving GAI had massive value drift with each iterative improvement to its ability at reaching its values, it'd end up just flailing around, doing a stochastic series of actions with superhuman efficiency.
I think the Eliezer quote is predicated on the same sort of idea--that you've designed the AI to attempt to preserve its values; you just did it imperfectly. Assuming the value of value preservation isn't among the ones that become altered in various self-rewrites, at some point it'll become good enough at value preservation to keep whatever it has. But at that point, it'll be too late to preserve the original.
I have stopped understanding why these quotes are correct. Help!
More specifically, if you design an AI using "shallow insights" without an explicit goal-directed architecture - some program that "just happens" to make intelligent decisions that can be viewed by us as fulfilling certain goals - then it has no particular reason to stabilize its goals. Isn't that anthropomorphizing? We humans don't exhibit a lot of goal-directed behavior, but we do have a verbal concept of "goals", so the verbal phantom of "figuring out our true goals" sounds meaningful to us. But why would AIs behave the same way if they don't think verbally? It looks more likely to me that an AI that acts semi-haphazardly may well continue doing so even after amassing a lot of computing power. Or is there some more compelling argument that I'm missing?