In all of this, there is just a BIG problem with self-consistency of values when there is no FOV to pin anything down.
It might be worthwhile to explore more precisely the role of the word "problem" in that sentence (and your associated thoughts).
I mean, OK, maybe one function an FOV serves is to enforce consistency, and maybe losing an FOV therefore makes my values less consistent over time. For at least some FOVs that's certainly true.
What makes that a problem?
You're right. I was in a mode of using familiar and related words without really thinking about what they meant.
This was the thesis I was developing, related to the hypothetical problem of writing your own utility function:
In all of this, there is just a BIG problem with self-generation of values when there is no FOV to pin anything down.
And the problem is one of logic. When choosing what to value, why should you value this or that or anything? Actually, you can't value anything; there's no value.
X is valued if you can use it to get Y that is valued. ...
Link: physicsandcake.wordpress.com/2011/01/22/pavlovs-ai-what-did-it-mean/
Suzanne Gildert basically argues that any AGI that can considerably self-improve would simply alter its reward function directly. I'm not sure how she arrives at the conclusion that such an AGI would likely switch itself off. Even if an abstract general intelligence would tend to alter its reward function, wouldn't it do so indefinitely rather than switching itself off?
If it wants to maximize its reward by increasing a numerical value, why wouldn't it consume the universe doing so? Maybe she had something in mind along the lines of an argument by Katja Grace:
Link: meteuphoric.wordpress.com/2010/02/06/cheap-goals-not-explosive/
I am not sure if that argument would apply here. I suppose the AI might hit diminishing returns but could again alter its reward function to prevent that, though what would be the incentive for doing so?
ETA:
I left a comment over there:
ETA #2:
What else I wrote: