So: the only information available to any agent is in the form of its internal state and its sensory channels. Any function it computes must have that domain (or some subset of it). Confining the agent to that domain isn't any kind of restriction. All utility functions calulated over the state of the world necessarily correspond to other utility functions calulated over the domain of internal state and sensory input.
Your example seems wrong to me. The problem is with:
For example, you could offer to change a robot's sensory contents and internal state to something with higher utility than its current state - and if the agent refuses, you will reset it. If we were using a "utility wrapper" model, all modeled agents would say yes.
That's not correct. For one thing, the agent may not believe what you say.


Subscribe to RSS Feed
Here's the corresponding utility function - assuming that state transitions are tied to actions.
Using simple maximisation algorithms (e.g. gradient descent) on that utility landscape will produce the behaviour in question. More sophisticted algorithms will do no better.
Your "BartlebeyBot" agent totally ignored Bayesian evidence. By what rule does "my" example agent have to listen and respond to such evidence, while "yours" does not? Again, I don't think your proposed counter example is remotely convincing.
Why do you think there's a counter-example? Did you read the referenced Dewey paper about O-Maximisers?