To recap, A is trying to deduce enough about U to maximize it:
A := Find some f with a proof of "U = f(A)", then return argmax f.
If U is A, A can fail by proving f = {(5,5),(10,0)}: If it could prove this, it would be true, for only 5 is checked. Then by Löb's theorem, it can prove this.
I've thrown this problem at some people in my university and a fellow student's idea led to:
A' := Find all f with a proof of "U = f(A')", then return argmax (their pointwise maximum).
The true f is among the found f. Fooling ourselves into a suboptimal action would require us to believe it at least as good as the true maximum, which the single check refutes.
Note that A' can't prove that some f is not found, due to Gödel. Therefore, it can never prove what it is going to do!
Yeah, we figured out sometime later that the chicken rule isn't needed. Also we switched to using provability logic (a kind of modal logic), because it leads to a more natural formulation than halting oracles. As a bonus, that logic is decidable, so the whole thing can be implemented on a computer.
https://agentfoundations.org/item?id=4
That idea is known as "modal UDT" and I'm happy to have come up with it, because it seems to be a local peak in the space of ideas. But unfortunately that's as far as it goes. Our world doesn't look like a modal formula, it's more like a cellular automaton, so we need decision theories that can work with that. Hence all the research on logical inductors etc.
(Oh, "have heart" was not "stay strong even though you're 7 years behind", but "be merciful, I wrote that 7 years ago".)
Can the failure of this approach to handle scenarios as complicated as cellular automata be formalized using a problem like 5-and-10?
What do you mean by provability being decidable? Couldn't I ask whether it is provable that a Turing machine halts?
Edit: In Conway's Game of Life, one can build replicators and Turing machines. If we strewed provability oracle interfaces across the landscape, we should be able to implement this reasoner and