Tyrrell_McAllister comments on A Problem About Bargaining and Logical Uncertainty - Less Wrong

23 Post author: Wei_Dai 21 March 2012 09:03PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (45)

You are viewing a single comment's thread. Show more comments above.

Comment author: Vladimir_Nesov 25 March 2012 12:55:39AM *  3 points [-]

1) A good decision theory should always do what it would have precommitted to doing.

It's dangerous to phrase it this way, since coordination (which is what really happens) allows using more knowledge than was available at the time of a possible precommitment, as I described here.

4) Writing P is equivalent to supplying only one bit: should P pay up if asked?

Not if the correct decision depends on an abstract fact that you can't access, but can reference. In that case, P should implement a strategy of acting depending on the value of that fact (computing and observing that value to feed to the strategy). That is, abstract facts that will only be accessible in the future play the same role as observations that will only be accessible in the future, and a strategy can be written conditionally on either.

The difference between abstract facts and observations however is that observations may tell you where you are, without telling you what exists and what doesn't (both counterfactuals exist and have equal value, you're in one of them), while abstract facts can tell you what exists and what doesn't (the other logical counterfactual doesn't exist and has zero value).

Comment author: Tyrrell_McAllister 25 March 2012 03:42:43PM *  1 point [-]

4) Writing P is equivalent to supplying only one bit: should P pay up if asked?

Not if the correct decision depends on an abstract fact that you can't access, but can reference.

In general, the distinction is important. But, for this puzzle, the proposition "asked" is equivalent to the relevant "abstract fact". The agent is asked iff the millionth digit of pi is odd. So point (4) already provides as much of a conditional strategy as is possible.

Comment author: Vladimir_Nesov 25 March 2012 04:03:35PM *  0 points [-]

It's assumed that the agent doesn't know if the digit is odd (and whether it'll be in the situation described in the post) at this point. The proposal to self-modify is a separate event that precedes the thought experiment.

Comment author: Tyrrell_McAllister 25 March 2012 04:40:36PM *  2 points [-]

It's assumed that the agent doesn't know if the digit is odd (and whether it'll be in the situation described in the post) at this point.

Yes. Similarly, it doesn't know whether it will be asked (rather than do the asking) at this point.

Comment author: Vladimir_Nesov 25 March 2012 04:54:13PM 1 point [-]

I see, so there's indeed just one bit, and it should be "don't cooperate".

This is interesting in that UDT likes to ignore epistemic significance of observations, but here we have an observation that implies something about the world, and not just tells where the agent is. How does one reason about strategies if different branches of those strategies tell something about the value of the other branches?..