Liron comments on Counterfactual Mugging v. Subjective Probability - Less Wrong
You are viewing a comment permalink. View the original post to see all comments and the full post content.
You are viewing a comment permalink. View the original post to see all comments and the full post content.
Comments (32)
Eliezer wrote:
Your post seems more appropriate as a comment to Eliezer's post. Your example with the fluctuating probabilities just shows that you didn't arrive at your "fine and rational" solution by computing with a generalized decision theory. You just guess-and-checked the two possible decisions to find the reflectively consistent one.
So Eliezer has asked: What mathematical formalism should a rational agent use to represent decision problems that crop up in its environment?
A causal decision theorist would tell you that the agent can use a Markov decision process. But in counterfactual-mugging-like situations, an MDP doesn't define a quantity that a reflectively self-consistent agent would maximize.
The challenge is to present a formalism in which to represent decision problems that might include some level of "decision-dependent counterfactual outcomes", and define what quantity is to be maximized for each formalized problem-instance.