mushroom comments on Explanations for Less Wrong articles that you didn't understand - Less Wrong Discussion

Post author: Kaj_Sotala 31 March 2014 11:19AM (18 points)

Comments (118)

Comment author: cousin_it 31 March 2014 03:51:51PM * 16 points

If you want to build an AI that maximizes utility, and that AI can create copies of itself, and each copy's existence and state of knowledge can also depend on events happening in the world, then you need a general theory of how to make decisions in such situations. In the limiting case when there's no copying at all, the solution is standard Bayesian rationality and expected utility maximization, but that falls apart when you introduce copying. Basically we need a theory that looks as nice as Bayesian rationality, is reflectively consistent (i.e. the AI won't immediately self-modify away from it), and leads to reasonable decisions in the presence of copying. Coming up with such a theory turns out to be surprisingly hard. Many of us feel that UDT is the right approach, but many gaps still have to be filled in.

Note that many problems that involve copying can be converted to problems that create identical mind states by erasing memories. My favorite motivating example is the Absent-Minded Driver problem. The Sleeping Beauty problem is similar to that, but formulated in terms of probabilities instead of decisions, so people get confused.
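To make the Absent-Minded Driver concrete, here is a minimal sketch in Python. It assumes the payoffs usually quoted for the problem (0 for exiting at the first intersection, 4 for exiting at the second, 1 for driving past both); those numbers are not stated in this comment, and the function names are made up for illustration. Because the driver cannot tell the two intersections apart, a policy is just one probability p of continuing, used at both, and the sketch searches for the p with the best expected payoff.

```python
from fractions import Fraction

# Assumed payoffs (the usual textbook values, not given in the comment above).
EXIT_FIRST = 0      # exit at the first intersection
EXIT_SECOND = 4     # exit at the second intersection
CONTINUE_BOTH = 1   # drive past both intersections


def expected_payoff(p):
    """Expected payoff of continuing with probability p at every intersection.

    The same p applies at both intersections because the driver's mind
    state is identical at each one.
    """
    return (
        (1 - p) * EXIT_FIRST
        + p * (1 - p) * EXIT_SECOND
        + p * p * CONTINUE_BOTH
    )


def best_policy(steps=300):
    """Grid search over p = 0, 1/steps, ..., 1 using exact fractions."""
    candidates = [Fraction(k, steps) for k in range(steps + 1)]
    return max(candidates, key=expected_payoff)


best_p = best_policy()
best_value = expected_payoff(best_p)
print(best_p, best_value)  # 2/3 4/3
```

Under these assumed payoffs the best policy is to continue with probability 2/3, which beats both always exiting (payoff 0) and always continuing (payoff 1). The sketch only evaluates policies chosen in advance; it says nothing about how the driver should reason about probabilities while standing at an intersection, which is the part that causes the disagreement.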

An even simpler way to emulate copying is by putting multiple people in the same situation. That leads to various "anthropic problems", which are well covered in Bostrom's book. My favorite example of these is Psy-Kosh's problem.

Another idea that's equivalent to copying is having powerful agents that can predict your actions, like in Newcomb's problem, Counterfactual Mugging and some more complicated scenarios that we came up with.

Comment author: [deleted] 31 March 2014 04:40:19PM 4 points

Another idea that's equivalent to copying is having powerful agents that can predict your actions...

Can you explain this equivalence?

Comment author: cousin_it 31 March 2014 05:12:44PM * 8 points

When a problem involves a predictor that's predicting your actions, it can often be transformed into another problem that has an indistinguishable copy of you inside the predictor. In some cases, like Counterfactual Mugging, the copy and the original can even receive different evidence, though they are still unable to tell which is which.
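As a rough illustration of that transformation, here is a sketch of Newcomb's Problem in Python where the predictor is nothing more than a second call to the agent's own code. The payoffs (1,000,000 in the opaque box, 1,000 in the transparent one) are the conventional ones and are an assumption here, as are all the names; the sketch also assumes a deterministic agent and a perfect predictor.

```python
from typing import Callable

# An agent is a function that takes no input and returns its choice.
# Both the "real" agent and the copy inside the predictor see the same
# (empty) input, so neither can tell which one it is.
Agent = Callable[[], str]

OPAQUE_PRIZE = 1_000_000   # assumed conventional payoff
TRANSPARENT_PRIZE = 1_000  # assumed conventional payoff


def one_boxer() -> str:
    return "one-box"


def two_boxer() -> str:
    return "two-box"


def newcomb_payoff(agent: Agent) -> int:
    # Step 1: the predictor runs a copy of the agent to fill the opaque box.
    prediction = agent()
    opaque = OPAQUE_PRIZE if prediction == "one-box" else 0

    # Step 2: the original agent chooses. It runs the same code on the
    # same input, so its choice necessarily matches the prediction.
    choice = agent()
    if choice == "one-box":
        return opaque
    return opaque + TRANSPARENT_PRIZE


print(newcomb_payoff(one_boxer))  # 1000000
print(newcomb_payoff(two_boxer))  # 1000
```

Seen this way, the agent is choosing one output for two instances of itself, which is the same structure as the copying and memory-erasure problems.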

There are more complicated scenarios, where the predictor is doing high-level logical reasoning about you instead of running a simulation of you. In simple cases like Newcomb's Problem, that distinction doesn't matter, but there is an important family of problems where it matters. The earliest known example is Gary Drescher's Agent Simulates Predictor. Other examples are Wei Dai's problem about bargaining and logical uncertainty and my own problem about logical priors. Right now this is the branch of decision theory that interests me most.