dlthomas comments on Satisficers want to become maximisers - Less Wrong

21 Post author: Stuart_Armstrong 21 October 2011 04:27PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (67)

You are viewing a single comment's thread.

Comment author: dlthomas 21 October 2011 08:04:54PM 8 points [-]

I don't think this follows. Consider the case where there's two choices:

1) 10% chance of no paperclips, 90% chance of 3^^^3 paperclips 2) 100% chance of 20 paperclips

The maximizer will likely pick 1, while the satisficer will definitely prefer 2.

Comment author: timtyler 22 October 2011 07:29:18PM *  4 points [-]

I don't think this follows.

As I understand it, the actual problem in this area is not so much that "satisficers want to become maximisers" - but rather that a simple and effective means of satisficing fairly often involves constructing a maximising resource-gathering minion. Then after the satisficer is satisfied, the minion(s) may continue unsupervised - unless care is taken. I discussed this issue in my 2009 essay on the topic.

Comment author: Stuart_Armstrong 21 October 2011 08:36:59PM 3 points [-]

The expected utility of option 1 is higher than the threshold for the satisficer, so it could just as easily pick 1) as 2); it'll be indifferent between the two choices, and will need some sort of tie-breaker.

Comment author: dlthomas 21 October 2011 09:25:34PM *  3 points [-]

But inasmuch as it will want to want one over the other, it will want to want 2 which is guaranteed to continue to satisfice over 1 which has only a 90% chance of continuing to satisfice, so it should not want to become a maximizer.

Comment author: Manfred 21 October 2011 11:04:22PM *  4 points [-]

So that's actually the "bounded utility" definition, which Stuart says he isn't using. It does seem more intuitive though... I think you can get a paradox out of Stuart's definition, actually, which should not be surprising, since it isn't a utility-maximizer.

Comment author: Stuart_Armstrong 22 October 2011 08:01:32AM 0 points [-]

A satisficer is not motivated to continue to satisfice. It is motivated to take an action that is a satisficing action, and 1) and 2) are equally satisficing.

I know what you're trying to do, I think. I tried to produce a "continuously satisficing agent" or "future satisficing agent", but couldn't get it to work out.

Comment author: timtyler 22 October 2011 07:25:14PM 0 points [-]

It is motivated to take an action that is a satisficing action, and 1) and 2) are equally satisficing.

Surey option 1 has a 10% chance of failing to satisfy.

Comment author: Stuart_Armstrong 23 October 2011 12:01:57AM 2 points [-]

Option 1) already satisfies. Taking option 1) brings the expected utility up above the threshold, so the satisficer is done.

If you add the extra requirement that the AI must never let the expected utility fall below the threshold in future, then the AI will simply blind itself or turn itself off, once the satisficing level is reached; then its expected utility will never fall, as no extra information ever arrives.

Comment author: timtyler 23 October 2011 02:10:43AM 2 points [-]

Sorry - a failure to reread the question on my part :-(

Comment author: antigonus 21 October 2011 08:14:42PM *  1 point [-]

Right, the satisficer will not have an incentive to increase its expected utility by becoming a maximizer when its expected utility (by remaining a satisficer) is already over the threshold. But surely this condition would fail frequently.

Comment author: dlthomas 21 October 2011 09:26:40PM 0 points [-]

If it isn't over the threshold, it could just keep making the same decisions a maximizer would.