Zack_M_Davis comments on 2013 Less Wrong Census/Survey - Less Wrong

78 Post author: Yvain 22 November 2013 09:26AM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (616)

Sort By: Leading

You are viewing a single comment's thread.

Comment author: Zack_M_Davis 22 November 2013 03:34:59AM *  22 points [-]

For the Prize Question, you should use a random number generator and cooperate with probability 0.8. Why? Suppose that the fraction of survey-takers that cooperate is p. Then the value of the prize will be proportional to p and there will be p + 4(1 - p) raffle entries. The expected value of Cooperating is p/(p + 4(1-p)) and the expected value of Defecting is 4(1-p)/(p + 4(1-p)). In equilibrium, these must be the same: if one choice were more profitable than the other, then people would switch until this was no longer the case. Thus p = 4(1 - p) and thus p = 4/5.

Addendum 29 November: Actually, this is wrong; see ensuing discussion.

Comment author: ThrustVectoring 22 November 2013 04:44:20AM 12 points [-]

The expected value of defecting is 4p/(p + 4(1-p), to within one part in the number of survey takers. Whether or not you defect makes no difference as to the proportion of people who defect.

The solution is to determine how likely it is that a random participant is going to defect, conditional on your choice of cooperate or defect. If you're playing with a total of N copies of yourself, you cooperate and get the maximal payout ($60/N). If you're playing against cooperate bots, you defect and get $60*4N/(N-1).

We can generalize this to partial levels. If you play with D defectors and C cooperators whose opinion you can't change, and X people who will cooperate when you cooperate (and defect when you defect), then the payouts are as thus:

C: (C + X)/(C + D + X) D: 4(C /(C + D + X)

You can solve for the break even point by setting C + X = 4 * C

So the answer is that you should defect, unless you think that for every person who is going to cooperate no matter what, there are at least three people who are thinking with similar enough reasoning to come up with the same answer you come up with (regardless of what answer that is).

Comment author: Oscar_Cunningham 25 November 2013 10:56:33PM *  2 points [-]

I think you've got the denominators of your fractions wrong. There are 4 raffle tickets for everyone who defects. I get the values

C: (C + X)/(C + 4D + X) D: 4(C /(C + 4D + 4X)

which solves to a horrible quadratic surd.

If we wanted to we could combine your method with Zack's and assume that C people cooperate, D defect and X make the same choice I do, which is to cooperate with probability p. I think this gets kinda ugly though.

Comment author: ThrustVectoring 26 November 2013 12:44:15AM 3 points [-]

The fractions I wrote are payout * number of tickets, not the chance of winning. But you do have a point: changing many people from cooperate to defect does dilute the total pool of tickets, and not be an unnoticeable amount.

The corrected answer is Payout * Chance to win, which is:

C: (C + X)/(C + D + X) * (1 / (C + 4D + X)

D: (C/(C+D+C)) * (4 / (C + 4D + 4X))

And you don't want to combine my method with Zack's. You don't want a probabilistic strategy - you want to figure out what your beliefs are as far as "how many people do I expect to be in categories C, X, and D". Given your beliefs about how your choices affect others, there's exactly one right choice.

Comment author: Oscar_Cunningham 26 November 2013 04:05:59PM *  2 points [-]

(By the way, the numbers I gave are the same as the ones you gave, only I cancelled a common factor of (C+D+X))

And you don't want to combine my method with Zack's. You don't want a probabilistic strategy - you want to figure out what your beliefs are as far as "how many people do I expect to be in categories C, X, and D". Given your beliefs about how your choices affect others, there's exactly one right choice.

I think that your "one right choice" might sometimes be a probabilistic one. To make this more obvious, consider a game where the value of the prize is maximal when exactly half of the participants choose C, and the value goes down as the proportion gets further from a half (and any of the participants is equally likely to win the prize). Then I think it's obvious that the correct strategy is to estimate C, D, and X as before, and then cooperate with probability p so that C+pX=D+(1-p)X. Then because everyone else in X acts as you do you'll end up with exactly half the people choosing C, which is what you want.

Note that even some of the people in X who you are "acausally controlling" still end up choosing a different option from you (assuming that your random number generators are independent). This allows you to exactly optimise the proportion of people who choose C, which is what makes the strategy work.

I think the same thing applies in Yvain's game. In particular, if we thought that C=D=0 then I think that Zack's analysis is exactly correct (although I wouldn't have used exactly the same words as he does).

EDIT: I retract the last sentence. Zack's calculation isn't what you want to do even in the C=D=0 case. In that case I endorse cooperating with p=1. But I still think that mixed strategies are best in some of the cases with C or D non-zero. In particular what about the case with D=0 but C=X? Then I reckon you should pick C with p=0.724.

Comment author: Zack_M_Davis 30 November 2013 05:57:35AM 4 points [-]

I think this is it. Suppose there are C CooperateBots, D DefectBots, and X players who Cooperate with probability p. The expected utility of the probabilistic strategy is (proportional to) (p(C + pX) + 4(1-p)(C + pX))/(C + 4D + pX + 4(1-p)X). Then (he said, consulting his computer algebra system) if C/X < 1/3 then p = 1 (Cooperate), if C/X > 3 then p = 0 (Defect), and p assumes intermediate values if 1/3 < C/X < 3 (including 0.7239 if C/X = 1, as you mention).

Comment author: hylleddin 22 November 2013 09:45:22AM *  0 points [-]

The expected value of defecting is 4p/(p + 4(1-p), to within one part in the number of survey takers. Whether or not you defect makes no difference as to the proportion of people who defect.

Unless you're using timeless decision theory, if I understand TDT correctly (which I very well might not). In that case, the calculations by Zack show the amount of causal entanglement for which cooperation is a good choice. That is, P(others cooperate | I cooperate) and P(others defect | I defect) should be more than 0.8 for cooperation to be a good idea.

I do not think my decisions have that level of causal entanglement with other humans, so I defected.

Though, I just realized, I should have been basing my decision on my entanglement with lesswrong survey takers, which is probably substantially higher. Oh well.

Comment author: Oscar_Cunningham 26 November 2013 04:40:25PM 0 points [-]

Though, I just realized, I should have been basing my decision on my entanglement with lesswrong survey takers, which is probably substantially higher. Oh well.

I defected for the same reasons as you. We're entangled! Reading the responses of the other survey takers I think it's clear that very few people are entangled with us, so we did indeed make the right choice!

Comment author: hylleddin 22 November 2013 09:55:19AM 0 points [-]

Nevermind, you already covered this, though in a different fashion.

Comment author: ThrustVectoring 22 November 2013 03:30:25PM 1 point [-]

Yeah, and the math is a little different, three entangled decision makers for each cooperate-bot you can defect against (the number of defectors don't matter, surprisingly). You get three extra chances to get the money generously donated to the pool by the cooperate bots by defecting, compared to causing a certain number of people to help you make the pool even larger.

Comment author: stevko 22 November 2013 12:03:02PM 5 points [-]

I think that way to get maximum reward is doing the survey (at least) four times and always answering cooperate.