bigjeff5 comments on Bayes' Theorem Illustrated (My Way) - Less Wrong
You are viewing a comment permalink. View the original post to see all comments and the full post content.
You are viewing a comment permalink. View the original post to see all comments and the full post content.
Comments (191)
No, it's the exact same question, only the labels are different.
The probability that any one child is boy is 50%. We have been told that one child is a boy, which only leaves two options - HH and HT. If TH were still available, then so would TT be available because the next flip could be revealed to be tails.
Here's the probability in bayesian:
P(BoyBoy) = 0.25 P(Boy) = 0.5 P(Boy|BoyBoy) = 1
P(BoyBoy|Boy) = P(Boy|BoyBoy)*P(BoyBoy)/P(Boy)
P(BoyBoy|Boy)= (1*0.25) / 0.5 = 0.25 / 0.5 = 0.5
P(BoyBoy|Boy) = 0.5
It's exactly the same as the coin flip, because the probability is 50% - the same as a coin flip. This isn't the monty hall problem. Knowing half the problem (that there's at least one boy) doesn't change the probability of the other boy, it just changes what our possibilities are.
No, it isn't. You should consider that you are disagreeing with a pretty standard stats question, so odds are high you are wrong. With that in mind, you should reread what people are telling you here.
Now, consider "I flip two coins" the possible outcomes are hh,ht,th,tt
I hope we can agree on that much.
Now, I give you more information and I say "one of the coins is heads," so we Bayesian update by crossing out any scenario where one coin isn't heads. There is only 1 (tt)
hh,ht,th
So it should be pretty clear the probability I flipped two heads is 1/3.
Now, your scenario, flipped two coins (hh,ht,th,tt), and I give you the information "the first coin is heads," so we cross out everything where the first coin is tails, leaving (hh,ht). Now the probability you flipped two heads is 1/2.
I don't know how to make this any more simple.
http://en.wikipedia.org/wiki/Boy_or_Girl_paradox
I know it's not the be all end all, but it's generally reliable on these types of questions, and it gives P = 1/2, so I'm not the one disagreeing with the standard result here.
Do the math yourself, it's pretty clear.
Edit: Reading closer, I should say that both answers are right, and the probability can be either 1/2 or 1/3 depending on your assumptions. However, the problem as stated falls best to me in the 1/2 set of assumptions. You are told one child is a boy and given no other information, so the only probability left for the second child is a 50% chance for boy.
Did you actually read it? It does not agree with you. Look under the heading "second question."
I did the math in the post above, enumerating the possibilities for you to try to help you find your mistake.
Edit, in response to the edit:
Which is exactly analogous to what Jiro was saying about the Tuesday question. So we all agree now? Tuesday can raise your probability slightly above 50%, as was said all along.
And you are immediately making the exact same mistake again. You are told ONE child is a boy, you are NOT told the FIRST child is a boy. You do understand that these are different?
Re-read it.
The relevant quote from the Wiki:
We have no general population information here. We have one man with at least one boy.
I'm not at all sure you understand that quote. Lets stick with the coin flips:
Do you understand why these two questions are different: I tell you- "I flipped two coins, at least one of them came out heads, what is the probability that I flipped two heads?" A:1/3 AND "I flipped two coins, you choose one at random and look at it, its heads.What is the probability I flipped two heads" A: 1/2
Yes, and we are dealing with the second question here.
Is that not what I said before?
We don't have 1000 families with two children, from which we've selected all families that have at least one boy (which gives 1/3 probability). We have one family with two children. Then we are told one of the children is a boy, and given zero other information. The probability that the second is a boy is 1/2, so the probability that both are boys is 1/2.
The possible options for the "Boy born on Tuesday" are not Boy/Girl, Girl/Boy, Boy/Boy. That would be the case in the selection of 1000 families above.
The possible options are Boy (Tu) / Girl, Girl / Boy (Tu), Boy (Tu) / Boy, Boy / Boy (Tu).
There are two Boy/Boy combinations, not one. You don't have enough information to throw one of them out.
This is NOT a case of sampling.
As long as you realize there is a difference between those two questions, fine. We can disagree about what assumptions the wording should lead us to, thats irrelevant to the actual statistics and can be an agree-to-disagree situation. Its just important to realize that what the question means/how you get the information is important.
If we have one family with two children, of which one is a boy, they are (by definition) a member of the set "all families that have at least one boy." So it matters how we got the information.
If we got that information by grabbing a kid at random and looking at it (so we have information about one specific child), that is sampling, and it leads to the 1/2 probability.
If we got that information by having someone check both kids, and tell us "at least one is a boy" we have different information (its information about the set of kids the parents have, not information about one specific kid).
If it IS sampling (if I grab a kid at random and say "whats your Birthday?" and it happens to be Tuesday), then the probability is 1/2. (we have information about the specific kid's birthday).
If instead, I ask the parents to tell me the birthday of one of their children, and the parent says 'I have at least one boy born on Tuesday', then we get, instead, information about their set of kids, and the probability is the larger number.
Sampling is what leads to the answer you are supporting.
The answer I'm supporting is based on flat priors, not sampling. I'm saying there are two possible Boy/Boy combinations, not one, and therefore it takes up half the probability space, not 1/3.
Sampling to the "Boy on Tuesday" problem gives roughly 48% (as per the original article), not 50%.
We are simply told that the man has a boy who was born on tuesday. We aren't told how he chose that boy, whether he's older or younger, etc. Therefore we have four possibilites, like I outlined above.
Is my analysis that the possibilities are Boy (Tu) /Girl, Girl / Boy (Tu), Boy (Tu)/Boy, Boy/Boy (Tu) correct?
If so, is not the probability for some combination of Boy/Boy 1/2? If not, why not? I don't see it.
BTW, contrary to my previous posts, having the information about the boy born on Tuesday is critical because it allows us (and in fact requires us) to distinguish between the two boys.
That was in fact the point of the original article, which I now disagree with significantly less. In fact, I agree with the major premise that the tuesday information pushes the odds of Boy/Boy closer 50%, I just disagree that you can't reason that it pushes it to exactly 50%.
No. For any day of the week EXCEPT Tuesday, boy and girl are equivalent. For the case of both children born on Tuesday you have for girls: Boy(tu)/Girl(tu),Girl(tu)/Boy(tu), and for boys: boy(tu)/boy(tu).
This statement leads me to believe you are still confused. Do you agree that if I know a family has two kids, I knock on the door and a boy answers and says "I was born on a Tuesday," that the probability of the second kid being a girl is 1/2? And in this case, Tuesday is irrelevant? (This the wikipedia called "sampling")
Do you agree that if, instead, the parents give you the information "one of my two kids is a boy born on a Tuesday", that this is a different sort of information, information about the set of their children, and not about a specific child?
For the record, I'm sure this is frustrating as all getout for you, but this whole argument has really clarified things for me, even though I still think I'm right about which question we are answering.
Many of my arguments in previous posts are wrong (or at least incomplete and a bit naive), and it didn't click until the last post or two.
Like I said, I still think I'm right, but not because my prior analysis was any good. The 1/3 case was a major hole in my reasoning. I'm happily waiting to see if you're going to destroy my latest analysis, but I think it is pretty solid.