orthonormal comments on Bayes' rule =/= Bayesian inference - Less Wrong

37 Post author: neq1 16 September 2010 06:34AM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (70)

You are viewing a single comment's thread.

Comment author: Oscar_Cunningham 16 September 2010 08:37:31AM 4 points [-]

I like this post, there's still a lot of confusion around Bayesian methods.

Two things that would have helped me while I was learning Bayesianism were that:

The frequency with which a coin comes up heads isn't a probability, no matter how much it looks like one.*

and

Bayes theorem doesn't do parameter estimation.

I might write these into a post sometime.

*This is what's going wrong in the heads of people who say things like "The probability is either 1 or 0, but I don't know which."

Comment author: TobyBartels 16 September 2010 06:04:33PM 2 points [-]

The frequency with which a coin comes up heads isn't a probability, no matter how much it looks like one.

Pedantry alert: This is not technically true, although it's still a very important point.

Every frequency is the probability of something; in this case, the frequency with which the coin comes up heads is the probability, given that you pick one of the times that the coin is flipped, that coin comes up heads that time.

But this is not the same thing as the probability that the coin comes up heads the next time that you flip it, which is what you are more likely to be interested in (and which people are liable to uselessly claim is "either 1 or 0, but I don't know which").

Comment author: wedrifid 16 September 2010 06:37:44PM *  2 points [-]

Even more pedantic: It still isn't a probability.

There is a probability p(heads | you picked one of those coins) and it can be found by simply taking the frequency. But the frequency still doesn't mean the probability. In much the same way 5 / 100 balls in the jar being red isn't a probability. It is a curious fact about the colors of balls in the jar. p(ball is red | I take a ball from that jar) is a probability.

Comment author: TobyBartels 16 September 2010 06:46:08PM 3 points [-]

Our think that our pedantries are clashing on the word "is".

I'm thinking of both frequencies and probabilities as numbers, and using "is" between them if they are equal numbers. You are (I guess) thinking of frequencies and probabilities are things of different types, which are not numbers even though they may be measured by numbers.

Come to think of it, your interpretation is more pedantic than mine, so I concede.

Comment author: Oscar_Cunningham 17 September 2010 04:53:42PM 0 points [-]

Thinking about it further, there is no probability which is even numerically equal to the frequency. Probabilities are subjective, you know them or can work them out in your head. But you don't know the frequency, so it can't be equal to any of the probabilities in your head (except by coincidence).

Comment author: TobyBartels 17 September 2010 11:55:43PM *  1 point [-]

I think that it's a mistake to reserve the term ‘probability’ for beliefs held by actual people (or other beings with beliefs). In fact, since actual people are subject to such pervasive epistemic biases (such as we try to overcome here), I doubt that anybody (even readers of Less Wrong) holds actual beliefs that obey the mathematical laws of probability.

I prefer to think of probabiliy as the belief of an ideal rational being with given information / evidence / observations. (This makes me what they call an ‘objective Bayesian’, although really it just pushes the subjectivity back to the level of information.) So even if nobody knows the frequency with which a given coin comes up heads (which is certainly true if the coin is still around and may be flipped in the future), I can imagine a rational being who knows that frequency.

But in a post that was supposed to be pedantic, I was remiss in not specifying exactly what information the probability depends on!

Comment author: Oscar_Cunningham 17 September 2010 11:57:20AM 1 point [-]

Thanks, this clears some things up for me.

Comment author: TobyBartels 17 September 2010 11:41:55PM 0 points [-]

You're welcome!

Comment author: cousin_it 16 September 2010 09:44:20AM *  1 point [-]

I don't understand how you can hold a position like that and still enjoy the post. How do you parse the phrase "my prior for the probability of heads" in the second example?

Comment author: neq1 16 September 2010 01:31:36PM 2 points [-]

In the second example the person was speaking informally, but there is nothing wrong with specifying a probability distribution for an unknown parameter (and that parameter could be a probability for heads)

Comment author: Oscar_Cunningham 16 September 2010 11:28:11AM *  1 point [-]

I hadn't seen that, but you're right that that sentence is wrong. "Probability" should have been replaced with "frequency" or something. A prior on a probability would be a set of probabilities of probabilities, and would soon lead to infinite regress.

Comment author: neq1 16 September 2010 01:33:14PM 1 point [-]

only if you keep specifying hyper-priors, which there is no reason to do

Comment author: Oscar_Cunningham 16 September 2010 02:50:26PM 0 points [-]

Exactly. There's no point in the first meta-prior either.