ESRogs comments on Book: AKA Shakespeare (an extended Bayesian investigation) - Less Wrong

-1 Post author: matt 28 June 2013 04:34PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (13)

You are viewing a single comment's thread. Show more comments above.

Comment author: Benja 28 June 2013 05:53:25PM *  13 points [-]

(Okay, I'll unpack the implication:) Assigning a probability of 10^(-3) would mean being really, really, really, really sure that the hypothesis is wrong. To be well-calibrated, you would have to be able to make ten thousand similar judgments with similar strengths of evidence and only be wrong about ten times, and if you can do that, you're very good at this sort of thing.

Assigning 10^(-13) -- i.e., suggesting that you're so good that you can do this and only be wrong one in ten million million times -- is just obviously wrong.

So I was implying that the fact that the book suggests that this kind of number can be a plausible outcome means that it isn't a very good place to learn the art of making Bayesian probability estimates. To learn to make well-calibrated estimates, I should try to learn from people who stand a snowball's chance in hell of making such estimates themselves.

For an example from someone who has a claim to actually being good at this sort of thing, see Gwern's Who wrote the Death Note script?.

Comment author: ESRogs 28 June 2013 07:12:04PM 1 point [-]

a claim to actually being good at this sort of thing

If I'm reading the chart on that page correctly, Gwern is extremely well calibrated. Is the accuracy row for each confidence column telling us what fraction of predictions Gwern assigned a given confidence to have been right? He's got 50% - 44%, 60% - 64%, 70% - 71%, 80% - 83%, 90% - 92%, and 100% - 96%. That's incredible!

Comment author: gwern 28 June 2013 07:35:07PM 2 points [-]

Is the accuracy row for each confidence column telling us what fraction of predictions Gwern assigned a given confidence to have been right?

Yes, something like that. I forget the exact details of how it bins.

That's incredible!

Thank you. That's years of practice and some useful heuristics at work there.