How do I choose the best metric to measure my calibration?

ChristianKl

4th Jan 2017

This is a linkpost for http://stats.stackexchange.com/q/253443/3807

Manfred (8y)

"Proper scoring rule" just means that you get the best expected score by reporting the probabilities you actually believe, so giving the most accurate probabilities you can is what gets rewarded. In that sense, any concave proper scoring rule will give you a good feedback mechanism. People like the log scoring rule because it corresponds to information (the kind you can measure in bits and bytes), so a given increase in score has a meaning in terms of how much better you are using your information.
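To make the "proper" part concrete, here is a minimal sketch for a single binary event. The function names and the value `true_p = 0.7` are made up for illustration. It scans a grid of possible reported probabilities and checks which one gives the highest expected log score when the event really happens 70% of the time.

```python
import math

def log_score(p, outcome):
    """Log score in bits for a forecast p that a binary event happens."""
    return math.log2(p if outcome else 1.0 - p)

def expected_log_score(reported, true_p):
    """Average log score if the event occurs with probability true_p."""
    return (true_p * log_score(reported, True)
            + (1.0 - true_p) * log_score(reported, False))

true_p = 0.7  # hypothetical real frequency of the event
candidates = [i / 100 for i in range(1, 100)]  # reports from 0.01 to 0.99

# The report that maximizes the expected score on this grid.
best = max(candidates, key=lambda r: expected_log_score(r, true_p))
print(best)
```

On this grid the best report is the true probability itself. Shading the report toward 0.5 or toward 1 only lowers the expected score, which is what "proper" is meant to guarantee.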

The information measured by your log score is identical to Shannon's idea of information carried by digital signals. When a binary event is completely unknown to you, you can gain 1 bit of information by learning about it. For events that you can predict to high accuracy, the entropy of the event (according to your distribution) is lower, and you gain less information by learning the result. In fact, if you look at the expected score, it goes to zero as the event becomes more and more predictable (though you're still incentivized to answer correctly).
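As a rough illustration of the point about predictability, assume a perfectly calibrated forecaster and a binary event. The expected log score is then minus the entropy of the event. The helper name below is hypothetical and the probabilities are arbitrary examples.

```python
import math

def binary_entropy_bits(p):
    """Shannon entropy in bits of a binary event with probability p."""
    if p in (0.0, 1.0):
        return 0.0  # a certain event carries no information
    return -(p * math.log2(p) + (1.0 - p) * math.log2(1.0 - p))

# Expected log score of a calibrated forecaster is -entropy.
for p in (0.5, 0.9, 0.99, 0.999):
    print(p, -binary_entropy_bits(p))
```

At p = 0.5 the entropy is exactly 1 bit, and the expected score moves toward zero as p approaches 1.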

But I think this leaves out something interesting that I don't have a good answer for, which is that this straightforward interpretation only works when you, the human, don't screw up. When you do screw up, I'm not sure there's a clear interpretation of the score.

wubbles (8y)

The logarithmic scoring rule measures the information carried by the event given your predictions. Reducing its expectation corresponds to reducing the information carried by the event when it arrives.