Nanashi comments on Calibration Test with database of 150,000+ questions - Less Wrong

37 Post author: Nanashi 14 March 2015 11:22AM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (31)

You are viewing a single comment's thread. Show more comments above.

Comment author: Kindly 13 March 2015 10:08:50PM 0 points [-]

Well, I'm getting a reasonably exciting calibration curve with lots of ups and downs. Cool!

Bug: when I click "Display Calibration Curve" for a second time, the graph is displayed in a larger size. (Doing this sufficiently many times crashed Chrome.) Refreshing the page fixes this behavior.

Feature request: I would like to be able to see if my 50% correctness for 30% confidence is getting 1 out of 2 questions right or 5 out of 10. (Error bars of some sort would also work.)

Comment author: Nanashi 13 March 2015 10:50:16PM 0 points [-]

Good idea. I don't think the charts API I'm using will let me do error bars but a good alternative would be a secondary chart that's a bar graph of right vs total questions for each bucket. This would also give a good visual representation of the frequency with which you use various confidence levels.