Nanashi comments on Calibration Test with database of 150,000+ questions - LessWrong
You are viewing a comment permalink. View the original post to see all comments and the full post content.
You are viewing a comment permalink. View the original post to see all comments and the full post content.
Comments (31)
Good idea. I don't think the charts API I'm using will let me do error bars but a good alternative would be a secondary chart that's a bar graph of right vs total questions for each bucket. This would also give a good visual representation of the frequency with which you use various confidence levels.