You're looking at Less Wrong's discussion board. This includes all posts, including those that haven't been promoted to the front page yet. For more information, see About Less Wrong.

Psy-Kosh comments on IBM's "Watson" program to compete against "Jeopardy" champions tonight - Less Wrong Discussion

10 Post author: NihilCredo 14 February 2011 03:28PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (26)

You are viewing a single comment's thread. Show more comments above.

Comment author: Psy-Kosh 17 February 2011 02:59:54AM 2 points [-]

The confidences are supposed to be probabilities? But they often summed to > 100%

Or is it "the procedure for generating the confidences is such that it'll be well calibrated for the highest ranking answer"?

Comment author: Dreaded_Anomaly 17 February 2011 03:47:05AM *  2 points [-]

No, sorry, that should say confidences everywhere, not probabilities. I had written it out incorrectly and then edited it, but I missed that one. Fixed now.

Comment author: Psy-Kosh 17 February 2011 03:51:16AM 0 points [-]

What I meant was "for the top three answers, the confidences would sometimes sum to > 100, so how does that work?"

Is the procedure defined as well calibrated only for the top answer, or is there something I'm missing?

Comment author: Dreaded_Anomaly 17 February 2011 04:34:20AM 1 point [-]

The confidence level compares the answer to other answers Watson's given in the past, based on how much the answer is supported by the evidence Watson has and uses. All the answers are generated and scored in parallel. It's not a comparison among the answers generated for a specific question, so it shouldn't necessarily add up to 100.

Quote from Chris Welty at last night's panel: "When [Watson] says 'this is my answer, 50% sure,' half the time he's right about that, and half the time he's wrong. When he says 80%, 20% of the time he's wrong."

Comment author: Psy-Kosh 17 February 2011 05:27:31AM 0 points [-]

Ah, thanks.