Gunnar_Zarncke comments on Probability, knowledge, and meta-probability - Less Wrong

38 Post author: David_Chapman 17 September 2013 12:02AM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (71)

You are viewing a single comment's thread.

Comment author: Gunnar_Zarncke 21 September 2013 09:00:32AM 2 points [-]

If you enjoy this sort of thing, you might like to work out what the exact optimal algorithm is.

I guess this is a joke. From wikipedia: "Originally considered by Allied scientists in World War II, it proved so intractable that, according to Peter Whittle, it was proposed the problem be dropped over Germany so that German scientists could also waste their time on it.[10]" (note that your wikipedia-link is broken)

Comment author: David_Chapman 21 September 2013 04:34:56PM 0 points [-]

Thank you very much—link fixed!

That's a really funny quote!

Multi-armed bandit problems were intractable during WWII probably mainly because computers weren't available yet. In many cases, the best approach is brute force simulation. That's the way I would approach the "blue box" problem (because I'm lazy).

But exact approaches have also been found: "Burnetas AN and Katehakis MN (1996) also provided an explicit solution for the important case in which the distributions of outcomes follow arbitrary (i.e., nonparametric) discrete, univariate distributions." The blue box problem is within that class.

Comment author: Vaniver 21 September 2013 04:30:46PM 0 points [-]

Yeah, but that was 60 years ago, and the single-armed bandit problem is easier than the multi-armed bandit.