JulianMorrison comments on Bead Jar Guesses - Less Wrong

17 Post author: Alicorn 04 May 2009 06:59PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (127)

You are viewing a single comment's thread.

Comment author: JulianMorrison 05 May 2009 02:41:09AM 0 points [-]

Thinking about how an Occamian learner like AIXI would approach the problem, it would probably start from the simplest domain theory "beads have a color, red is a color I've heard mentioned, therefore all beads are red", p=1. If the first bead was grey, it would switch to "all beads are grey", p=0. The second bead is red, "half and half", p = 0.5, and so on, ratcheting up theories from the simplest first.

Comment author: Peter_de_Blanc 05 May 2009 01:59:00PM 2 points [-]

FYI, AIXI does not work like this; it uses a probability distribution over all Turing machines.

Comment author: Vladimir_Nesov 05 May 2009 09:32:31AM *  2 points [-]

This is not how AIXI [*] works. It considers all possible programs at the start, with some probability. The simplest program that fits the data is not the only one it considers; it just gets most of the probability mass. So, from the start, it will give some tiny probability to a hypothesis that the beads will spell War and Peace is morse code. Only when this hypothesis is falsified by the data, it will drop out of race.

[*] M. Hutter (2003). `A Gentle Introduction to The Universal Algorithmic Agent AIXI'. Tech. rep. [abstract/download]

Comment author: MBlume 05 May 2009 02:44:22AM 1 point [-]

red is a color I've heard mentioned, therefore all beads are red

and thus his bayes-score drops to -Infinity

Comment author: JulianMorrison 05 May 2009 06:25:22AM -1 points [-]

I don't think AIXI tries to maximize its Bayes score in one round - it tries to minimize the number of rounds until it converges on a good-enough model.