You're looking at Less Wrong's discussion board. This includes all posts, including those that haven't been promoted to the front page yet. For more information, see About Less Wrong.

Stuart_Armstrong comments on Versions of AIXI can be arbitrarily stupid - Less Wrong Discussion

15 Post author: Stuart_Armstrong 10 August 2015 01:23PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (59)

You are viewing a single comment's thread.

Comment author: buybuydandavis 11 August 2015 07:48:02PM 5 points [-]

then an AIXI that follows one prior can be arbitrarily stupid with respect to another.

Yet another application of David Wolpert's No Free Lunch theorems.

We have dubbed the associated results NFL theorems because they demonstrate that if an algorithm performs well on a certain class of problems then it necessarily pays for that with degraded performance on the set of all remaining problems.

https://en.wikipedia.org/wiki/No_free_lunch_theorem

Comment author: MrMind 12 August 2015 10:09:58AM 1 point [-]

NFL works with algorithms operating on finite problems. With algorithms operating on unbounded problems, you can benefit from Blum's speedup theorem: for every algorithm and every computable measure of performance, there's a second algorithm performing better than the first on almost all inputs.

I suspect here's happening something similar: AIXI is finitely bias-able, and there are environments that can exploit that to arbitrarily constrain the agent's behaviour. If the analogy holds, there's then a class of environments for which AIXI, however finitely biased, is still optimally intelligent.

Comment author: Stuart_Armstrong 11 August 2015 09:17:21PM 1 point [-]

Yes. The no free lunch theorems are powerful in theory, but almost pointless in practice. I was hoping that AIXI could evade them, even in theory, but it seems to not be the case.

Comment author: buybuydandavis 26 August 2015 02:51:15AM 0 points [-]

The point of the NFL theorems in practice is to keep you from getting your hopes up that you'll get a free lunch.

Comment author: Stuart_Armstrong 26 August 2015 01:44:45PM 0 points [-]

So the point of no free lunch theorems is to tell you you won't get a free lunch? ^_^

Comment author: buybuydandavis 27 August 2015 12:38:50AM 0 points [-]

There's another real point - focusing on your prior as "fit" to the problem/universe.

The space of possible priors Wolpert considered were very unlike our experience - basically imposing no topological smoothness on points - every point is a ball from the urn of possible balls. That's just not the way it is. Choosing your prior, and exploiting the properties of your prior then becomes the way to advance.