Metaphilosophical Mysteries

Wei Dai

59 Metaphilosophical Mysteries

27th Jul 2010

3 min read

59

Creating Friendly AI seems to require us humans to either solve most of the outstanding problems in philosophy, or to solve meta-philosophy (i.e., what is the nature of philosophy, how do we practice it, and how should we program an AI to do it?), and to do that in an amount of time measured in decades. I'm not optimistic about our chances of success, but out of these two approaches, the latter seems slightly easier, or at least less effort has already been spent on it. This post tries to take a small step in that direction, by asking a few questions that I think are worth investigating or keeping in the back of our minds, and generally raising awareness and interest in the topic.

The Unreasonable Effectiveness of Philosophy

It seems like human philosophy is more effective than it has any right to be. Why?

First I'll try to establish that there is a mystery to be solved. It might be surprising so see the words "effective" and "philosophy" together in the same sentence, but I claim that human beings have indeed made a non-negligible amount of philosophical progress. To cite one field that I'm especially familiar with, consider probability and decision theory, where we went from having no concept of probability, to studies involving gambles and expected value, to subjective probability, Bayesian updating, expected utility maximization, and the Turing-machine-based universal prior, to the recent realizations that EU maximization with Bayesian updating and the universal prior are both likely to be wrong or incomplete.

We might have expected that given we are products of evolution, the amount of our philosophical progress would be closer to zero. The reason for low expectations is that evolution is lazy and shortsighted. It couldn't possibly have "known" that we'd eventually need philosophical abilities to solve FAI. What kind of survival or reproductive advantage could these abilities have offered our foraging or farming ancestors?

From the example of utility maximizers, we also know that there are minds in the design space of minds that could be considered highly intelligent, but are incapable of doing philosophy. For example, a Bayesian expected utility maximizer programmed with a TM-based universal prior would not be able to realize that the prior is wrong. Nor would it be able to see that Bayesian updating is the wrong thing to do in some situations.

Why aren't we more like utility maximizers in our ability to do philosophy? I have some ideas for possible answers, but I'm not sure how to tell which is the right one:

Philosophical ability is "almost" universal in mind space. Utility maximizers are a pathological example of an atypical mind.
Evolution created philosophical ability as a side effect while selecting for something else.
Philosophical ability is rare and not likely to be produced by evolution. There's no explanation for why we have it, other than dumb luck.

As you can see, progress is pretty limited so far, but I think this is at least a useful line of inquiry, a small crack in the problem that's worth trying to exploit. People used to wonder at the unreasonable effectiveness of mathematics in the natural sciences, especially in physics, and I think such wondering eventually contributed to the idea of the mathematical universe: if the world is made of mathematics, then it wouldn't be surprising that mathematics is, to quote Einstein, "appropriate to the objects of reality". I'm hoping that my question might eventually lead to a similar insight.

Objective Philosophical Truths?

Consider again the example of the wrongness of the universal prior and Bayesian updating. Assuming that they are indeed wrong, it seems that the wrongness must be objective truths, or in other words, it's not relative to how the human mind works, or has anything to do with any peculiarities of the human mind. Intuitively it seems obvious that if any other mind, such as a Bayesian expected utility maximizer, is incapable of perceiving the wrongness, that is not evidence of the subjectivity of these philosophical truths, but just evidence of the other mind being defective. But is this intuition correct? How do we tell?

In certain other areas of philosophy, for example ethics, objective truth either does not exist or is much harder to find. To state this in Eliezer's terms, in ethics we find it hard to do better than to identify "morality" with a huge blob of computation which is particular to human minds, but it appears that in decision theory "rationality" isn't similarly dependent on complex details unique to humanity. How to explain this? (Notice that "rationality" and "morality" otherwise share certain commonalities. They are both "ought" questions, and a utility maximizer wouldn't try to answer either of them or be persuaded by any answers we might come up with.)

These questions perhaps offer further entry points to try to attack the larger problem of understanding and mechanizing the process of philosophy. And finally, it seems worth noting that the number of people who have thought seriously about meta-philosophy is probably tiny, so it may be that there is a bunch of low-hanging fruit hiding just around the corner.

Meta-PhilosophyMind SpacePhilosophyWorld Modeling

Frontpage

59

New Comment

Rendering 0/266 comments, sorted by

top scoring

(show more) Click to highlight new comments since: Today at 6:36 PM

Some comments are truncated due to high volume. (⌘F to expand all)Change truncation settings

Moderation Log

59 Metaphilosophical Mysteries

by Wei Dai

27th Jul 2010

3 min read

266

59

The Unreasonable Effectiveness of Philosophy

It seems like human philosophy is more effective than it has any right to be. Why?

Why aren't we more like utility maximizers in our ability to do philosophy? I have some ideas for possible answers, but I'm not sure how to tell which is the right one:

Philosophical ability is "almost" universal in mind space. Utility maximizers are a pathological example of an atypical mind.
Evolution created philosophical ability as a side effect while selecting for something else.
Philosophical ability is rare and not likely to be produced by evolution. There's no explanation for why we have it, other than dumb luck.

Objective Philosophical Truths?

Meta-PhilosophyMind SpacePhilosophyWorld Modeling

Frontpage

59

Mentioned in

174References & Resources for LessWrong

165Meta Questions about Metaphilosophy

138Problems I've Tried to Legibilize

83Some Thoughts on Metaphilosophy

70Where do selfish values come from?

Load More (5/9)

New Comment

Rendering 0/266 comments, sorted by

top scoring

(show more) Click to highlight new comments since: Today at 6:36 PM

Some comments are truncated due to high volume. (⌘F to expand all)Change truncation settings

Moderation Log

More from Wei Dai

Curated and popular this week

266Comments

266

Comment Permalink

Sniffnoy16y20

Convergence is more the result of the updates than the original prior. All the initial prior has to be to result in convergence is not completely ridiculous (1, 0, infinitessimals, etc).

Are you certain of this? Could you provide some sort of proof or reference, please, ideally together with some formalization of what you mean by "completely ridiculous"? I'll admit to not having looked up a proof of convergence for the universal prior or worked it out myself, but what you say were really the case, there wouldn't actually be be very much special about the universal prior, and this convergence property of it wouldn't be worth pointing out - so I think I have good reason to be highly skeptical of what you suggest.

However, that doesn't usually last for very long - real organic agents are pretty quickly flooded with information about the state of the universe, and are then typically in a much better position to make probabililty estimates.

Better, yes. But good enough? Arbitrarily close?

You could build agents that were very confident in their priors - and updated them slowly - but only rarely would you want an agent that was handicapped in its ability to adapt and learn.

Sorry, but what does this even mean? I don't understand how this notion of "update speed" translates into the Bayesian setting.

timtyler16y-20

Re: "there wouldn't actually be be very much special about the universal prior"

Well, Occam's razor is something rather special. However, agents don't need an optimal version of it built into them as a baby - they can figure it out from their sensory inputs.

-1timtyler16y

Here's Shane Legg on the topic of how little priors matter when predicting the environment: "In some situations, for example with Solomonoff induction, the choice of the reference machine doesn’t matter too much. [...] the choice of reference machine really doesn’t matter except for very small data sets (which aren’t really the ones we’re interested in here). To see this, have a look at the Solomonoff convergence bound and drop a compiler constant in by the complexity of the environment. The end result is that the Solomonoff predictor needs to see just a few more bytes of the data sequence before it converges to essentially optimal predictions." * http://www.vetta.org/2009/05/on-universal-intelligence/

-1timtyler16y

Re: "I don't understand how this notion of "update speed" translates into the Bayesian setting." Say you think p(heads) is 0.5. If you see ten heads in a row, do you update p(heads) a lot, or a little? It depends on how confident you are of your estimate. If you had previously seen a thousand coin flips from the same coin, you might be confident of p(heads) being 0.5 - and therefore update little. If you were told that it was a biased coin from a magician, then your estimate of p(heads) being 0.5 might be due to not knowing which way it was biased. Then you might update your estimate of p(heads) rapidly - on seing several heads in a row. Like that.

See in context