reply to benelliott about Popper issues

-1 Post author: curi 07 April 2011 08:11AM

This is a discussion page because I got the message "Comment too long". Apparently the same formatting magic doesn't work here for quotes :(  It is a reply to:

http://lesswrong.com/lw/3ox/bayesianism_versus_critical_rationalism/3ulv

 

> > You can conjecture Bayes' theorem. You can also conjecture all the rest, however some things (such as induction, justificationism, foundationalism) contradict Popper's epistemology. So at least one of them has a mistake to fix. Fixing that may or may not lead to drastic changes, abandonment of the main ideas, etc

> Fully agreed. In principle, if Popper's epistemology is of the second, self-modifying type, there would be nothing wrong with drastic changes. One could argue that something like that is exactly how I arrived at my current beliefs, I wasn't born a Bayesian.

OK great.

If the changes were large enough, to important parts (for example if it lost the ability to self-modify) I wouldn't want to call it Popper's epistemology anymore (unless maybe the changes were made very gradually, with Popper's ideas being valued the whole time, and still valued at the end). It would be departing from his tradition too much, so it would be something else. A minor issue in some ways, but tradition matters.

> I can also see some ways to make induction and foundationalism easer to swallow.

> A discussion post sounds about right for this, if enough people like it you might consider moving it to the main site.

104 comments later it's at 0 karma. There is interest, but not so much liking. I don't think the main site is the right place for me ;-)

> > I think you are claiming that seeing a white swan is positive support for the assertion that all swans are white. (If not, please clarify).

> This is precisely what I am saying.

Based on what you say later, I'm not sure if you mean this in the same way I meant it. I meant: it is positive support for "all swans are white" *over* all theories which assert "all swans are black" (I disagree with that claim). If it doesn't support them *more than those other theories* then I regard it as vaccuous. I don't believe the math you offered meets this challenge over supporting "all swans are white" more than various opposites of it. I'm not sure if you intended it to.

> > If so, this gets into important issues. Popper disputed the idea of positive support. The criticism of the concept begins by considering: what is support? And in particular, what is the difference between "X supports Y" and "X is consistent with Y"?

> The beauty of Bayes is how it answers these questions. To distinguish between the two statements we express them each in terms of probabilities.

> "X is consistent with Y" is not really a Bayesian way of putting things, I can see two ways of interpreting it. One is as P(X&Y) > 0, meaning it is at least theoretically possible that both X and Y are true. The other is that P(X|Y) is reasonably large, i.e. that X is plausible if we assume Y.

Consistent means "doesn't contradict". It's the first one. Plausible is definitely not what I wanted.

> "X supports Y" means P(Y|X) > P(Y), X supports Y if and only if Y becomes more plausible when we learn of X. Bayes tells us that this is equivalent to P(X|Y) > P(X), i.e. if Y would suggest that X is more likely that we might think otherwise then X is support of Y.

This is true but fairly vaccous, in my view. I don't want to argue over what counts as significant. If you like it, shrug. It is important that, e.g., we reject ideas refuted by evidence. But I don't think this addresses the major problems in epistemology which come after we decide to reject things which are refuted by evidence.

The reason it doesn't is there's always infinitely many things supported by any evidence, in this sense. Infinitely many things which make wildly different predictions about the future, but identical predictions about whatever our evidence covers. If Y is 10 white swans, and X is "all swans are white" then X is supported, by your statement. But also supported are infinitely many different theories claiming that all swans are black, and that you hallucinated. You saw exactly what you would see if any of those theories were true, so they get as much support as anything else. There is nothing (in the concept of support) to differentiate between "all swans are white" and those other theories.

If you do add something else to differentiate, I will say the support concept is useless. The new thing does all the work. And further, the support concept is frequently abused. I have had people tell me that "all swans are black, but tomorrow you will hallucinated 10 white swans" is supported less by seeing 10 white swans tomorrow than "all swans are white" is, even though they made identical predictions (and asserted them with 100% probability, and would both have been definitely refuted by anything else). That kind of stuff is just wrong. I don't know if you think that kind of thing or not. What you said here does clearly disown it, nor advocate it. But that's the kind of thing that concerns me.

> Suppose we make X the statement "the first swan I see today is white" and Y the statement "all swans are white". P(X|Y) is very close to 1, P(X|~Y) is less than 1 so P(X|Y) > P(X), so seeing a white swan offers support for the view that all swans are white. Very, very weak support, but support nonetheless.

The problem I have is that it's not supported over infinitely many rivals. So how is that really support? It's useless. The only stuff not being supported is that which contradicts the evidence (like, literally contradicts, with no hallucination claims. e.g. a theory that predicts you will think you saw a green swan tomororw. but then you don't, just the white ones. that one is refuted). The inconsistent theories are refuted. The theories which make probabalistic predictions are partially supported. And the theories that say "screw probability, 100% every time" for all predictions get maximally supported, and between them support does not differentiate. (BTW I think it's ironic that I score better on support when I just stick 100% in front of every prediction in all theories I mention, while you score lower by putting in other numbers, and so your support concept discourages ever making predictions with under 100% confidence).

> (The above is not meant to be condescending, I apologise if you know all of it already).

It is not condescending. I think (following Popper) that explaining things is important and that nothing is obvious, and that communication is difficult enough without people refusing go over the "basics" in order to better understand each other. Of course this is a case where Popper's idea is not unique. Other people have said similar. But this idea, and others, are integrated into his epistemology closely. There's also *far more detail and precision* available, to explain *why* this stuff is true (e.g. lengthy theories about the nature of communication, also integrated into his epistemology). I don't think ideas about interpretting people's writing in kind ways, and miscommunication being a major hurdle, are so closely integrated with Bayesian approaches with are more math focussed and don't integrate so nicely with explanations.

My reply about support is basic stuff too, to my eye. But maybe not yours. I don't know. I expect not, since if it was you could have addressed it in advance. Oh well. It doesn't matter. Reply as you will. No doubt I'm also failing to address in advance something you regard as important.

> > To show they are correct. Popper's epistemology is different: ideas never have any positive support, confirmation, verification, justification, high probability, etc...

> This is a very tough bullet to bite.

Yes it is tough. Because this stuff has been integral to the Western philosophy tradition since Aristotle until Popper. That's a long time. It became common sense, intuitive, etc...

> > How do we decide which idea is better than the others? We can differentiate ideas by criticism. When we see a mistake in an idea, we criticize it (criticism = explaining a mistake/flaw). That refutes the idea. We should act on or use non-refuted ideas in preference over refuted ideas.

> One thing I don't like about this is the whole 'one strike and you're out' feel of it. It's very boolean,

Hmm. FYI that is my emphasis more than Popper's. I think it simplifies the theory a bit to regard all changes to theories as new theories. Keep in mind you can always invent a new theory with one thing changed. So the ways it matters have some limits, it's party just a terminology thing (terminolgoy has meaning, and some is better than others. Mine is chosen with Popperian considerations in mind. A lot of Popper's is chosen with considerations in mind of talking with his critics). Popper sometimes emphasized that it's important not to give up on theories too easily, but to look for ways to improve them when they are criticized. I agree with that. So, the "one strike you're out" way of expressing this is misleading, and isn't *substantially* implied in my statements (b/c of the possibility of creating new and similar theories). Other terminologies have different problems.

> the real world isn't usually so crisp. Even a correct theory will sometimes have some evidence pointing against it, and in policy debates almost every suggestion will have some kind of downside.

This is a substantive, not terminological, disagreement, I believe. I think it's one of the *advantages* of my terminology that it helped highlight this disagreement.

Note the idea that evidence "points" is the support idea.

In the Popperian scheme of things, evidence does not point. It contradicts, or it doesn't (given some interpretation and explanation, which are often more important than the evidence itself). That's it. Evidence can thus be used in criticisms, but is not itself inherently a criticism or argument.

So let me rephrase what you were saying. "Even a correct theory will sometimes have critical arguments against it".

Part of the Popperian view is that if an idea has one false aspect, it is false. There is a sense in which any flaw must be decisive. We can't just go around admitting mistakes into our ideas on purpose.

One way to explain the issue is: for each criticism, consider it. Judge if it's right or wrong. Do your best and act on the consequence. If you think the criticism is correct, you absolutely must reject the idea it criticizes. If you don't, then you can regard the theory as not having any *true* critical arguments against it, so that's fine.

When you reject an idea for having one false part, you can try to form a new theory to rescue the parts you still value. This runs into dangers of arbitrarily rescuing everything in an ad hoc way. There's two answers to that. The first is: who cares? Popperian epistemology is not about laying out rules to prevent you from thinking badly. It's about offering advice to help you think better. We don't really care very much if you find a way to game the system and do something dumb, such as making a series of 200 ad hoc and silly arguments to try to defend a theory you are attached to. All we'll do is criticize you for it. And we think that is good enough: there are criticisms of bad methodologies, but no formal rules that definitively ban them. Now the second answer, which Deutsch presents in The Fabric of Reality, is that when you modify theories you often ruin their explanation. If you don't, then the modification is OK, it's good to consider this new theory, it's worth considering. But if the explanation is ruined, that puts an end to trying to rescue it (unless you can come up with a good idea for a new way to modify it that wont' ruin the explanation).

This concept of ruining explanations is important and not simple. Reading the book would be great (it is polished! edited!) but I'll try to explain it briefly. This example is actually from his other book, _The Beginning of Infinity_ chapter 1. We'll start with a bad theory: the seasons are caused by Persephone's imprisonment, for 6 months of the year, in the underworld (via her mother Demeter's magic powers which she uses to express her emotions). This theory has a bad explanation in the first place, so it can be easily rescued when it's emprically contradicted. For example this theory predicts the seasons will be the same all over the globe, at the same time. That's false. But you can modify the theory very easily to account for the empirical data. You can say that Demeter only cares about the area where she lives. She makes it cold when Persephone is gone, and hot when she's present. The cold or hot has to go somewhere, so she puts it far away. So, the theory is saved by an ad hoc modification. It's no worse than before. Its substantive content was "Demeter's emotions and magic account for the seasons". And when the facts change, that explanation remains in tact. This is a warning against bad explanations (which can be criticized directly for being bad explanations, so there's no big problem here).

But when you have a good explanation, such as the real explanation for the seasons, based on the Earth orbitting the sun, and the axis being tilted, and so on, ad hoc modifications cause bigger problems. Suppose we found out the seasons are the same all around the world at the same time. That would refute the axis tilt theory of seasons. You could try to save it, but it's hard. If you added magic you would be ruining the axis tilt *explantion* and resorting to a very different explanation. I can't think of any way to save the axis tilt theory from the observation that the whole world has the same seasons as the same time, without contradicting or replacing its explanation. So that's why ad hoc modifications sometimes fail (for good explanatory theories only). In the cases where there is not a failure of this type -- if there is a way to keep a good explanation and still account for new data -- then that new theory is genuinely worth consideration (and if there is some thing wrong with it, you can criticize it).

> There is also the worry that there could be more than one non-refuted idea, which makes it a bit difficult to make decisions.

Yes I know. This is an important problem. I regard it as solved. For discussion of this problem, go to:

http://lesswrong.com/r/discussion/lw/551/popperian_decision_making/

> Bayesianism, on the other hand, when combined with expected utility theory, is perfect for making decisions.

Bayesianism works when you assume a bunch of stuff (e.g. some evidence), and you set up a clean example, and you choose an issue it's good at handling. I don't think it is very helpful in a lot of real world cases. Certaintly it helps in some. I regard Bayes' theorem itself as "how not to get probability wrong". That matters to a good amount of stuff. But hard real world scenarios usually have rival explanations of the proper interpretation of the available evidence, they have fallible evidence that is in doubt, they have often many different arguments that are hard to assign any numbers to, and so on. Using solomonoff induction is assign numbers, for example, doesn't work in practice as far as i know (e.g. people don't actually compute the numbers for dozens of political arugments using it). Another assumption being made is *what is a desirable (high utility) outcome* -- Bayesianism doesn't help you figure that out, it just lets you assume it (I see that as entrenching bias and subjectivism in reagards to morality -- we *can* make objective criticisms of moral values).

 

Comments (187)

Comment author: [deleted] 07 April 2011 03:17:26PM 5 points [-]

I want to emphasize this line from the long op, which I think is curi's best argument:

Hard real world scenarios usually have rival explanations of the proper interpretation of available evidence, they have fallible evidence that is in doubt, they have often many different arguments that are hard to assign any numbers to, and so on.

Therefore Bayesianism does not describe the way that we actually find out true things. I think this is a pretty compelling criticism of Bayes, does anyone have a stock answer?

Comment author: timtyler 07 April 2011 05:15:32PM *  2 points [-]

Therefore Bayesianism does not describe the way that we actually find out true things. I think this is a pretty compelling criticism of Bayes, does anyone have a stock answer?

It isn't a theory about human psychology in the first place.

Comment author: curi 07 April 2011 09:28:30PM 1 point [-]

Epistemologies are theories about how knowledge is created.

Humans create knowledge.

If you want to be an epistemology, address the problem of how they do it.

Comment author: timtyler 08 April 2011 12:50:13PM 1 point [-]

Humans do all kinds of things badly. They are becoming obsolete.

For a neater perspective that is more likely to stand the test of time, it is better to consider how machines create knowledge.

Comment author: komponisto 07 April 2011 04:39:13PM 2 points [-]

Firstly, "the way we actually do X is by Y" is never a valid criticism of a theory saying "the way we should do X is by Z". (Contemporary philosophers are extremely fond of this mistake, it must be said.) If we're not using Bayes, then maybe we're doing it wrong.

Secondly, that the fact that we don't consciously think in terms of numbers doesn't mean that our brains aren't running Bayes-like algorithms on a low level not accessible to conscious introspection.

Comment author: JoshuaZ 07 April 2011 06:31:00PM 2 points [-]

Secondly, that the fact that we don't consciously think in terms of numbers doesn't mean that our brains aren't running Bayes-like algorithms on a low level not accessible to conscious introspection.

Failure to perform correctly on the Monty Hall problem is cross-cultural. I haven't seen the literature in any detail but my impression is that the conjunction fallacy is also nearly universal. Whatever humans are doing it isn't very close to Bayes.

Comment author: komponisto 07 April 2011 08:51:01PM 3 points [-]

Emphasis on low-level. Thinking "hmm, the probability of this outcome is 1/3" is high-level, conscious cognition. The sense in which we're "Bayesians" is like the sense in which we're good at calculus: catching balls, not (necessarily) passing written tests.

The conjunction fallacy is a closer to being a legitimate counterargument, but I would remind you that "Bayes-like" does not preclude the possibility of deviations from Bayes.

Perhaps some general perspective would be helpful. My point of view is that "inference = Bayes" is basically an analytic truth. That is, "Bayesian updating" is the mathematically precise notion that best corresponds to the vague, confused human idea of "inference". The latter turns out to mean Bayesian updating in the same sense that our intuitive idea of "connectedness" turns out to mean this. As such, we can make our discourse strictly more informative by replacing talk of "inference" with talk of "Bayesian updating" throughout. We can talk about Bayesian updating done correctly, and done incorrectly. For example, instead of saying "humans don't update according to Bayes", we should rather say, "humans are inconsistent in their probability assignments".

Comment author: curi 07 April 2011 09:31:06PM -2 points [-]

That is, "Bayesian updating" is the mathematically precise notion that best corresponds to the vague, confused human idea of "inference".

I agree with you that "inference" is a vague and confused notion.

I don't agree that finding some math that somewhat corresponds to a bad idea, makes things better!

Popper's approach to it is to reject the idea and come up with better, non-confused ideas.

Comment author: komponisto 07 April 2011 09:44:10PM 4 points [-]

An idea becomes "non-confused" when it is turned into math. "Inference" may be a confused notion, but Bayesian updating isn't.

If Popper has better math than Bayes, so much the better. That's not the impression I get from your posts, however. The impression I get from your posts is that you meant to say "Hey! Check out this great heuristic that Karl Popper came up with for generating more accurate probabilities!" but instead it came out as "Bayes sucks! Go Popper!"

Comment author: curi 07 April 2011 09:47:59PM 0 points [-]

If the math is non-confused, and the idea is confused, then what's going on is not that the idea became non-confused but the math doesn't correspond to reality.

If Popper has better math than Bayes, so much the better.

He doesn't have a lot of math.

No matter how much math you have, you always face problems of considering issues like whether some mathematical objects correspond to some real life things, or not. And you can't settle those issues with math.

"Bayes sucks! Go Popper!"

You guys are struggling with problems, such as justificationism, which Popper solved. Also with instrumentalism, lack of appreciation for explanatory knowledge, foundationalism, etc

Comment author: komponisto 07 April 2011 10:03:15PM 1 point [-]

If the math is non-confused, and the idea is confused, then what's going on is not that the idea became non-confused but the math doesn't correspond to reality.

What? Only confused ideas correspond to reality? That makes no sense.

No matter how much math you have, you always face problems of considering issues like whether some mathematical objects correspond to some real life things, or not. And you can't settle those issues with math.

You settle those issues by experiment.

You guys are struggling with problems, such as justificationism, which Popper solved. Also with instrumentalism, lack of appreciation for explanatory knowledge, foundationalism, etc

I'm not sure I see the problem, frankly. As far as I can tell this would be like me telling you that you're "struggling with the problem of Popperianism".

Comment author: curi 07 April 2011 10:15:41PM 1 point [-]

If you take a confused idea, X. And you take some non-confused math, Y. Then they do not correspond precisely.

No matter how much math you have, you always face problems of considering issues like whether some mathematical objects correspond to some real life things, or not. And you can't settle those issues with math.

You settle those issues by experiment.

Can't be done. When you try to set up an experiment you always have to have philosophical theories. For example if you want to measure something, you need a theory about the nature of your measuring device. e.g. you'll want to come up with some mathematical properties and know if they correspond to the real physical object. So you run into the same problem again.

I'm not sure I see the problem, frankly.

How are theories justified?

How are theories induced? If you say using the solomonoff prior, then are the theories it offers always best? If not, that's a problem, right? If yes, what's the argument for that?

Comment author: curi 07 April 2011 09:29:51PM 1 point [-]

I actually don't agree with this. Those problems are caused by memes, not hardware.

Cross cultural is caused by the logic of the situation different early cultures were being in, and what mistakes are easy to make, being similar.

Comment author: JoshuaZ 08 April 2011 01:45:20AM 2 points [-]

I actually don't agree with this. Those problems are caused by memes, not hardware

How would you test this claim? (Note by the way that in the case of Monty Hall, the percentages don't change much from culture to culture. It is consistently between 75-90% refusing to switch in all tested cultures. This is actually one of the things that convinced me that this isn't memetic. )

Comment author: [deleted] 08 April 2011 01:45:53AM 0 points [-]

Firstly, "the way we actually do X is by Y" is never a valid criticism of a theory saying "the way we should do X is by Z". (Contemporary philosophers are extremely fond of this mistake, it must be said.) If we're not using Bayes, then maybe we're doing it wrong.

Let me go further. The way people with a good track record of finding out true things (for instance, komponisto) actually go about finding out true things is by collecting explanations and criticisms of those explanations, not by computing priors and posteriors.

What would it mean to be doing it wrong? I can only think of: believing a lot of false things. So tell me some false things that I could come to believe by Popperian methods, that I wouldn't come to believe by Bayesian methods, or even better show me that the converse happens much more rarely.

Secondly, that the fact that we don't consciously think in terms of numbers doesn't mean that our brains aren't running Bayes-like algorithms on a low level not accessible to conscious introspection.

Sure. For instance there's good evidence that our brains judge what color something is by a Bayesian process. But why should I take advice about epistemology from such an algorithm?

Comment author: jimrandomh 07 April 2011 12:10:09PM 11 points [-]

At this point, I have to conclude that you just plain don't understand Bayesian epistemology well enough to criticize it. I also suspect that you have become too strongly opinionated on this topic to be able to learn, at least until you get some distance.

The principle difference between Bayesian and Popperian epistemology is that Bayesianism is precise; it puts all necessary ambiguity in the prior, and assumes only noncontroversial, well-tested mathematical mathematical axioms, and everything thereafter is deductively sound math. In Popper, the ambiguity (which is still necessary) is in the definitions and spreads through the whole system, making its predictions much less concrete and thus making it hard to falsify.

To make progress in epistemology beyond Popper, you must switch from English to math. It takes a lot of work and a lot of time to rebuild a fuzzy English understanding of epistemology as a precise math understanding, but you will find that the precise math reproduces the same predictions, and many more predictions that the fuzzy English could never have made.

Comment author: [deleted] 07 April 2011 03:19:23PM 6 points [-]

The principle difference between Bayesian and Popperian epistemology is that Bayesianism is precise; it puts all necessary ambiguity in the prior, and assumes only noncontroversial, well-tested mathematical mathematical axioms, and everything thereafter is deductively sound math.

I think you're overselling it. Here are two big weaknesses of Bayesian epistemology as I understand it:

  1. it cannot handle uncertainty about unproved mathematical truths.

  2. It does not describe the way any existing intelligence actually operates, or even could operate in principle. (That last clause is the problem of writing an AI.)

I have never seen on this website any argument resolved or even approached on semirigorous Bayesian lines, except a couple of not-so-successful times between Yudkowsky and Hanson. Popper, or the second-hand accounts of him that I understand, seems to describe the way that I (and I think you!) actually think about things: we collect a big database of explanations and of criticisms of those explanations, and we decide the merits of those explanations and criticisms using our messy judgement.

In cases that satisfy some mild assumptions (but not so mild as to handle the important problem 1.!) this might be equivalent to Bayesianism. But equivalences go both ways, and Popper seems to be what we actually practice -- what's the problem?

Comment author: komponisto 07 April 2011 04:59:18PM 0 points [-]

Here are two big weaknesses of Bayesian epistemology as I understand it:

1.it cannot handle uncertainty about unproved mathematical truths.

Why not? You just use an appropriate formalization of mathematics, and treat it as uncertainty about the behavior of a proof-searching machine.

I have never seen on this website any argument resolved or even approached on semirigorous Bayesian lines, except a couple of not-so-successful times between Yudkowsky and Hanson

I can think of at least one concrete example. But I'm guessing you were familiar with that example (and numerous other smaller-scale ones) and rejected it, so you must mean something different than I do by "argument approached on semirigorous Bayesian lines".

equivalences go both ways, and Popper seems to be what we actually practice -- what's the problem?

Perhaps there isn't any, except insofar as the poster is claiming that Bayes is wrong because it isn't Popper.

Comment author: JoshuaZ 07 April 2011 06:29:10PM 0 points [-]

Why not? You just use an appropriate formalization of mathematics, and treat it as uncertainty about the behavior of a proof-searching machine.

Unfortunately this isn't helpful. Consider for example a Turing machine that seems to halt on all inputs, and we know that when this one halts it halts with either a 0 or a 1. Does this machine represent a computable sequence (hence should have non-zero probability assigned if one is using a Solomonoff prior)? If we haven't resolved that question we don't know. But in order to use any form of prior over computable sequences we need to assume that we have access to what actually represents a computable hypothesis and what doesn't. There are other problems as well.

Comment author: komponisto 07 April 2011 09:15:36PM 0 points [-]

I'm having trouble parsing your third (I don't know what it means for a Turing machine to [fail to] "represent a computable sequence", especially since I thought that a "computable sequence" was by definition the output of a Turning machine) and fourth (we don't know what?) sentences, but if your general point is what I think it is ("after formalizing logical uncertainty, we'll still have meta-logical uncertainty left unformalized!"), that's simply a mathematical fact, and not an argument against the possibility of formalizing logical uncertainty in the first place.

Comment author: JoshuaZ 08 April 2011 01:52:30AM *  1 point [-]

I'm having trouble parsing your third (I don't know what it means for a Turing machine to [fail to] "represent a computable sequence", especially since I thought that a "computable sequence" was by definition the output of a Turning machine)

A sequence f(n) is computable if there's a Turing machine T that given input n halts with output f(n). But, not all Turing machines halt on all inputs. It isn't hard to make Turing machines that go into trivial infinite loops, and what is worse, Turing machines can fail to halt in much more ugly and non-obvious ways to the point where the question "Does the Turing machine M halt on input n" is not in general decidable. This is known at the Halting theorem. So if I'm using some form of Solomonoff prior I can't even in general tell whether a machine describes a point in my hypothesis space.

Comment author: komponisto 08 April 2011 02:15:18AM 0 points [-]

What I don't understand is your argument that there is a specific problem with logical uncertainty that doesn't apply to implementing Solomonoff induction in general. Yes, the halting problem is undecidable, so you can't decide if a sequence is computable; but assuming you've already got a Solomonoff-induction machine that can say "my probability that it will rain tomorrow is 50%", why can't it also say "my probability that the Riemann Hypothesis is true is 50%"?

Comment author: JoshuaZ 08 April 2011 02:23:04AM 1 point [-]

but assuming you've already got a Solomonoff-induction machine that can say "my probability that it will rain tomorrow is 50%", why can't it also say "my probability that the Riemann Hypothesis is true is 50%"?

That's actually a really good example. It isn't that difficult to make a Turing machine that halts if and only if the Riemann hypothesis is true. So a system using Solomonoff induction has to recognize for starters whether or not that Turing machine halts. Essentially, in the standard version of Solomonoff induction, you need to assume that you have access to indefinitely large computing power. You can try making models about what happens when you have limited computational power in your entity (In some sense AIXI implementations and implementations of Bayesian reasoning need to do close to this). But if one doesn't assume that one has indefinite computing power then a lot of the results about how different priors behave no longer hold (or at least the proofs don't obviously go through). For more detail on that sort of thing I'd recommend talking to cousin_it or jimrandomh since they've thought and know a lot more about these issues than I do.

Comment author: komponisto 08 April 2011 03:19:17PM *  0 points [-]

It isn't that difficult to make a Turing machine that halts if and only if the Riemann hypothesis is true. So a system using Solomonoff induction has to recognize for starters whether or not that Turing machine halts.

Only in the sense that a human trying to solve the Riemann Hypothesis also has to recognize whether the same Turing machine halts.

When I talk about "going meta", I really mean it: when the Solomonoff machine that I have in mind is considering "whether this sequence is computable" or "whether the Riemann Hypothesis is true" or more generally "whether this Turing machine halts", it is going to be doing so the same way a human does: by using a model of the mathematical object in question that isn't actually equivalent to that same mathematical object. It won't be answering the question "natively", the way a computer typically adds 3+5 (i.e. by specific addition algorithms built into it); instead, it will be more closely analogous to a computer being programmed to simulate three apples being combined with five apples on its display screen, and then count the apples by recognizing their visual representation.

So the upshot is that to be able to give an answer the question "what is your probability that this Turing machine halts?", the Solomonoff AI does not need to solve anything equivalent to the halting problem. It just needs to examine the properties of some internal model corresponding to the label "Turing machine", which need not be an actual Turing machine. It is in this way that uncertainty about mathematical truths is handled.

It should go without saying that this isn't directly of use in building such an AI, because it doesn't tell you anything about how to construct the low-level algorithms that actually run it. But this thread isn't about how to build a Bayesian AI; rather, it's about whether a Bayesian AI is something that it makes sense to build. And my point here is that "Well, if you had a Bayesian AI, it wouldn't be able to give you probability estimates concerning the truth of mathematical statements" is not a valid argument on the latter question.

Comment author: [deleted] 08 April 2011 03:56:57PM 0 points [-]

So the upshot is that to be able to give an answer the question "what is your probability that this Turing machine halts?", the Solomonoff AI does not need to solve anything equivalent to the halting problem.

By "Solomonoff AI" do you mean "some computable approximation of a Solomonoff AI"? My impression is that the Solomonoff prior just does solve the halting problem, and that this is a standard proof that it is uncomputable.

when the Solomonoff machine that I have in mind is considering "whether this sequence is computable" or "whether the Riemann Hypothesis is true" or more generally "whether this Turing machine halts", it is going to be doing so the same way a human does.

Humans are bad at this. Is there some reason to think that a "the Solomonoff machine you have in mind" will be better at it?

Comment author: timtyler 07 April 2011 05:02:00PM 0 points [-]

Here are two big weaknesses of Bayesian epistemology as I understand it:

it cannot handle uncertainty about unproved mathematical truths.

Why do you think that?

It does not describe the way any existing intelligence actually operates, or even could operate in principle. (That last clause is the problem of writing an AI.)

Solomonoff induction is uncomputable? So: use a computable approximation.

Comment author: curi 07 April 2011 07:38:33PM 5 points [-]

Solomonoff induction is uncomputable? So: use a computable approximation.

What is the argument that the approximation you use is good? What I mean is, when you approximate you are making changes. Some possible changes you could make would create massive errors. Others -- the type you are aiming for -- only create small errors that don't spread all over. What is your method of creating an approximation of the second type?

Comment author: JoshuaZ 08 April 2011 03:19:11AM 0 points [-]

There's a large amount of math behind this sort of thing, and frankly, given your other comments I'm not sure that you have enough background. It might help to just read up on Bayeisian machine learning which needs to deal with just this sort of issue. Then keep in mind that there are theorems that given some fairly weak conditions to rule out pathological cases one approximate any distribution by a computable distribution to arbitrary accuracy. You need to be careful about what metric you are using but it turns out to be true for a variety of different notions of approximating and different metrics. While this is far from my area of expertise, so I'm by no means an expert on this, my impression is that the theorems are essentially of the same flavor as the theorems one would see in a real analysis course about approximating functions with continuous functions or polynomial functions.

Comment author: timtyler 07 April 2011 09:02:45PM 0 points [-]

Making computable approximations of Solomonoff induction is a challenging field which it seems inappropriate to try and cram into a blog comment. Probably the short answer is "by using stochastic testing".

Comment author: [deleted] 08 April 2011 02:03:09AM 3 points [-]

Why do you think that?

If you think I'm mistaken, please say so and elaborate.

Solomonoff induction is uncomputable? So: use a computable approximation.

It's hard for me to believe that you haven't thought of this, but it's difficult to "approximate" an uncomputable function. Think of any enormous computable function f(n) you like. Any putative "approximation" of the busy beaver function is off by a factor larger than f(n). I bet Solomonoff is similarly impossible to approximate -- am I wrong?

Comment author: timtyler 08 April 2011 03:11:48AM 1 point [-]

I am not aware of any particular issues regarding Bayesian epistemology handling uncertainty about unproved mathematical truths. How is that different from other cases where there is uncertainty?

Using a computable approximation of Solomonoff induction is a standard approach. If you don't have a perfect compressor, you just use the best one you have. It is the same with Solomonoff induction.

Comment author: [deleted] 08 April 2011 02:01:34PM 0 points [-]

I am not aware of any particular issues regarding Bayesian epistemology handling uncertainty about unproved mathematical truths. How is that different from other cases where there is uncertainty?

I outlined the problem with mathematical uncertainty here. The only reason I believe that this is an open problem is on Yudkowsky's say-so in his reply.

Using a computable approximation of Solomonoff induction is a standard approach. If you don't have a perfect compressor, you just use the best one you have. It is the same with Solomonoff induction.

Standard approach to what? I don't know what a "compressor" or "perfect compressor" is, if those are technical terms.

To me, the question is whether an approximation to Solomonoff induction has approximately the same behavior as Solomonoff induction. I think it can't, for instance because no approximation of the busy beaver function (even the "best compressor you have") behaves anything like the busy beaver function. If you think this is a misleading way of looking at it please tell me.

Comment author: JoshuaZ 08 April 2011 02:12:25PM 1 point [-]

To me, the question is whether an approximation to Solomonoff induction has approximately the same behavior as Solomonoff induction. I think it can't, for instance because no approximation of the busy beaver function (even the "best compressor you have") behaves anything like the busy beaver function. If you think this is a misleading way of looking at it please tell me.

Solomonoff induction can't handle the Busy Beaver function because Busy Beaver is non-computable. So it isn't an issue for approximations of Solomonoff (except in so far as they can't handle it either).

Comment author: [deleted] 08 April 2011 02:16:18PM 0 points [-]

I am not saying that "Solomonoff can't handle Busy Beaver." (I'm not even sure I know what you mean.) I am saying that Solomonoff is analogous to Busy Beaver, for instance because they are both noncomputable functions. Busy Beaver is non-approximatable in a strong sense, and so I think that Solomonoff might also be non-approximatable in an equally strong sense.

Comment author: timtyler 08 April 2011 05:14:29PM 0 points [-]

Kolmogorov complexity is uncomputable, but you can usefully approximate Kolmogorov complexity for many applications using PKZIP. The same goes for Solomonoff induction. Its prior is based on Kolmogorov complexity.

Comment author: timtyler 08 April 2011 04:29:55PM *  0 points [-]

I outlined the problem with mathematical uncertainty here.

Don't agree with the premise there. As to what Yudkowsky is talking about by saying "Logical uncertainty is an open problem" - it beats me. There's really only uncertainty. Uncertainty about mathematics is much the same as other kinds of uncertainty.

Comment author: [deleted] 08 April 2011 04:32:53PM 0 points [-]

What premise?

Comment author: timtyler 08 April 2011 04:53:45PM 0 points [-]

The first 5 lines of the post - this bit::

Any prior P, of any agent, has at least one of the following three properties:

Comment author: [deleted] 08 April 2011 10:49:05PM 0 points [-]

I can prove that to you, unless I made a mistake. Are you saying you can defeat it a priori by telling me a prior that doesn't have any of those three properties?

Comment author: khafra 07 April 2011 03:36:13PM 0 points [-]

My thinking seems to me more qualitatively bayesian than popperian. I don't have a good enough memory to keep all the criticisms I've ever heard of each theory I provisionally accept in mind. Instead, when I encounter a criticism that seems worth considering, I decrease my belief in the theory by an amount corresponding to the strenghth of the criticism. If I then go on to find evidence that weakens the criticism, strengthens the original theory, or weakens all possible alternate theories, I increase my belief in the original theory again.

Comment author: curi 07 April 2011 09:18:04PM 0 points [-]

You raise an interesting issue which is: what is the strength of a criticism? How is that determined?

For example, your post is itself a criticism of Popperian epistemology. What is the strength of your post?

By not using strengths of arguments, I don't have this problem. Strengths of arguments remind me of proportional representation voting where every side gets a say. PR voting makes a mess of things, not just in practice but also in terms of rigorous math (e.g. Arrow's Theorem)

Comment author: Sniffnoy 08 April 2011 04:58:43AM 1 point [-]

What does Arrow's theorem have to do with proportional representation? Arrow's theorem deals with single-winner ordinal voting systems. Is there some generalization that covers proportional representation as well?

Comment author: curi 08 April 2011 09:37:10AM *  -1 points [-]

For one thing, all elections have a single overall outcome that wins.

Comment author: Sniffnoy 08 April 2011 10:12:14AM 1 point [-]

Indeed, but "single-winner" has a technical meaning here that's rather more restrictive than that. Unless each voter could choose their vote aribitrarily from among the set of those overall outcomes, it's not single-winner.

Comment author: JoshuaZ 08 April 2011 01:47:17AM 1 point [-]

By not using strengths of arguments, I don't have this problem.

Do you intend to treat all criticism equally?

Comment author: [deleted] 07 April 2011 03:48:19PM *  -1 points [-]

I'm suspicious of the notion of "increasing" and "decreasing" a belief. Did you pick those words exactly because you didn't want to use the word "updating"? Why not?

My guess is that having a bad memory is as much a disadvantage for Bayesians as for Popperians.

Comment author: khafra 07 April 2011 04:06:25PM 3 points [-]

I'm suspicious of your suspicion. Is it purely because of the terms I used, or do you really have no beliefs you hold more tenuously than others?

If I ask you whether the sun rose this morning, will you examine a series of criticisms of that idea for validity and strength?

If I ask you whether it's dangerous to swim after a heavy meal, you'll probably check snopes to verify your suspicion that it's an old wives' tale, and possibly check any sources cited at snopes. But will you really store the details of all those arguments in your memory as metadata next to "danger of swimming after a heavy meal," or just mark it as "almost certainly not dangerous"?

Comment author: curi 07 April 2011 09:19:36PM 3 points [-]

To save on storage, learn powerful explanations. Sometimes ideas can be integrated into better ideas that elegantly cover a lot of ground. Finding connections between fields is an important part of learning.

Learning easy to remember rules of thumb -- and improving them with criticism when they cause problems -- is also valuable for some applications.

Comment author: [deleted] 07 April 2011 11:18:40PM 1 point [-]

I don't think the shortcuts I take on easy questions are very demonstrative of anything.

I'm suspicious of your suspicion. Is it purely because of the terms I used, or do you really have no beliefs you hold more tenuously than others?

I know what it means to think an outcome is likely or not likely. I don't know what it means for a belief to be tenuous or not tenuous.

Comment author: David_Gerard 07 April 2011 12:21:00PM 4 points [-]

At this point, I have to conclude that you just plain don't understand Bayesian epistemology well enough to criticize it.

LessWrong is seriously lacking a proper explanation of Bayesian epistemology (as opposed to the theorem itself). Do you have one handy?

Comment author: timtyler 07 April 2011 05:12:41PM 0 points [-]

http://yudkowsky.net/rational/bayes has a section on Bayesian epistemology that compares it to Popper's ideas.

Bayesian epistemology boils down to: use probabilities to represent your confidence in your beliefs, use Bayes's theorem to update your confidences - and try to choose a sensible prior.

Comment author: curi 07 April 2011 07:39:50PM 2 points [-]

Of course I've read that.

It first of all is focussed on Bayes' theorem without a ton of epistemology.

It second of all does not discuss Popper's ideas but only nasty myths about them. See the original post here:

http://lesswrong.com/lw/54u/bayesian_epistemology_vs_popper/

Comment author: [deleted] 07 April 2011 12:50:49PM *  -2 points [-]

Yes, http://wiki.lesswrong.com/wiki/Sequences . Specifically the first four 'sequences' there. Many people on LW will say "Read the sequences!" as an answer to almost everything, like they're some kind of Holy Writ, which can be offputting, but in this case Yudkowsky really does answer most of the objections you've been raising and is the simplest explanation I know of.

ETA - That's weird. When I posted this reply I could have sworn that the username on the comment above wasn't DavidGerard but one I didn't recognise. I wouldn't point DG to the sequences because I know he's read them, but would point a newbie to them. Apologies for the brainfart,

Comment author: [deleted] 07 April 2011 12:59:33PM 3 points [-]

I've read the Sequences, and I don't know what you are referring to--the first four Sequences do explain Bayes-related ideas and how to apply them in everyday life, but they don't address all of the criticism that curi and others have pointed out. Did you have a specific post or series of posts in mind?

Comment author: [deleted] 07 April 2011 01:19:51PM 0 points [-]

The criticisms here that haven't been just noise have mostly boiled down to the question of choosing one hypothesis from an infinite sample space. I'm not sure exactly where in the sequences EY covered this, but I know he did, and did in one of those four, because I reread them recently. Sorry I can't be more help.

Comment author: David_Gerard 07 April 2011 01:27:18PM *  4 points [-]

What I'm meaning to point out is the absence of something with a title along the lines of "Bayesian epistemology" that explains Bayesian epistemology specifically.

What LW has is "here's the theorem" and "everything works by this theorem", without the second one being explained in coherent detail. I mean, Bayes structure is everywhere. But just noting that is not an explanation.

There's potential here for an enormously popular front-page post that gets linked all over the net forever, because the SEP article is so awful ...

Comment author: curi 07 April 2011 09:14:31PM 1 point [-]

At this point, I have to conclude that you just plain don't understand Bayesian epistemology well enough to criticize it.

What you should do is say specifically what I got wrong (just one thing is fine). Then you'll be making a substantive statement!

making its predictions much less concrete and thus making it hard to falsify.

What predictions? It is a philosophical theory.

To make progress in epistemology beyond Popper, you must switch from English to math.

Your conception of epistemology is different than ours. We seek things like explanations that help us to understand the world.

Comment author: jimrandomh 07 April 2011 09:29:44PM 3 points [-]

What you should do is say specifically what I got wrong (just one thing is fine). Then you'll be making a substantive statement!

Ok, here's one. You criticize Bayesian updating for invoking infinitely many hypotheses, as a fundamental problem. In fact, the problem of infinite sets is an issue, but it's resolved in Jaynes' book by a set of rules in which one never deals with infinities directly, but rather with convergent limiting expressions, which are mathematically well-behaved in ways that infinities aren't. This ensures, among other things, that any set of hypotheses (whether finite or infinite) has only finite total plausibility, and lets us compute plausibilities for whole sets at once (ideally, picking out one element and giving it a high probability, and assigning a low total probability to the infinitely many other hypotheses).

What predictions? It is a philosophical theory.

Both theories make predictions about the validity of models using evidence - that is, they predict whether future observations will agree with the model.

Your conception of epistemology is different than ours. We seek things like explanations that help us to understand the world.

No, our conceptions of epistemology are the same. Math does help us understand the world, in ways that natural language can't.

Comment author: curi 07 April 2011 09:42:01PM -2 points [-]

Ok, here's one. You criticize Bayesian updating for invoking infinitely many hypotheses, as a fundamental problem.

No, I didn't say that. I invoked them, because they matter. You then claims Jaynes' deals with the problem. Yet Yudkowsky concedes it is a problem. I don't think you understood me, rather than vice versa.

Both theories make predictions about the validity of models using evidence

Popper never made a prediction like that. And this rather misses some points. Some models for using evidence (e.g. induction) are literally incapable of making predictions (therefore people who do make predictions must be doing something else). Here Popper was not making a prediction, and also was pointing out prediction isn't the right way to judge some theories.

No, our conceptions of epistemology are the same. Math does help us understand the world, in ways that natural language can't.

Can you write philosophical explanations in math? Of course math helps for some stuff, but not everything.

Comment author: jimrandomh 07 April 2011 09:52:57PM 2 points [-]

Ok, here's one. You criticize Bayesian updating for invoking infinitely many hypotheses, as a fundamental problem.

No, I didn't say that. I invoked them, because they matter. You then claims Jaynes' deals with the problem. Yet Yudkowsky concedes it is a problem

Here's where you've really gone astray. You're trying to figure out math by reading what people are saying about it. That doesn't work. In order to understand math, you have to look at the math itself. I'm not sure what statement by Yudkowsky you're referring to, but I'll bet it was something subtly different.

Both theories make predictions about the validity of models using evidence

Popper never made a prediction like that.

Uh, wait a second. Did you really just say that Popper doesn't provide a method for using evidence to decide whether models are valid? There must be some sort of misunderstanding here.

Comment author: curi 07 April 2011 10:17:18PM -1 points [-]

The only way evidence is used is that criticisms may refer to it.

I'm not trying to figure out math, I'm trying to discuss the philosophical issues.

Comment author: JoshuaZ 09 April 2011 05:03:14PM 2 points [-]

The only way evidence is used is that criticisms may refer to it.

Please reread what Jim wrote. You seem to be in agreement with his statement that evidence is used.

I'm not trying to figure out math, I'm trying to discuss the philosophical issues.

Unfortunately, they are interrelated. There's a general pattern here: some people (such as Jaynes and Yudkowsky) are using math as part of their philosophy. In the process of that they are making natural language summaries and interpretations of those claims. You are taking those natural language statements as if that was all they had to say and then trying to apply your intuition of on ill-defined natural language statements rather than read those natural language statements in the context of the formalisms and math they care about. You can't divorce the math from the philosophy.

Comment author: timtyler 08 April 2011 12:41:17PM *  1 point [-]

I'm not sure what statement by Yudkowsky you're referring to, but I'll bet it was something subtly different.

I am pretty sure it was this one - where: Yudkowsky goes loopy.

Comment author: benelliott 07 April 2011 11:09:10AM *  3 points [-]

Damn, I had a reply but it took so long to type that I lost internet connection.

Basically, with your point about supporting infinitely many theories, I refer you back to my comment that started this whole discussion.

As for the 'one strike and you're out' approach to criticism, I have three big problems with it:

The first is that if 'I don't understand' counts as a criticism, and you have claimed it does, then we need to reject every scientific theory we currently have since someone, somewhere, doesn't understand it.

Second, you accused Jaynes and Yudkowsky of being unscholarly when they said that Popper believed falsification is possible. Popper clearly believes refutation is possible, what is the difference between this and falsification?

Third, it leaves no room for weak criticisms.

Imagine I have a coin, which I think might be double-headed, although I'm not sure, it could also just be an ordinary coin. For some reason I am not allowed to examine both sides, all I can do is flip it and examine whichever side comes on top. If I was a Popperian how would I reason about this?

If tails comes on top then I have criticism of the 'double-headed' theory strong enough to refute it.

If heads comes on top then I do not have a strong enough criticism to refute either theory. This apparently means I do not have any criticism at all.

After 1000 heads I still don't have any criticism. At this point I get tired of flipping so I decide to make my decision. I still have two theories, so I criticise both of them for not refuting the other and reject them both. I sit down and try to come up with a better theory.

To me, this seems nuts. After 1000 heads, the 'ordinary coin' theory should have been thoroughly refuted. I would happily bet my life for a penny on it. If you wish to claim that is has been, then you must tell me where exactly, between 1 and 1000, you draw the line between refuted and not refuted.

Comment author: curi 07 April 2011 09:27:32PM 1 point [-]

The first is that if 'I don't understand' counts as a criticism, and you have claimed it does, then we need to reject every scientific theory we currently have since someone, somewhere, doesn't understand it.

Sort of. It's not perfect. As far as scientific progress, it should be improved on. Indefinitely.

In the mean time we have to make decisions. For making decisions, we never directly use canonical scientific theories. Instead, you make a conjecture like, "QM isn't perfect. But I can use it for building this space ship, and my spaceship will fly". This conjecture is itself open to criticism. It could be a bad idea, depending on the details of the scenario. But it is not open to criticism by some guy in Africa not understanding QM, which doesn't matter.

Second, you accused Jaynes and Yudkowsky of being unscholarly when they said that Popper believed falsification is possible.

That's not what they said. Check out the actual quotes, e.g. Yudkowsky said "Karl Popper's idea that theories can be definitely falsified". That is not Popper's idea. Ideas cannot be "definitely falsified" but only fallibly/conjecturally/tentatively falsified.

Third, it leaves no room for weak criticisms.

That's a merit!

Well, it does leave room for them as ideas which you might remember and try to improve on later.

If you wish to claim that is has been, then you must tell me where exactly, between 1 and 1000, you draw the line between refuted and not refuted.

I would conjecture an explanation of when to stop and why. It would depend on what my goal was with the coin flipping. If that explanation wasn't refuted by criticism, I would use it.

For example, I might be flipping coins to choose which to use for the coin flipping olympics. my goal might be to keep my job. So I might stop after 5 flips all the same and just move on to the next coin. That would work fine for my purposes.

Comment author: benelliott 07 April 2011 10:16:04PM 1 point [-]

In the mean time we have to make decisions. For making decisions, we never directly use canonical scientific theories. Instead, you make a conjecture like, "QM isn't perfect. But I can use it for building this space ship, and my spaceship will fly". This conjecture is itself open to criticism. It could be a bad idea, depending on the details of the scenario. But it is not open to criticism by some guy in Africa not understanding QM, which doesn't matter.

This, kind of feels like cheating. You use all its predictions but never give it credit for making them.

Besides, are you really suggesting that 'someone doesn't understand this' is a legitimate criticism. If this was correct it would mean that scientific truth is partly dependant on human minds, and that the laws of physics themselves change with our understanding of them.

That's not what they said. Check out the actual quotes, e.g. Yudkowsky said "Karl Popper's idea that theories can be definitely falsified". That is not Popper's idea. Ideas cannot be "definitely falsified" but only fallibly/conjecturally/tentatively falsified.

Yudkowsky doesn't use 'definitely' to mean 'with certainty'.

That's a merit!

No its not!

Weak criticisms are important, because they add up to strong ones and sometimes they are all we have to decide by.

I would conjecture an explanation of when to stop and why. It would depend on what my goal was with the coin flipping. If that explanation wasn't refuted by criticism, I would use it.

For example, I might be flipping coins to choose which to use for the coin flipping olympics. my goal might be to keep my job. So I might stop after 5 flips all the same and just move on to the next coin. That would work fine for my purposes.

You're moving the goal posts.

I wasn't asking what you would do in pragmatic terms, I was asking at which point would you consider the theory refuted. You have claimed your thinking process is based on examining criticisms and refuting theories when they are valid, so when is it valid?

Comment author: curi 07 April 2011 10:23:19PM 1 point [-]

This, kind of feels like cheating. You use all its predictions but never give it credit for making them.

I don't know what you mean by "give credit". I'm happy to hand out all sorts of credit.

If I don't have a criticism of using all QMs predictions, then I'll use them. That someone doesn't understand QM isn't a criticism of this. That's only a criticism of explanations of QM.

If this was correct it would mean that scientific truth is partly dependant on human minds,

It would mean that what ideas are valuable is partly dependent on what people exist to care about them.

Yudkowsky doesn't use 'definitely' to mean 'with certainty'.

What does it mean?

He shouldn't write stuff that, using the dictionary definitions, is a myth about Popper, and not clarify. Even if you're right he isn't excused.

Weak criticisms are important, because they add up to strong ones and sometimes they are all we have to decide by.

I think they can't and don't. I think this is a big like saying 3 wrong answers add up to a right answer.

If an argument is false, why should it count for anything? Why would you ever want a large number of false arguments (false as best you can judge them) to trump one true argument (true as best you judge it)?

I wasn't asking what you would do in pragmatic terms, I was asking at which point would you consider the theory refuted.

I would tentatively, fallibly consider the theory "it is a fair coin" refuted after, say, 20 flips. Why 20? I conjectured 20 and don't have a criticism of it. For coin flipping in particular, if I had any rigorous needs, I would use some math in accordance with them.

Comment author: benelliott 07 April 2011 10:40:41PM *  2 points [-]

It would mean that what ideas are valuable is partly dependent on what people exist to care about them.

Be careful with arguing that an idea's value is a different thing to its truth, you're starting to sound like an apologist.

If I don't have a criticism of using all QMs predictions, then I'll use them. That someone doesn't understand QM isn't a criticism of this. That's only a criticism of explanations of QM.

If those explanations are helpful to some people they shouldn't be rejected simply because they are not helpful to others. After all, without them we would never have the predictions.

He shouldn't write stuff that, using the dictionary definitions, is a myth about Popper, and not clarify. Even if you're right he isn't excused.

I don't know about the dictionary definitions, but in everyday conversation 'definitely' doesn't mean 'with certainty'. As Wittgenstein pointed out, these words are frequently used in contexts where the speaker might be wrong for dozens of reasons, and knows it. For instance, "I definitely left my keys by the microwave" is frequently false, and is generally only said by people who are feeling uncertain about it.

I would tentatively, fallibly consider the theory "it is a fair coin" refuted after, say, 20 flips. Why 20? I conjectured 20 and don't have a criticism of it. For coin flipping in particular, if I had any rigorous needs, I would use some math in accordance with them.

I conjecture 21, you don't have any criticism of that either. I now have a criticism of 20, which is that it fails to explain why my conjecture is wrong.

I think they can't and don't. I think this is a big like saying 3 wrong answers add up to a right answer.

If an argument is false, why should it count for anything? Why would you ever want a large number of false arguments (false as best you can judge them) to trump one true argument (true as best you judge it)?

A weak criticism is not the same as an invalid criticism. It just means a criticism that slightly erodes a position, without single-handedly bringing the whole thing crashing down.

The coin-flip thing was intended as an example.

Comment author: curi 08 April 2011 12:19:23AM 1 point [-]

If those explanations are helpful to some people they shouldn't be rejected simply because they are not helpful to others

You are taking rejection as a bigger deal than it is. The theory that "X is the perfect explanation" for X that confuses some people is false.

So we reject it.

We can accept other theories, e.g. that X is flawed but, for some particular purpose, is appropriate to use.

I don't know about the dictionary definitions,

It means "without doubt". Saying things like "I have no doubt that X" when there is doubt is just dumb.

I conjecture 21, you don't have any criticism of that either. I now have a criticism of 20, which is that it fails to explain why my conjecture is wrong.

The problem situation is under specified. When you ask ambiguous questions like what should I do in [under specified situation] then you get multiple possible answers and it's hard to do much in the way of criticizing.

In real world situations (which have rich context), it's not so hard to decide. But when I gave an example like that you objected.

I can't criticize 20 vs 21 unless I have some goal in mind, some problem we are trying to solve. (If there is no problem to be solved, I won't flip at all.) If the problem is figuring out if the coin is fair, with certainty, that is not solvable, so I won't flip at all. If it is figuring it out with a particular probability, given a few reasonable background assumptions, then I will look up the right math to use. If it's something else, what?

A weak criticism is not the same as an invalid criticism. It just means a criticism that slightly erodes a position, without single-handedly bringing the whole thing crashing down.

This is an important issue. I think your statement here is imprecise.

A criticism might demolish one single idea which is part of a bigger idea.

If it demolishes zero individual ideas, then where is the erosion?

If it demolishes one little idea, then that idea is refuted. And the big idea needs to replace it with something else which is not refuted, or find a way to do without.

Comment author: benelliott 08 April 2011 07:32:33AM 2 points [-]

You are taking rejection as a bigger deal than it is. The theory that "X is the perfect explanation" for X that confuses some people is false.

Maybe QM is exactly right, and maybe it is just too complicated for some people to understand. There is no need to be so harsh in your criticism process, why not just admit that a theory can be right without being perfect in every other respect.

It means "without doubt". Saying things like "I have no doubt that X" when there is doubt is just dumb.

Yet everyone does it. Language is a convention, not a science. If you are using a word differently from everyone else then you are wrong, the dictionary has no authority on the matter.

The problem situation is under specified. When you ask ambiguous questions like what should I do in [under specified situation] then you get multiple possible answers and it's hard to do much in the way of criticizing.

This is a flaw. Bayes can handle any level of information.

I can't criticize 20 vs 21 unless I have some goal in mind, some problem we are trying to solve. (If there is no problem to be solved, I won't flip at all.) If the problem is figuring out if the coin is fair, with certainty, that is not solvable, so I won't flip at all. If it is figuring it out with a particular probability, given a few reasonable background assumptions, then I will look up the right math to use. If it's something else, what?

Can you really not see why the above is moving the goal posts. Earlier, you said that you think by coming up with conjectures, and criticising them, and only then make decisions. Now you are putting the decision making process in the driving seat and saying that everything is based on that. So is Popperianism purely pragmatic? Is the whole conjecture and criticism thing not really the important part, and in fact its all based on decision strategies. Or do you use the conjecture-criticism thing to try and reach the correct answer, as you have previously stated, and then use that for decision making.

If it demolishes zero individual ideas, then where is the erosion?

It makes the idea less likely, less plausible, by a small amount. The coin flip is intended to illustrate it. Saying that you will use Bayes in the coin flip example and nowhere else is like saying you believe Newton's laws work 'inside the laboratory' but you're going to keep using Aristotle outside.

Comment author: GuySrinivasan 07 April 2011 05:16:59PM 0 points [-]

Following curi's steps, we'd lower our standards. How do you feel about the theory "I don't want to spend more time on this and getting 1000 heads if it's double-headed is 2^1000 more likely than getting 1000 heads if it's ordinary so I'll make the same decisions I'd make if I knew it were double-headed unless I get a rough estimate of at least a factor of 2^990 difference in how much I care about the outcome of one of those decisions".

Comment author: benelliott 07 April 2011 05:24:43PM 1 point [-]

What you appear to be suggesting amounts to Bayesian epistemology done wrong.

Comment author: curi 07 April 2011 09:20:44PM 2 points [-]

For coin flipping analysis, use Bayes' theorem (not Bayesian epistemology).

Comment author: benelliott 07 April 2011 10:08:27PM 1 point [-]

If Bayes generates the right answer here, whereas naive Popperian reasoning without it goes spectacularly wrong, maybe this should be suggesting something. Also it ignores my main point that Poppers theory does not admit weak criticisms, of which the coin coming up heads is just one example.

Comment author: timtyler 08 April 2011 12:53:42PM 0 points [-]

Whether you have a double-headed coin or not is still a form of knowledge.

The Bayes' theorem:good, Bayesian epistemology:bad perspecitive won't wash.

Comment author: Peterdjones 14 April 2011 04:34:04PM 2 points [-]

Neither the potential infinity of theories, nor the possibility of error favour Popper over Bayes.

"The reason it doesn't is there's always infinitely many things supported by any evidence, in this sense. Infinitely many things which make wildly different predictions about the future, but identical predictions about whatever our evidence covers. If Y is 10 white swans, and X is "all swans are white" then X is supported, by your statement. But also supported are infinitely many different theories claiming that all swans are black, and that you hallucinated. You saw exactly what you would see if any of those theories were true, so they get as much support as anything else. There is nothing (in the concept of support) to differentiate between "all swans are white" and those other theories."

And that doesn't matter unless they (a) all have equal prior,and (b) will continue to be supported equally by any future evidence. (a) Is never the case, but doesn't help that much since, without a solution to (b), the relative rankings of various theories will never change from their priors, making evidence irrelevant. (b) is also never the case. The claims that 100% of swans are white, 90% are white, 80%, and so on will not all be equaly supported by a long series of observations of white swans. A Popperian could argue that the the theories that are becoming relatively less supported are becoming partially refuted. But relative refutation of T is relative support for not-T, just because of the meaning of relative. The Popperian can only rescue the situation by showing that there is absolute refutation, but no absolute support.

"If you do add something else to differentiate, I will say the support concept is useless. The new thing does all the work. And further, the support concept is frequently abused. I have had people tell me that "all swans are black, but tomorrow you will hallucinated 10 white swans" is supported less by seeing 10 white swans tomorrow than "all swans are white" is, even though they made identical predictions (and asserted them with 100% probability, and would both have been definitely refuted by anything else)."

They have different priors. The halucination theory is a skeptical hypothesis, and it is well known that skeptical hypotheses can't be refuted empirically. But we can still give them low priors. Or regard them as bad explanations--for instance, reject them because they are unfalsifiable or too easy to vary.

Comment author: [deleted] 07 April 2011 09:32:51AM 2 points [-]

"I see that as entrenching bias and subjectivism in reagards to morality -- we can make objective criticisms of moral values."

You keep asserting that. You keep failing to provide a shred of evidence.

"BTW I think it's ironic that I score better on support when I just stick 100% in front of every prediction in all theories I mention, while you score lower by putting in other numbers, and so your support concept discourages ever making predictions with under 100% confidence"

That's true right up until you see the first black swan. All else being equal, simpler explanations are always to be favoured over more complex ones. Look up Kolmogrov complexity and minimum message length.

Comment author: curi 07 April 2011 09:46:31AM 2 points [-]

You keep asserting that. You keep failing to provide a shred of evidence.

I posted arguments. What did you not like about them? Post a criticism of something I said. "No evidence" is just a request for justification which I regard as impossible, but I did give arguments.

That's true right up until you see the first black swan.

At which point infinitely many of my 100% theories will be refuted. And infinitely many will remain. You can never win at that game using finite evidence. For any finite set of evidence, infinitely many 100% type theories predict all of it perfectly.

Comment author: thakil 07 April 2011 10:28:03AM 3 points [-]

What arguments? I had a look through what you've written, and found this "There are objective facts about how to live, call them what you will. Or, maybe you'll say there aren't. If there are, then it's not objectively wrong to be a mass murderer. Do you really want to go there into full blown relativism and subjectivism?"

This is hardly an argument for the truth content of a statement. Just because the consequences of a theory of moral behaviour make us feel bad doesn't mean they are not true- we should be interested in whether the statements conflict with how the universe seems to work. The notion of morality independent to sentient beings has always seemed fundamentally absurd to me, and I have yet to find a decent argument in its favour.

The worry of slipping into moral relativism is that we are trapped in a position where we can't punish mass murderers. But theres lots of sensible reasons for mass murderers to punish mass murder, and not indulge in it themselves. One would have to get exceptional utility out of murdering to counteract all the downsides with performing such an action.

The problem here is, as often happens with moral discussion, that wrong is not well defined. You say wrong to mean a grand moral force of the universe, but it could mean "is this a sensible action for this being to take, given their goals and desires". It might turn out that given all that said person IS benefited most by mass murder

Comment author: curi 07 April 2011 10:43:05AM *  2 points [-]

As I recall I gave a citation where to find Popper discussing morality (it is The World of Parmenides).

And I explained that moral knowledge is created using the same method as any other kind of knowledge. And i said that that method is (conjectures and refutations).

questioning if people want to advocate strong relativism or subjectivism is an argument, too. if you aren't aware of the already existing arguments against relativism or subjectivism, then it's incomplete for you. you could always ask.

you haven't understood my view. i didn't say it's a moral force. the issue of "what is the right action, given my goals and desires?" is 100% objective, and it is a moral issue. i don't know why you expected me to disagree about that. there is a fact of the matter about it. that is one of the major parts of morality. but there is also a second part: the issue of what are good goals and desires to have?

how can that be objective, you wonder? well for example some sets of goals contradict each other. that allows for a type of objective moral argument, about what goals/values/preferences/utility-functions to have, against contradictory goals.

there's others. to start with, read: http://www.curi.us/1169-morality

Comment author: benelliott 07 April 2011 12:30:08PM 5 points [-]

"what is the right action, given my goals and desires?" is 100% objective

Bayes, combined with Von Neuman Mortenson utility theory answers this, at least in principle.

You keep acting as if it is a flaw that Bayes only predicts. Is it a flaw that Newton's laws of motion do not explain the price of gold? Narrowness is a virtue, attempting to spread your theory as wide as possible ends up forcing it into places where it doesn't belong.

Comment author: curi 07 April 2011 06:19:56PM 1 point [-]

If bayes wants to be an epistemology then it must do more than predict. Same for Newton.

If you want to have math which doesn't dethrone Popper, but is orthogonal, you're welcome to do that and i'd stop complaining (much). However Yudkowsky says Bayesian Epistemology dethrones and replaces Popper. He regards it as a rival theory to Popper's. Do you think Yudkowsky was wrong about that?

Comment author: timtyler 07 April 2011 07:40:07PM *  1 point [-]

Yudkowsky says Bayesian Epistemology dethrones and replaces Popper. He regards it as a rival theory to Popper's. Do you think Yudkowsky was wrong about that?

It replaces Popperian epistemology where their domains overlap - namely: building models from observations and using them to predict the future. It won't alone tell you what experiments to perform in order to gather more data - there are other puzzle pieces for dealing with that.

Comment author: curi 07 April 2011 08:23:29PM 1 point [-]

There's no overlap there b/c Popperian epistemology doesn't provide the specific details of how to do that. Popperian epistemology is fully compatible with, and can use, Bayes' theorem and any other pure math or logic insights.

Popperian epistemology contradicts your "other puzzle pieces". And without them, Bayes' theorem alone isn't epistemology.

Comment author: timtyler 07 April 2011 08:54:30PM *  2 points [-]

It replaces Popperian epistemology where their domains overlap - namely: building models from observations and using them to predict the future.

There's no overlap there b/c Popperian epistemology doesn't provide the specific details of how to do that.

Except for the advice on induction? Or has induction merely been rechristened as corroboration? Popper enthusiasts usually seem to deny doing that.

Comment author: curi 07 April 2011 08:55:50PM 1 point [-]

Induction doesn't work.

building models from observations and using them to predict the future

I thought you were referring to things you can do with Bayes' theorem and some input. If you meant something more, provide the details of what you are proposing.

Comment author: benelliott 07 April 2011 07:28:42PM 1 point [-]

The most common point of Popper's philosophy that I hear (including from my Popperian philosophy teacher) is the whole "black swan white swan" thing, which Bayes does directly contradict, and dethrone (though personally I'm not a big fan of that terminology).

The stuff you talked about with conjectures and criticisms does not directly contradict Bayes and if the serious problems with 'one strike and you're out' criticisms are fixed it I may be persuaded to accept both it and Bayes.

Bayes is not meant to be an epistemology all on its own. It only starts becoming one when you put it together with Solomonoff Induction, Expected Utility Theory, Cognitive Science and probably a few other pieces of the puzzle that haven't been found yet. I presume the reason it is referred to as Bayesian rather than Solomonoffian or anything else is that Bayes is the both most frequently used and the oldest part.

Comment author: curi 07 April 2011 07:33:14PM *  1 point [-]

The black swan thing is not that important to Popper's ideas, it is merely a criticism of some of Popper's opponents.

How does Bayes dethrone it? By asserting that white swans support "all swans are white"? I've addressed that at length (still going through overnight replies, if someone answered my points i'll try to find it).

Solomonoff Induction, Expected Utility Theory, Cognitive Science

Well I don't have a problem with Bayes' theorem itself, of course (pretty much no one does, right? i hope not lol). It's these surrounding ideas that make an epistemology that I think are mistaken, and all of which Popper's epistemology contradicts. (I mean the take on cognitive science popular here, not the idea of doing cognitive science).

Comment author: benelliott 07 April 2011 07:58:17PM 1 point [-]

(still going through overnight replies, if someone answered my points i'll try to find it)

I think I answered your points a few days ago with my first comment of this discussion.

In short, yes, there are infinitely many hypotheses whose probabilities are raised by the white swan, and yes those include both "all swans are white" and "all swans are black and I am hallucinating" but the former has a higher prior, at least for me, so it remains more probable by several orders of magnitude. For evidence to support X it doesn't have to only support X. All that is required is that X does better at predicting than the weighted average of all alternatives.

I have had people tell me that "all swans are black, but tomorrow you will hallucinated 10 white swans" is supported less by seeing 10 white swans tomorrow than "all swans are white" is, even though they made identical predictions (and asserted them with 100% probability, and would both have been definitely refuted by anything else).

Just to be clear I am happy to say those people were completely wrong. It would be nice if nobody ever invented a poor argument to defend a good conclusion but sadly we do not live in that world.

Comment author: curi 07 April 2011 08:21:58PM 1 point [-]

I think I answered your points a few days ago with my first comment of this discussion.

But then I answered your answer, right? If I missed one that isn't pretty new, let me know.

but the former has a higher prior

so support is vacuous and priors do all the real work. right?

and priors have their own problems (why that prior?).

Just to be clear I am happy to say those people were completely wrong. It would be nice if nobody ever invented a poor argument to defend a good conclusion but sadly we do not live in that world.

OK. I think your conception of support is unsubstantive but not technically wrong.

Comment author: thakil 07 April 2011 11:05:32AM 0 points [-]

Mm, I'm not sure I entirely agree with that link- we might accept that most long term goals on maximisation will lead to what most people might recognise as morality, but I don't know if all goals are long term. It also makes no argument as to one goal being "better" than another. Theres sensible reasons for me to discourage people having the desire to kill me, for example, but I don't see that one could argue that I'm right and he's wrong. If someone is born with just one innate desire, that of killing me, its in her interest to pursue that goal. Now she might well act morally elsewhere while engaging fully in her training towards killing me, but at some point where she is confident that she will be able to kill me, she should drop everything else and kill me. Of course after this her life is empty, but she only had that one desire, and she had to fufill it at some point- she got absolutely no value from everything else.

Now was she wrong to pursue that goal? I don't see how I can condemn her. I obviously will do everything in my power to stop her, and I would hope others in society would have goals which are interrupted by my untimely demise, but I don't see where condemnation comes in here. We had conflicting goals, and mine seem "nicer" from an intuitive argument, but if I lived in a world where everyone had a strong desire to see me dead then I imagine it would feel "nicer" to them for me to die, and "nasty" for me to survive.

Comment author: curi 07 April 2011 06:17:07PM 2 points [-]

Not all goals are long term.

One of the purposes of the dialog is to explain that the foundations are not very important. That means you don't have to figure out the correct foundations or starting place to have objective morality. You can start wherever you want, because rather little depends on the starting place.

Once you do make a ton of progress, when you're much wiser, if your starting place was squirrels you'd be able to reconsider it because it's so silly. The same holds for any other particularly dumb starting place.

The ones that will be harder to change later are specifically the ones that you don't see as bad -- that you don't want to change. The ones that are either correct or you don't yet have enough knowledge to see the problems with them.

If someone is born with just one innate desire

Innate desires aren't morality. It's a bad argument "I was born this way therefore I should be this way". That's getting and ought from an is.

Moving on, one way to move past the squirrel scenario, which enables you to criticize the squirrel starting point and many others, is you consider other scenarios. Drop the squirrels and put in something else, like minimizing bison. Put in a way variety of stuff. It's not too important what it is. Any kind of value, taken seriously, and which has something to say long term. Even wanting to kill someone will work if you also want them to stay dead forever (if you really want to make sure to destroy all the information that could be used to resurrect them later with advanced technology, and you want to know what kind of remains could be used for that and what would violate the laws of physics, then you will need advanced knowledge).

So, you try the same thought experiment with bison-minimizing, or killing-forever.

You find that some of the conclusions are the same, and some are different.

Take only the ones that are the same for thousands of starting points.

Those are the non-parochial ones. They are the ones that don't depend on your culture and biases. There is the objective content.

The parts that vary by starting points are wrong.

That's what I think. And I think this argument isn't bogged down in being totally subjective from the start. Maybe it's not perfect, but that just means we could make an even better argument in the future.

Getting back to your original point, this shared content across many starting points does say stuff about what to do short term. It doesn't give complete arbitrary freedom of action in the short term. It's also not totally restrictive, but that's good (it might get a lot more restrictive if made more precise. but that would also be good. knowing how to live well in high detail would be a good thing even if it gave you less non-immoral options. as long as we don't jump the gun and create very precise rules before we understand how to work them out well, we'll be ok.)

Comment author: thakil 07 April 2011 07:26:11PM 1 point [-]

Sorry, but I just do not see how you can claim desires are not morality when you have yet to provide a basis for what it is! I see no reason to believe that those bases with common conclusions are somehow better. They might feel better, but thats not good enough

Comment author: curi 07 April 2011 07:28:57PM *  2 points [-]

you have yet to provide a basis

I've argued that morality is at least largely, if not entirely, independent of basis. So asking me for a basis isn't the right question.

Can you give an example of a starting point you think avoids the common conclusions such as liberalism?

Comment author: thakil 07 April 2011 08:32:06PM *  0 points [-]

You have shown that an argument can be made that given a number of seemingly dissimilar, long term goals, e can make arguments which convincingly argue that to achieve them one should act in a manner people would generally consider moral. I am not convinced squirrel morality gives me an answer on specific moral questions (abortion say) but I can see how one might manage it. You have yet to convince me that short term bases will do the same: I am reasonably confident that many wil not. To claim theses bases as inferior seems to be begging the question to me.

As to your specific question: how about a basis of wanting to prevent liberalism? It would certainly be difficult to achieve and counter productive, but to claim that those respective properties are bad begs the question: you need morality to condemn purposes which are going to cause nothing but pain for all involved.

Comment author: curi 07 April 2011 08:52:02PM 2 points [-]

how about a basis of wanting to prevent liberalism?

If you were just to destroy the world, or build a static society and die of a meteor strike one day b/c your science never advanced, then life could evolve on another planet.

You need enough science and other things to be able to affect the whole universe. And for that you need liberalism temporarily. Then at the very very end, when you're powerful enough to easily do whatever you want to the whole universe (needs to be well within your power, not at the limits of your power, or it's too risky, you might fail) then finally you can destroy or control everything.

So that goal leads straight to billions of years of liberalism. And that does mean freedom of abortion: ruining people's lives to punish them for having sex does not make society wealthier, does not promote progress, etc... But does increase the risk of everyone dying of meteor before you advance enough to deal with such a problem.

short term bases

Accomplish short term things, in general, depends on principles. Suppose I want a promotion at work within the next few years. It's important to have the right kind of philosophy. I'll have a better shot at it if I think well. So I'll end up engaging with some big ideas. Not every single short term basis will lead somewhere interesting. If it's really short, it's not so important. Also consider this: we can conjecture that life is nice. People cannot use short term bases, which don't connect to big ideas, to criticize this. If they want to criticize it, they will have to engage with some big ideas, so then we get liberalism again.

Comment author: [deleted] 07 April 2011 10:10:55AM 3 points [-]

The problem is, nobody else here (or very few people here) regards justification as impossible, You're essentially saying you refuse to engage by the same evidentiary rules as anyone else here. You're not going to change anyone's mind without providing justification.

"At which point infinitely many of my 100% theories will be refuted. And infinitely many will remain. "

Like I said, look up Kolmogrov Complexity and minimum message length. At any given time, the simplest of those 'theories' consistent with all data is the one with the highest probability.

Comment author: curi 07 April 2011 10:25:50AM 2 points [-]

Can you tell how ideas are justified, without creating a regress or other severe problem? Tell me the type of justificationism that works, then I will accept it.

Comment author: prase 07 April 2011 02:14:05PM 1 point [-]

At which point infinitely many of my 100% theories will be refuted. And infinitely many will remain. You can never win at that game using finite evidence. For any finite set of evidence, infinitely many 100% type theories predict all of it perfectly.

It seems that your objection is basically that if I toss a coin seventeen times and it ends up in a sequence of HTTTHTHHHHTHTHTTH, there is a specific theory T1 (namely, that the physical laws cause the sequence to be HTTTHTHHHHTHTHTTH) which scores higher than the clearly correct explanation T2 (i.e. the probability of each sequence is the same 2^(-17)). But this is precisely why priors depend on the Kolmogorov complexity of hypotheses: with such a prior, the posterior of T2 will be higher than the posterior of T1.

And, after all, you don't have infinitely many theories. Theories live in brains, not in an infinite Platonic space of ideas. Why should we care whether there are infinitely many ways to formulate a theory so absurd that nobody would think of it but still compatible with the evidence? Solomonoff induction tells you to ignore them, which agrees with the common sense.

Comment author: curi 07 April 2011 07:22:13PM 1 point [-]

Selectively ignoring theories, even when we're aware of them, is just bias, isn't it?

I'm a bit surprised that someone here is saying to me "OK so mathematically, abstractly, we're screwed, but in practice it's not a big deal, proceed anyway". Most people here respect math and abstract thinking, and don't dismiss problems merely for involving substantial amounts of theory.

Of course a prior can arbitrarily tell you which theories to prefer over others. But why those? You're getting into problems of arbitrary foundations.

Comment author: prase 07 April 2011 08:39:26PM 1 point [-]

Bias is a systematic error in judgement, something which yields bad results. It is incorrect to apply that label to heuristics which are working well.

I haven't told you that we are abstractly screwed, but it's no big deal. We are not screwed, on the contrary, the Solomonoff induction is a consistent algorithm which works well in practice. It is as arbitrary as any axioms are arbitrary. You can't do any better if you want to have any axioms at all, or any method at all. If your epistemology isn't completely empty, it can be criticised for being arbitrary without regard to its actual details. And after all, what ultimately matters is whether it works practically, not some perceived lack of arbitrariness.

Comment author: calef 07 April 2011 08:32:35PM 1 point [-]

We're fundamentally incapable of making statements about reality without starting on some sort of arbitrary foundation.

And I think describing it as "selectively ignoring" is doing it an injustice. We're deductively excluding, and it there were some evidence to appear that would contradict that exclusion, those theories would no longer be excluded.

I'm actually have trouble finding a situation in which a fallibilist would accept/reject a proposition, and a Bayesian would do the opposite of the fallibilist. And I don't mean epistemological disagreements, I mean disagreements of the form "Theory Blah is not false."

Comment author: curi 07 April 2011 08:36:54PM 2 points [-]

We're fundamentally incapable of making statements about reality without starting on some sort of arbitrary foundation.

This is something Popper disputes. He says you can start in the middle, or anywhere. Why can't that be done?

And I think describing it as "selectively ignoring" is doing it an injustice. We're deductively excluding

I was talking about the theories that can't be deductively excluded b/c they make identical predictions for all available evidence.