You've all seen it. Sentences like "this sentence is false": if they're false, they're true, and vice versa, so they can't be either true or false. Some people solve this problem by doing something really complicated: they introduce infinite type hierarchies wherein every sentence you can express is given a "type", which is an ordinal number, and every sentence can only refer to sentences of lower type. "This sentence is false" is not a valid sentence there, because it refers to itself, but no ordinal number is less than itself. Eliezer Yudkowsky mentions but says little about such things. What he does say, I agree with: ick!
In addition to the sheer icky factor involved in this complicated method of making sure sentences can't refer to themselves, we have deeper problems. In English, sentences can refer to themselves. Heck, this sentence refers to itself. And this is not a flaw in English, but something useful: sentences ought to be able to refer to themselves. I want to be able to write stuff like "All complete sentences written in English contain at least one vowel" without having to write it in Spanish or as an incomplete sentence.1 How can we have self-referential sentences without having paradoxes that result in the universe doing what cheese does at the bottom of the oven? Easy: use fuzzy logic.
Now, take a nice look at the sentence "this sentence is false". If your intuition is like mine, this sentence seems false. (If your intuition is unlike mine, it doesn't matter.) But obviously, it isn't false. At least, it's not completely false. Of course, it's not true, either. So it's not true or false. Nor is it the mythical third truth value, clem2, as clem is not false, making the sentence indeed false, which is a paradox again. Rather, it's something in between true and false--"of medium truth", if you will.
So, how do we represent "of medium truth" formally? Well, the obvious way to do that is using a real number. Say that a completely false sentence has a truth value of 0, a completely true sentence has a truth value of 1, and the things in between have truth values in between.3 Will this work? Why, yes, and I can prove it! Well, no, I actually can't. Still, the following, trust me, is a theorem:
Suppose there is a set of sentences, and there are N of them, where N is some (possibly infinite) cardinal number, and each sentence's truth value is a continuous function of the other sentences' truth values. Then there is a consistent assignment of a truth value to every sentence. (More tersely, every continuous function [0,1]^N -> [0,1]^N for every cardinal number N has at least one fixed point.)
So for every set of sentences, no matter how wonky their self- and cross-references are, there is some consistent assignment of truth values to them. At least, this is the case if all their truth values vary continuously with each other. This won't happen under strict interpretations of sentences such as "this sentence's truth value is less than 0.5": this sentence, interpreted as black and white, has a truth value of 1 when its truth value is below 0.5 and a truth value of 0 when it's not. This is inconsistent. So, we'll ban such sentences. No, I don't mean ban sentences that refer to themselves; that would just put us back where we started. I mean we should ban sentences whose truth values have "jumps", or discontinuities. The sentence "this sentence's truth value is less than 0.5" has a sharp jump in truth value at 0.5, but the sentence "this sentence's truth value is significantly less than 0.5" does not: as its truth value goes down from 0.5 down to 0.4 or so, it also goes up from 0.0 up to 1.0, leaving us a consistent truth value for that sentence around 0.49.
Edit: I accidentally said "So, we'll not ban such sentences." That's almost the opposite of what I wanted to say.
Now, at this point, you probably have some ideas. I'll get to those one at a time. First, is all this truth value stuff really necessary? To that, I say yes. Take the sentence "the Leaning Tower of Pisa is short". This sentence is certainly not completely true; if it were, the Tower would have to have a height of zero. It's not completely false, either; if it were, the Tower would have to be infinitely tall. If you tried to come up with any binary assignment of "true" and "false" to sentences such as these, you'd run into the Sorites paradox: how tall would the Tower be if any taller tower were "tall" and any shorter tower were "short"? A tower a millimeter higher than what you say would be "tall", and a tower a millimeter shorter would be "short", which we find absurd. It would make a lot more sense if a change of height of one millimeter simply changed the truth value of "it's short" by about 0.00001.
Second, isn't this just probability, which we already know and love? No, it isn't. If I say that "the Leaning Tower of Pisa is extremely short", I don't mean that I'm very, very sure that it's short. If I say "my mother was half Irish", I don't mean that I have no idea whether she was Irish or not, and might find evidence later on that she was completely Irish. Truth values are separate from probabilities.
Third and finally, how can this be treated formally? I say, to heck with it. Saying that truth values are real numbers from 0 to 1 is sufficient; regardless of whether you say that "X and Y" is as true as the product of the truth values of X and Y or that it's as true as the less true of the two, you have an operation that behaves like "and". If two people have different interpretations of truth values, you can feel free to just add more functions that convert between the two. I don't know of any "laws of truth values" that fuzzy logic ought to conform to. If you come up with a set of laws that happen to work particularly well or be particularly elegant (percentiles? decibels of evidence?), feel free to make it known.
1. ^ The term "sentence fragment" is considered politically incorrect nowadays due to protests by incomplete sentences. "Only a fragment? Not us! One of us standing alone? Nothing wrong with that!"
2. ^ I made this word up. I'm so proud of it. Don't you think it's cute?
3. ^ Sorry, Eliezer, but this cannot be consistently interpreted such that 0 and 1 are not valid truth values: if you did that, then the modest sentence "this sentence is at least somewhat true" would always be truer than itself, whereas if 1 is a valid truth value, it is a consistent truth value of that sentence.
The crisp portion of such a self-reference system will be equivalent to a Kripke fixed-point theory of truth, which I like. It won't be the least fixed point, however, which is the one I prefer; still, that should not interfere with the normal mathematical reasoning process in any way.
In particular, the crisp subset which contains only statements that could safely occur at some level of a Tarski hierarchy will have the truth values we'd want them to have. So, there should be no complaints about the system coming to wrong conclusions, except where problematically self-referential sentences are concerned (sentences which are assigned no truth value in the least fixed point).
So; the question is: do the sentences which are assigned no truth value in Kripke's construction, but are assigned real-numbered truth values in the fuzzy construction, play any useful role? Do they add mathematical power to the system?
For those not familiar with Kripke's fixed points: basically, they allow us to use self-reference, but to say that any sentence whose truth value depends eventually on its own truth value might be truth-value-less (ie, meaningless). The least fixed point takes this to be the case whenever possible; other fixed points may assign truth values when it doesn't cause trouble (for example, allowing "this sentence is true" to have a value).
If discourse about the fuzzy value of (what I would prefer to call) meaningless sentences adds anything, then it is by virtue of allowing structures to be defined which could not be defined otherwise. It seems that adding fuzzy logic will allow us to define "essentially fuzzy" structures... concepts which are fundamentally ill-defined... but in terms of the crisp structures that arise, correct me if I'm wrong, but it seems fairly clear to me that nothing will be added that couldn't be added just as well (or, better) by adding talk about the class of real-valued functions that we'd be using for the fuzzy truth-functions.
To sum up: reasoning in this way seems to have no bad consequences, but I'm not sure it is useful...
By the way, how would you incorporate probabilities into binary logic? Either you can include statements about probabilities in binary logic ("probability on top of logic"), or you can assign probabilities to binary logic statements ("logic on top of probability theory"). The situation is just analogous to that of fuzziness. If you do #1, that means binary logic is the most fundamental layer. If you do #2, I can also do an analogous thing with fuzziness.