Comment author: Wei_Dai 20 June 2013 08:11:10AM *  1 point [-]

And as for how the copyable entities will reason about their own existence

I'm not interested so much in how they will reason, but in how they should reason.

The sort of thing that might eventually give us insight into whether we're copyable or not would be understanding the effect of microscopic noise on the sodium-ion channels, whether the noise can be grounded in PMDs, etc.

When you say "we" here, do you literally mean "we" or do you mean "biological humans"? Because I can see how understanding the effect of microscopic noise on the sodium-ion channels might give us insight into whether biological humans are copyable, but it doesn't seem to tell us whether we are biological humans or for example digital simulations (and therefore whether your proposed solution to the philosophical puzzles is of any relevance to us). I thought you were proposing that if your theory is correct then we would eventually be able to determine that by introspection, since you said copyable minds might have no subjective experience or a different kind of subjective experience.

Comment author: ScottAaronson 20 June 2013 01:15:45PM 2 points [-]

(1) Well, that's the funny thing about "should": if copyable entities have a definite goal (e.g., making as many additional copies as possible, taking over the world...), then we simply need to ask what form of reasoning will best help them achieve the goal. If, on the other hand, the question is, "how should a copy reason, so as to accord with its own subjective experience? e.g., all else equal, will it be twice as likely to 'find itself' in a possible world with twice as many copies?" -- then we need some account of the subjective experience of copyable entities before we can even start to answer the question.

(2) Yes, certainly it's possible that we're all living in a digital simulation -- in which case, maybe we're uncopyable from within the simulation, but copyable by someone outside the simulation with "sysadmin access." But in that case, what can I do, except try to reason based on the best theories we can formulate from within the simulation? It's no different than with any "ordinary" scientific question.

(3) Yes, I raised the possibility that copyable minds might have no subjective experience or a different kind of subjective experience, but I certainly don't think we can determine the truth of that possibility by introspection -- or for that matter, even by "extrospection"! :-) The most we could do, maybe, is investigate whether the physical substrate of our minds makes them uncopyable, and therefore whether it's even logically coherent to imagine a distinction between them and copyable minds.

Comment author: Wei_Dai 19 June 2013 03:13:00AM 2 points [-]

But as long as we're trading hypotheticals: what if minds (or rather, the sorts of minds we have) can only be associated with uncopyable physical substrates?

If that turns out to be the case, I don't think it would much diminish either my intellectual curiosity about how problems associated with mind copying ought to be solved nor the practical importance of solving such problems (to help prepare for a future where most minds will probably be copyable, even if my own isn't).

various things that confused me for years and that I discuss in the essay (Newcomb, Boltzmann brains, the "teleportation paradox," Wigner's friend, the measurement problem, Bostrom's observer-counting problems...) all seemed to beckon me in that direction from different angles

It seems likely that in the future we'll be able to build minds that are very human-like, but copyable. For example we could take someone's gene sequence, put them inside a virtual embryo inside a digital simulation, let it grow into an infant and then raise it in a virtual environment similar to a biological human child's. I'm assuming that you don't dispute this will be possible (at least in principle), but are saying that such a mind might not have the same kind of subjective experience as we do. Correct?

Now suppose we built such a mind using your genes, and gave it an upbringing and education similar to yours. Wouldn't you then expect it to be puzzled by all the things that you mentioned above, except it would have to solves those puzzles in some way other than by saying "I can get around these confusions if I'm not copyable"? Doesn't that suggest to you that there have to be solutions to those puzzles that do not involve "I'm not copyable" and therefore the existence of the puzzles shouldn't have beckoned you in the direction of thinking that you're uncopyable?

So I decided that, given the immense perplexities associated with copyable minds (which you know as well as anyone), the possibility that uncopyability is essential to our subjective experience was at least worth trying to "steelman" (a term I learned here) to see how far I could get with it.

If you (or somebody) eventually succeed in showing that uncopyability is essential to our subjective experience, that would mean that by introspecting on the quality of our subjective experience, we would be able to determine whether or not we are copyable, right? Suppose we take a copyable mind (such as the virtual Scott Aaronson clone mentioned above), make another copy of it, then turn one of the two copies into an uncopyable mind by introducing some freebits into it. Do you think these minds would be able to accurately report whether they are copyable, and if so, by what plausible mechanism?

Comment author: ScottAaronson 20 June 2013 07:29:05AM 1 point [-]

(1) I agree that we can easily conceive of a world where most entities able to pass the Turing Test are copyable. I agree that it's extremely interesting to think about what such a world would be like --- and maybe even try to prepare for it if we can. And as for how the copyable entities will reason about their own existence -- well, that might depend on the goals of whoever or whatever set them loose! As a simple example, the Stuxnet worm eventually deleted itself, if it decided it was on a computer that had nothing to do with Iranian centrifuges. We can imagine that each copy "knew" about the others, and "knew" that it might need to kill itself for the benefit of its doppelgangers. And as for why it behaved that way --- well, we could answer that question in terms of the code, or in terms of the intentions of the people who wrote the code. Of course, if the code hadn't been written by anyone, but was instead (say) the outcome of some evolutionary process, then we'd have to look for an explanation in terms of that process. But of course it would help to have the code to examine!

(2) You argue that, if I were copyable, then the copies would wonder about the same puzzles that the "uncopyable" version wonders about -- and for that reason, it can't be legitimate even to try to resolve those puzzles by assuming that I'm not copyable. Compare to the following argument: if I were a character in a novel, then that character would say exactly the same things I say for the same reasons, and wonder about the same things that I wonder about. Therefore, when reasoning about (say) physics or cosmology, it's illegitimate even to make the tentative assumption that I'm not a character in a novel. This is a fun argument, but there are several possible responses, among them: haven't we just begged the question, by assuming there is something it's like to be a copyable em or a character in a novel? Again, I don't declare with John Searle that there's obviously nothing that it's like, if you think there is then you need your head examined, etc. etc. On the other hand, even if I were a character in a novel, I'd still be happy to have that character assume it wasn't a character -- that its world was "real" -- and see how far it could get with that assumption.

(3) No, I absolutely don't think that we can learn whether we're copyable or not by "introspecting on the quality of our subjective experience," or that we'll ever be able to do such a thing. The sort of thing that might eventually give us insight into whether we're copyable or not would be understanding the effect of microscopic noise on the sodium-ion channels, whether the noise can be grounded in PMDs, etc. If you'll let me quote from Sec. 2.1 of my essay: "precisely because one can’t decide between conflicting introspective reports, in this essay I’ll be exclusively interested in what can be learned from scientific observation and argument. Appeals to inner experience—including my own and the reader’s—will be out of bounds."

Comment author: shminux 18 June 2013 08:05:44PM -1 points [-]

How would one test experimentally whether the uncertainty in question is Knightian? Assuming we do our best to make many repeatable runs of some experiment, what set of outcomes would point to the MaxEnt (or any) prior being inadequate?

Comment author: ScottAaronson 19 June 2013 04:56:18PM 2 points [-]

shminux: I don't know any way, even in principle, to prove that uncertainty is Knightian. (How do you decisively refute someone who claims that if only we had a better theory, we could calculate the probabilities?) Though even here, there's an interesting caveat. Namely, I also would have thought as a teenager that there could be no way, even in principle, to "prove" something is "truly probabilistic," rather than deterministic but with complicated hidden parameters. But that was before I learned the Bell/CHSH theorem, which does pretty much exactly that (if you grant some mild locality assumptions)! So it's at least logically possible that some future physical theory could demand Knightian uncertainty in order to make internal sense, in much the same way that quantum mechanics demands probabilistic uncertainty.

But setting aside that speculative possibility, there's a much more important point in practice: namely, it's much easier to rule out that a given source of uncertainty is Knightian, or at least to place upper bounds on how much Knightian uncertainty it can have. To do so, you "merely" have to give a model for the system so detailed that, by using it, you can:

(1) calculate the probability of any event you want, to any desired accuracy,

(2) demonstrate, using repeated tests, that your probabilities are well-calibrated (e.g., of the things you say will happen roughly 60% of the time, roughly 60% of them indeed happen, and moreover the subset of those things that happen passes all the standard statistical tests for not having any further structure), and

(3) crucially, provide evidence that your probabilities don't merely reflect epistemic ignorance. In practice, this would almost certainly mean providing the causal pathways by which the probabilities can be traced down to the quantum level.

Admittedly, (1)-(3) sound like a tall order! But I'd say that they've already been done, more or less, for all sorts of complicated multi-particle quantum systems (in chemistry, condensed-matter physics, etc.): we can calculate the probabilities, compare them against observation, and trace the origin of the probabilities to the Born rule.

Of course, if you have a large ensemble of identical copies of your system (or things you regard as identical copies), then that makes validating your probabilistic model a lot more straightforward, for then you can replace step (2) by direct experimental estimation of the probabilities. But in the above, I was careful never to assume that we had lots of identical copies --- since if the freebit picture were accepted, then in many cases of interest to us we wouldn't!

Comment author: gjm 18 June 2013 02:07:23PM 0 points [-]

What would make them "count as reductios that there's probably nothing that it's like to be that program", and how?

Comment author: ScottAaronson 18 June 2013 07:24:42PM *  8 points [-]

Alright, consider the following questions:

  • What's it like to be simulated in homomorphically encrypted form (http://en.wikipedia.org/wiki/Homomorphic_encryption)---so that someone who saw the entire computation (including its inputs and outputs), and only lacked a faraway decryption key, would have no clue that the whole thing is isomorphic to what your brain is doing?

  • What's it like to be simulated by a reversible computer, and immediately "uncomputed"? Would you undergo the exact same set of experiences twice? Or once "forwards" and then once "backwards" (whatever that means)? Or, since the computation leaves no trace of its ever having happened, and is "just a convoluted implementation of the identity function," would you not experience anything?

  • Once the code of your brain is stored in a computer, why would anyone even have to bother running the code to evoke your subjective experience? And what counts as running it? Is it enough to do a debugging trace with pen and paper?

  • Suppose that, purely for internal error-correction purposes, a computer actually "executes" you three times in parallel, then outputs the MAJORITY of the results. Is there now one conscious entity or three? (Or maybe 7, for every nonempty subset of executions?)

Crucially, unlike some philosophers (e.g. John Searle), I don't pound the table and declare it "obvious" that there's nothing that it's like to be simulated in the strange ways above. All I say is that I don't think I have any idea what it's like, in even the same imperfect way that I can imagine what it's like to be another human being (or even, say, an unclonable extraterrestrial) by analogy with my own case. And that's why I'm not as troubled as some people are, if some otherwise-plausible cosmological theory predicts that the overwhelming majority of "copies" of me should be Boltzmann brains, computer simulations, etc. I view that as a sign, not that I'm almost certainly a copy (though I might be), but simply that I don't yet know the right way to think about this issue, and maybe that there's a good reason (lack of freebits??) why the supposed "copies" shouldn't even be included in my reference class.

Comment author: Manfred 18 June 2013 05:03:22PM *  1 point [-]

how does the superintelligence know it should assume the uniform distribution, and not some other distribution?

Symmetry arguments? And since our superintelligence understands the working of your brain minus this qubit, the symmetry isn't between choices A and B, but rather between the points on the Bloch sphere of the qubit. Learning that in some microscopically independent trial a qubit had turned out in such a way that you chose B doesn't give the superintelligence any information about your qubit, and so wouldn't change its prediction.

A less-super intelligence, who was uncertain about the function (your brain) that mapped qubits onto decisions, would update in favor of the functions that produced B - the degree to which this mattered would depend on its probability distribution over functions.

I don't deny the enormous value of MaxEnt and other Bayesian-prior-choosing heuristics in countless statistical applications. Indeed, if you forced me at gunpoint to bet on something about which I had Knightian uncertainty, then I too would want to use Bayesian methods, making judicious use of those heuristics!

This still seems weird, though I believe in freebits by your requirement. Why would you want to use Bayesian methods if no guess (in the form of a probability, to be scored according to some rule that rewards good guesses) is better than another on Knightian problems? And if some guess is better than another, why not use the best guess? That's what using probability is all about - if you didn't have incomplete information, you wouldn't need to guess at all.

Comment author: ScottAaronson 18 June 2013 06:21:52PM *  4 points [-]

Well, I can try to make my best guess if forced to -- using symmetry arguments or any other heuristic at my disposal -- but my best guess might differ from some other, equally-rational person's best guess. What I mean by a probabilistic system's being "mechanistic" is that the probabilities can be calculated in such a way that no two rational people will disagree about them (as with, say, a radioactive decay time, or the least significant digit of next week's Dow Jones average).

Also, the point of my "Earth C" example was that symmetry arguments can only be used once we know the reference class of things to symmetrize over -- but which reference class to choose is precisely the sort of thing about which rational people can disagree, with no mathematical or empirical way to resolve the disagreement. (And moreover, where it's not even obvious that there is a "true, pre-existing answer" out there in the world, or what it would mean for something to count as such an answer.)

Comment author: torekp 18 June 2013 12:05:01AM 3 points [-]

Sorry about misrepresenting you. I should have said "associating it with free will" instead of "calling it free will". I do think the association is a mistake. Admittedly it fits with a long tradition, in theology especially, of seeing freedom of action as being mutually exclusive with causal determination. It's just that the tradition is a mistake. Probably a motivated one (it conveniently gets a deity off the hook for creating and raising such badly behaved "children").

Comment author: ScottAaronson 18 June 2013 02:10:40PM *  4 points [-]

Well, all I can say is that "getting a deity off the hook" couldn't possibly be further from my motives! :-) For the record, I see no evidence for a deity anything like that of conventional religions, and I see enormous evidence that such a deity would have to be pretty morally monstrous if it did exist. (I like the Yiddish proverb: "If God lived on earth, people would break His windows.") I'm guessing this isn't a hard sell here on LW.

Furthermore, for me the theodicy problem isn't even really connected to free will. As Dostoyevsky pointed out, even if there is indeterminist free will, you would still hope that a loving deity would install some "safety bumpers," so that people could choose to do somewhat bad things (like stealing hubcaps), but would be prevented from doing really, really bad ones (like mass-murdering children).

One last clarification: the whole point of my perspective is that I don't have to care about so-called "causal determination"---either the theistic kind or the scientific kind---until and unless it gets cashed out into actual predictions! (See Sec. 2.6.)

Comment author: Wei_Dai 17 June 2013 09:29:41AM *  11 points [-]

As far as I can see, the only hope for avoiding these difficulties is if—because of chaos, the limits of quantum measurement, or whatever other obstruction—minds can’t be copied perfectly from one physical substrate to another, as can programs on standard digital computers.

Even if Aaronson's speculation that human minds are not copyable turns out to be correct, that doesn't rule out copyable minds being built in the future, either de novo AIs or what he (on page 58) calls "mockups" of human minds that are functionally close enough to the originals to fool their close friends. The philosophical problems with copyable minds will still be an issue for those minds, and therefore minds not being copyable can't be the only hope of avoiding these difficulties.

To put this another way, suppose Aaronson definitively shows that according to quantum physics, minds of biological humans can't be copied exactly. But how does he know that he is actually one of the original biological humans, and not for example a "mockup" living inside a digital simulation, and hence copyable? I think that is reason enough for him to directly attack the philosophical problems associated with copyable minds instead of trying to dodge them.

Comment author: ScottAaronson 18 June 2013 01:15:46PM *  8 points [-]

Wei, I completely agree that people should "directly attack the philosophical problems associated with copyable minds," and am glad that you, Eliezer, and others have been trying to do that! I also agree that I can't prove I'm not living in a simulation --- nor that that fact won't be revealed to me tomorrow by a being in the meta-world, who will also introduce me to dozens of copies of myself running in other simulations. But as long as we're trading hypotheticals: what if minds (or rather, the sorts of minds we have) can only be associated with uncopyable physical substrates? What if the very empirical facts that we could copy a program, trace its execution, predict its outputs using an abacus, run the program backwards, in heavily-encrypted form, in one branch of a quantum computation, at one step per millennium, etc. etc., were to count as reductios that there's probably nothing that it's like to be that program --- or at any rate, nothing comprehensible to beings such as us?

Again, I certainly don't know that this is a reasonable way to think. I myself would probably have ridiculed it, before I realized that various things that confused me for years and that I discuss in the essay (Newcomb, Boltzmann brains, the "teleportation paradox," Wigner's friend, the measurement problem, Bostrom's observer-counting problems...) all seemed to beckon me in that direction from different angles. So I decided that, given the immense perplexities associated with copyable minds (which you know as well as anyone), the possibility that uncopyability is essential to our subjective experience was at least worth trying to "steelman" (a term I learned here) to see how far I could get with it. So, that's what I tried to do in the essay.

Comment author: Manfred 17 June 2013 02:11:19PM *  8 points [-]

Take two heretofore identical Earths A and B in an infinite universe and are about to diverge based on your decision, and it's not impossible for a superintelligence to predict this decision, not even probabilistically, because it is based on a freebit:

Suppose you just have the one freebit - it's your standard issue qubit, and you keep it in a box at absolute zero in case you need to make a decision. If this superintelligence can predict you perfectly except for this one qubit, why wouldn't it just assign a uniform probability distribution to the qubit's values and then simulate you for different qubit values to obtain a probability distribution?

I'm probably just confused about something.

Comment author: ScottAaronson 18 June 2013 12:33:08PM *  6 points [-]

The relevant passage of the essay (p. 65) goes into more detail than the paraphrase you quoted, but the short answer is: how does the superintelligence know it should assume the uniform distribution, and not some other distribution? For example, suppose someone tips it off about a third Earth, C, which is "close enough" to Earths A and B even if not microscopically identical, and in which you made the same decision as in B. Therefore, this person says, the probabilities should be adjusted to (1/3,2/3) rather than (1/2,1/2). It's not obvious whether the person is right---is Earth C really close enough to A and B?---but the superintelligence decides to give the claim some nonzero credence. Then boom, its prior is no longer uniform. It might still be close, but if there are thousands of freebits, then the distance from uniformity will quickly get amplified to almost 1.

Your prescription corresponds to E. T. Jaynes's "MaxEnt principle," which basically says to assume a uniform (or more generally, maximum-entropy) prior over any degrees of freedom that you don't understand. But the conceptual issues with MaxEnt are well-known: the uniform prior over what, exactly? How do you even parameterize "the degrees of freedom that you don't understand," in order to assume a uniform prior over them? (You don't want to end up like the guy interviewed on Jon Stewart, who earnestly explained that, since the Large Hadron Collider might destroy the earth and might not destroy it, the only rational inference was that it had a 50% chance of doing so. :-) )

To clarify: I don't deny the enormous value of MaxEnt and other Bayesian-prior-choosing heuristics in countless statistical applications. Indeed, if you forced me at gunpoint to bet on something about which I had Knightian uncertainty, then I too would want to use Bayesian methods, making judicious use of those heuristics! But applied statisticians are forced to use tricks like MaxEnt, precisely because they lack probabilistic models for the systems they're studying that are anywhere near as complete as (let's say) the quantum model of the hydrogen atom. If you believe there are any systems in nature for which, given the limits of our observations, our probabilistic models can never achieve hydrogen-atom-like completeness (even in principle)---so that we'll always be forced to use tricks like MaxEnt---then you believe in freebits. There's nothing more to the notion than that.

Comment author: Manfred 17 June 2013 01:46:21PM *  6 points [-]

I think that the "absolute prediction" question is answered. I mean, I'm acquiring bits of information you can't physically know all the time just by entangling with air molecules that haven't reached you yet. But there's a separate question of "how important is that?" which is a combination of at least two different questions: first "how big an impact does flipping a qubit have on human cognitive actions?" and second "how much do I care that someone can't predict me exactly, if they can predict my macroscopic actions out to a time horizon of minutes / days / years?"

I think you're more concerned about absolute prediction relative to "pretty good" prediction than I am, which is a shame because that's the totally subjective part of the question :)

Comment author: ScottAaronson 18 June 2013 08:19:17AM 0 points [-]

As a point of information, I too am only interested in predicting macroscopic actions (indeed, only probabilistically), not in what you call "absolute prediction." The worry, of course, is that chaotic amplification of small effects would preclude even "pretty good" prediction.

Comment author: DSherron 17 June 2013 06:04:15PM -1 points [-]

1) I'm not so clear that the map/territory distinction in QM is entirely relevant to the map/territory distinction with regard to uncertainty. The fact that we do not know a fact, does not describe in any way the fact itself. Even if we could not, by physical law, possibly know the fact, this still does not equate to the fact having inherent unknowability. There is no functional difference between uncertainty caused by the proposed QM effects, and uncertainty caused by any other factor. So long as we are uncertain for any reason, we are uncertain. We can map that uncertainty with a probability distribution, which will exist in the map; in the territory everything is (at least according to the Many-Worlds interpretation Eliezer ascribes to, and which is the best explanation I've seen to date although I've not studied the subject extensively) determined, even if our experiences are probabilistic. Even if it turns out that reality really does just throw dice sometimes, that won't change our ability to map probability over our uncertainty. The proposed source of randomness is not any more or less "really random" than other QM effects, and we can still map a probability distribution over it. The point of drawing the map/territory distinction is to avoid the error of proposing special qualities onto parts of the territory that should only exist on the map. "Here there be randomness" is fine on the map to mark your uncertainty, but don't read that as ascribing a special "random" property to the territory itself. "Randomness" is not a fundamental feature of reality, as a concept, even if sometimes reality is literally picking numbers at random; you would be mistaken if you tried to draw on that fundamental "randomness" in any way that was not exactly equivalent to any other uncertainty, because on the map it looks exactly the same.

2) I'm not really qualified to evaluate such claims.

3) Refusing to bet is, itself, just making a different bet. Refusing a 50/50 bet with a 1000:1 payoff is just as stupid as picking the wrong side. Any such refusal is just selecting the alternative option of "100%: no loss no gain," and decision theories are certainly able to handle options of that nature. Plus, often in reality there is no way to avoid a bet entirely; usually "do nothing" is just one side of the bet. You can't ever avoid the results of decision theory; you are guaranteed to get worse payoffs on average by refusing to take the recommended actions, even in the real world. This is minimized by computational limitations, such that you usually wouldn't be implementing a decision theory anyway or would only be approximating one, but you will still lose out on average.

Comment author: ScottAaronson 17 June 2013 07:19:50PM *  11 points [-]

"Even if we could not, by physical law, possibly know the fact, this still does not equate to the fact having inherent unknowability."

I think the sentence above nicely pinpoints where I part ways from you and Eliezer. To put it bluntly, if a fact is impossible for any physical agent to learn, according to the laws of physics, then that's "inherently unknowable" enough for me! :-) Or to say it even more strongly: I don't actually care much whether someone chooses to regard the unknowability of such a fact as "part of the map" or "part of the territory" -- any more than, if a bear were chasing me, I'd worry about whether aggression was an intrinsic attribute of the bear, or an attribute of my human understanding of the bear. In the latter case, I mostly just want to know what the bear will do. Likewise in the former case, I mostly just want to know whether the fact is knowable -- and if it isn't, then why! I find it strange that, in the free-will discussion, so many commentators seem to pass over the empirical question (in what senses can human decisions actually be predicted?) without even evincing curiosity about it, in their rush to argue over the definitions of words. (In AI, the analogue would be the people who argued for centuries about whether a machine could be conscious, without --- until Turing --- ever cleanly separating out the "simpler" question, of whether a machine could be built that couldn't be empirically distinguished from entities we regard as conscious.) A central reason why I wrote the essay was to try to provide a corrective to this (by my lights) anti-empirical tendency.

"you would be mistaken if you tried to draw on that fundamental "randomness" in any way that was not exactly equivalent to any other uncertainty, because on the map it looks exactly the same."

Actually, the randomness that arises from quantum measurement is empirically distinguishable from other types of randomness. For while we can measure a state |psi> in a basis not containing |psi>, and thereby get a random outcome, we also could've measured |psi> in a basis containing |psi> -- in which case, we would've confirmed that a measurement in the first basis must give a random outcome, whose probability distribution is exactly calculable by the Born rule, and which can't be explainable in terms of subjective ignorance of any pre-existing degrees of freedom unless we give up on locality.

But the more basic point is that, if freebits existed, then they wouldn't be "random," as I use the term "random": instead they'd be subject to Knightian uncertainty. So they couldn't be collapsed with the randomness arising from (e.g.) the Born rule or statistical coarse-graining, for that reason even if not also for others.

"Refusing to bet is, itself, just making a different bet."

Well, I'd regard that statement as the defining axiom of a certain limiting case of economic thinking. In practice, however, most economic agents exhibit some degree of risk-aversion, which could be defined as "that which means you're no longer in the limiting case where everything is a bet, and the only question is which bet maximizes your expected utility."

View more: Next