Quote from Richard Feynman explaining why there are no objects here.
I've begun a STEM-compatible attempt to explain a "no objectively-given objects" ontology in "Boundaries, objects, and connections." That's supposed to be the introduction to a book chapter that is extensively drafted but not yet polished enough to publish.
Really glad you are working on this also!
I would not say that selves don't exist (although it's possible that I have done so somewhere, sloppily).
Rather, that selves are both nebulous and patterned ("empty forms," in Tantric terminology).
Probably the clearest summary of that I've written so far is "Selfness," which is supposed to be the introduction to a chapter of the Meaningness book that does not yet otherwise exist.
Renouncing the self is characteristic of Sutrayana.
[FWOMP Summoned spirit appears in Kaj's chalk octogram. Gouts of eldritch flame, etc. Spirit squints around at unfamiliar environment bemusedly. Takes off glasses, holds them up to the candlelight, grimaces, wipes glasses on clothing, replaces on nose. Grunts. Speaks:]
Buddhism is a diverse family of religions, with distinct conceptions of enlightenment. These seem to be quite different and contradictory.
According to one classification of disparate doctrines, Buddhism can be divided into Vajrayana (Tantra plus Dzogchen) and Sutrayana (everything else, excep...
A puzzling question is why your brain doesn't get this right automatically. In particular, deciding whether to gather some food before sleeping is an issue mammals have faced in the EEA for millions of years.
Temporal difference learning seems so basic that brains ought to implement it reasonably accurately. Any idea why we might do the wrong thing in this case?
One aspect of what I consider the correct solution is that the only question that needs to be answered is "do I think putting a coin in the box has positive or negative utility", and one can answer that without any guess about what it is actually going to do.
What is your base rate for boxes being able to drive you mad if you put a coin in them?
Can you imagine any mechanism whereby a box would drive you mad if you put a coin in it? (I can't.)
Well, the problem I was thinking of is "the universe is not a bit string." And any unbiased representation we can make of the universe as a bit string is going to be extremely large—much too large to do even sane sorts of computation with, never mind Solomonoff.
Maybe that's saying the same thing you did? I'm not sure...
I can't guarantee you won't get blown up
Yes—this is part of what I'm driving at in this post! The kinds of problems that probability and decision theory work well for have a well-defined set of hypotheses, actions, and outcomes. Often the real world isn't like that. One point of the black box is that the hypothesis and outcome spaces are effectively unbounded. Trying to enumerate everything it could do isn't really feasible. That's one reason the uncertainty here is "Knightian" or "radical."
In fact, in the real world, "and then ...
Part of the motivation for the black box experiment is to show that the metaprobability approach breaks down in some cases. Maybe I ought to have made that clearer! The approach I would take to the black box does not rely on metaprobability, so let's set that aside.
So, your mind is already in motion, and you do have priors about black boxes. What do you think you ought to in this case? I don't want to waste your time with that... Maybe the thought experiment ought to have specified a time limit. Personally, I don't think enumerating things the boxes could possibly do would be helpful at all. Isn't there an easier approach?
Yes, I'm not at all committed to the metaprobability approach. In fact, I concocted the black box example specifically to show its limitations!
Solomonoff induction is extraordinarily unhelpful, I think... that it is uncomputable is only one reason.
I think there's a fairly simple and straightforward strategy to address the black box problem, which has not been mentioned so far...
This is interesting—it seems like the project here would be to construct a universal, hierarchical ontology of every possible thing a device could do? This seems like a very big job... how would you know you hadn't left out important possibilities? How would you go about assigning probabilities?
(The approach I have in mind is simpler...)
Well, regardless of the value of metaprobability, or its lack of value, in the case of the black box, it doesn't seem to offer any help in finding a decision strategy. (I find it helpful in understanding the problem, but not in formulating an answer.)
How would you go about choosing a strategy for the black box?
The problem of what to expect from the black box?
I'd think about it like this: suppose that I hand you a box with a slot in it. What do you expect to happen if you put a quarter into the slot?
To answer this we engage our big amount of human knowledge about boxes and people who hand them to you. It's very likely that nothing at all will happen, but I've also seen plenty of boxes that also emit sound, or gumballs, or temporary tattoos, or sometimes more quarters. But suppose that I have previously handed you a box that emits more quarters sometimes when y...
Hi, I have a site tech question. (Sorry if this is the wrong place to post that!—I couldn't find any other.)
I can't find a way to get email notifications of comment replies (i.e. when my inbox icon goes red). If there is one, how do I turn it on?
If there isn't one, is that a deliberate design feature, or a limitation of the software, or...?
Thanks (and thanks especially to whoever does the system maintenance here—it must be a big job.)
Then why use it instead of learning the standard terms and using those?
The standard term is A_p, which seemed unnecessarily obscure.
Re the figure, see the discussion here.
(Sorry to be slow to reply to this; I got busy and didn't check my LW inbox for more than a month.)
Thank you very much—link fixed!
That's a really funny quote!
Multi-armed bandit problems were intractable during WWII probably mainly because computers weren't available yet. In many cases, the best approach is brute force simulation. That's the way I would approach the "blue box" problem (because I'm lazy).
But exact approaches have also been found: "Burnetas AN and Katehakis MN (1996) also provided an explicit solution for the important case in which the distributions of outcomes follow arbitrary (i.e., nonparametric) discrete, univariate distributions." The blue box problem is within that class.
Thanks, yes! I.e. who is this "everyone else," and where do they treat it the same way Jaynes does? I'm not aware of any examples, but I have only a basic knowledge of probability theory.
It's certainly possible that this approach is common, but Jaynes wasn't ignorant, and he seemed to think it was a new and unusual and maybe controversial idea, so I kind of doubt it.
Also, I should say that I have no dog in this fight at all; I'm not advocating "Jaynes is the greatest thing since sliced bread", for example. (Although that does seem to be the opinion of some LW writers.)
Thanks, that's really funny! "On the other hand" is my general approach to life, so I'm happy to argue with myself.
And yes, I'm steelmanning. I think this approach is an excellent one in some cases; it will break down in others. I'll present a first one in the next article. It's another box you can put coins in that (I'll claim) can't usefully be modeled in this way.
Here's the quote from Jaynes, by the way:
...What are we doing here? It seems almost as if we are talking about the ‘probability of a probability’. Pending a better understanding of wha
Yes, meta-probabilities are probabilities, although somewhat odd ones; they obey the normal rules of probability. Jaynes discusses this in his Chapter 18; his discussion there is worth a read.
The statement "probability estimates are not, by themselves, adequate to make rational decisions" was meant to describe the entire sequence, not this article.
I've revised the first paragraph of the article, since it seems to have misled many readers. I hope the point is clearer now!
Are you claiming there's no prior distribution over sequences which reflects our knowledge?
No. Well, not so long as we're allowed to take our own actions into account!
I want to emphasize—since many commenters seem to have mistaken me on this—that there's an obvious, correct solution to this problem (which I made explicit in the OP). I deliberately made the problem as simple as possible in order to present the A_p framework clearly.
Are we talking about the Laplace vs. fair coins?
Not sure what you are asking here, sorry...
We could also try to summarize some features of such epistemic states by talking about the instability of estimates - the degree to which they are easily updated by knowledge of other events
Yes, this is Jaynes' A_p approach.
this will be a derived feature of the probability distribution, rather than an ontologically extra feature of probability.
I'm not sure I follow this. There is no prior distribution for the per-coin payout probabilities that can accurately reflect all our knowledge.
...I reject that this is a good reason for probability theorists to
So, let me try again to explain why I think this is missing the point... I wrote "a single probability value fails to capture everything you know about an uncertain event." Maybe "simple" would have been better than "single"?
The point is that you can't solve this problem without somehow reasoning about probabilities of probabilities. You can solve it by reasoning about the expected value of different strategies. (I said so in the OP; I constructed the example to make this the obviously correct approach.) But those strategies c...
Glad you liked it!
I also get "stop after two losses," although my numbers come out slightly differently. However, I suck at this sort of problem, so it's quite likely I've got it wrong.
My temptation would be to solve it numerically (by brute force), i.e. code up a simulation and run it a million times and get the answer by seeing which strategy does best. Often that's the right approach. However, sometimes you can't simulate, and an analytical (exact, a priori) answer is better.
I think you are right about the sportsball case! I've updated my meta...
Yup, it's definitely wrong! I was hoping no one would notice. I thought it would be a distraction to explain why the two are different (if that's not obvious), and also I didn't want to figure out exactly what the right math was to feed to my plotting package for this case. (Is the correct form of the curve for the p=0 case obvious to you? It wasn't obvious to me, but this isn't my area of expertise...)
Decisions are made on the basis of expected value, not probability.
Yes, that's the point here!
your analysis of the first bet ignores the value of the information gained from it in executing your options for further play thereafter.
By "the first bet" I take it that you mean "your first opportunity to put a coin in a green box" (rather than meaning "brown box").
My analysis of that was "you should put some coins in the box", exactly because of the information gain.
...This statement indicates a lack of understandin
I don't think is demonstrated at all by this example.
Yes, I see your point (although I don't altogether agree). But, again, what I'm doing here is setting up analytical apparatus that will be helpful for more difficult cases later.
In the mean time, the LW posts I pointed to here may motivate more strongly the claim that probability alone is an insufficient guide to action.
I'm sure you know more about this than I do! Based on a quick Wiki check, I suspect that formally the A_p are one type of hyperprior, but not all hyperpriors are A_p (a/k/a metaprobabilities).
Hyperparameters are used in Bayesian sensitivity analysis, a/k/a "Robust Bayesian Analysis", which I recently accidentally reinvented here. I might write more about that later in this sequence.
It may be helpful to read some related posts (linked by lukeprog in a comment on this post): Estimate stability, and Model Stability in Intervention Assessment, which comments on Why We Can't Take Expected Value Estimates Literally (Even When They're Unbiased). The first of those motivates the A_p (meta-probability) approach, the second uses it, and the third explains intuitively why it's important in practice.
Jeremy, I think the apparent disagreement here is due to unclarity about what the point of my argument was. The point was not that this situation can't be analyzed with decision theory; it certainly can, and I did so. The point is that different decisions have to be made in two situations where the probabilities are the same.
Your discussion seems to equate "probability" with "utility", and the whole point of the example is that, in this case, they are not the same.
Jeremy, thank you for this. To be clear, I wasn't suggesting that meta-probability is the solution. It's a solution. I chose it because I plan to use this framework in later articles, where it will (I hope) be particularly illuminating.
I would take issue with the first section of this article in which you establish single probability (expected utility) calculations as insufficient for the problem.
I don't think it's correct to equate probability with expected utility, as you seem to do here. The probability of a payout is the same in the two situations....
Hi!
I’ve been interested in how to think well since early childhood. When I was about ten, I read a book about cybernetics. (This was in the Oligocene, when “cybernetics” had only recently gone extinct.) It gave simple introductions to probability theory, game theory, information theory, boolean switching logic, control theory, and neural networks. This was definitely the coolest stuff ever.
I went on to MIT, and got an undergraduate degree in math, specializing in mathematical logic and the theory of computation—fields that grew out of philosophical investi...
Can you recommend an explanation of the complete class theorem(s)? Preferably online. I've been googling pretty hard and I've turned up almost nothing. I'd like to understand what conditions they start from (suspecting that maybe the result is not quite as strong as "Bayes Rules!"). I've found only one paper, which basically said "what Wald proved is extremely difficult to understand, and probably not what you wanted."
Thank you very much!
A collection of collections of advice for graduate students! http://vlsicad.ucsd.edu/Research/Advice/
A collection of advice for graduate students I put together some time ago: http://www.cs.indiana.edu/mit.research.how.to.html
It was meant specifically for people at the MIT AI Lab, but much of it is applicable to other STEM fields.
Regarding the development of agreeableness/empathy: there are meditation techniques specifically intended to do this. (They are variously called "Metta", "Lojong", "Tonglen", or (yuck) "loving kindness meditation"; all of which are pretty similar.) These originate in Mahayana Buddhism, but don't have any specifically religious content. They are often taught in conjunction with mindfulness meditation.
I don't know whether there have been any serious studies on these methods, but anecdotally they are highly effective...
Have you read Minsky's _Society of Mind_? It is an AI-flavored psychological model of subagents that draws heavily on psychotherapeutic ideas. It seems quite similar in flavor to what you propose here. It inspired generations of students at the MIT AI Lab (although attempts to code it never worked out).