Are you saying that people can't deal with regular probability theory, but can deal with two-level "probabilities of probabilities"?
Not in full generality. There may be instances though. I don't know how to articulate my intuitions here, without going into examples that are sufficiently involved so that they'd derail the conversation. If nothing else, it's true that a probability estimate does not suffice to capture the knowledge that one has about an event and that one can better use probabilities as an input into one's epistemology if one keeps this in mind.
Some people have maintained that there are events to which there's no rational basis for assigning probabilities. For example, John Maynard Keynes wrote of "uncertainty" in the following sense:
This sort of uncertainty is sometimes referred to as Knightian uncertainty.
MIRI is interested in making probabilistic predictions about events such as the creation of general artificial intelligence, which are without precedent, and which therefore cannot be assigned probabilities via frequentist means. Some of these events are presumably of the type that Keynes had in mind. At MIRI's request, I did a literature review looking for arguments against there being a rational basis for assigning probabilities to such events.
Definitions of subjective probability
One can attempt to define the subjective probability that an agent assigns to an event to be, intuitively, the number that it would assign if it were to make a very large number of predictions with a view toward, for each x, assigning probability x% to a collection of events of which x% actually occur. Eliezer discusses the mathematical formalism behind this in A Technical Explanation of a Technical Explanation.
Other definitions of subjective probabilities have been given by Ramsey (1931), de Finetti (1937), Koopman (1940), Good (1950), Savage (1954), Davidson and Suppes (1956), Kraft, Pratt and Seidenberg (1959), Anscombe and Aumann (1963) and Wakker (1989). (Fishburn (1986) gives a survey of the literature.) I have not studied the mathematical formalisms of most of these papers, but here's a definition inspired by them (one which is immune to some of the criticisms that have been raised against some of the definitions).
Assume that for each number p between 0 and 1, there is a random process R that yields an outcome O' with "objective" probability p. Here "objective" probability refers to a probability that can be determined via physics or frequentist means. Your subjective probability of an event E is defined as follows. Suppose that you have an event F, that you strongly desire to happen, and a choice between the following options:
Consider the set S of values of p such that you'd prefer #2 over #1. Then your subjective probability q of E is defined to be the greatest lower bound of S.
(F is usually taken to be a monetary reward arising from a bet.)
For example, suppose that E and F are the both the event "humanity survives for millions of years" and you have the opportunity to push a button that will guarantee this with probability p and otherwise guarantee that this does not happen. If you're willing to push it when p = 99.999%, that means that you assign a probability less than 99.999% to humanity surviving for millions of years. If you're not willing to push it when p = 0.001%, that means that you assign a probability greater than 0.001% to humanity surviving for millions of years.
Some objections to the definition are:
These two objections also apply to the definition that Eliezer discusses in A Technical Explanation of a Technical Explanation.
Addressing these points in turn:
Pragmatic objections to assigning subjective probabilities
Even if subjective probabilities are well-defined (up to the two issues mentioned above), assigning a subjective probability in a given instance could be bad for one's epistemology. Some proponents of the idea of Knightian uncertainty may implicitly adhere to this position. Some ways in which assigning a subjective probability can lead one astray are given below.
Overconfidence in models
Suppose that one has a model of the world that one thinks is probably right and according to which the probability of an event E is extremely small. If one forgets that the model might be wrong, one might erroneously conclude that the probability of E occurring is extremely small. (Yvain discussed this in Confidence levels inside and outside an argument.)
This appears to be close to Keynes' objection to assigning subjective probabilities. I have not studied Keynes' original work, but several people who have written about him seem to implicitly ascribe this position to him. For example, in a book review discussing Keynes, John Gray wrote:
One can assign a probability to one's model of the world being accurate, to account for model uncertainty. Keynes' position is perhaps best interpreted as a statement about effect size: a claim that the probability that one should assign to one's model being inaccurate is large.
Insensitivity to robustness of evidence
Kyburg (1968) argues that probabilities don't adequately pick up on robustness of evidence. He gives the example of drawing balls from an urn with black and white balls of unknown relative frequencies. He says that there's a big difference between
saying
A single probability estimate does not pick up on how much one should update in response to incoming evidence. If one assigns a probability p to an event, one might mentally categorize the event in the reference class "events with probability p" and update too little or too much in response to incoming evidence on account of anchoring on other events of probability p (for which the probability is more robustly established or less robustly established than for the event in question).
This may be addressed by replacing a subjective probability of an event with a probability distribution for an event: for each number p between 0 and 1, associating a probability qp that the event occurs with probability p. Quoting page 67 of Handbook of Risk Theory
Probability, knowledge, and meta-probability discusses E.T. Jaynes' approach to this.
Suppression of dependency of events
Given two events A and B to which one assigns probabilities p and q, the numbers p and q do not suffice to determine the probability that events A and B both occur. If one assigns probabilities to events, and forgets where the probabilities came from, there's a risk of tacitly assuming that the events are independent, and assigning probability pq to the conjunction of p and q, when the probability of the conjunction could be much higher or much lower. According to chapter 1 of Nate Silver's book The Signal and the Noise, similar mistakes contributed to the 2008 financial crisis: people in finance assigned a much smaller probability of a very large number of houses' prices dropping than they did to a smaller number of houses' prices dropping, even though the prices of different houses were correlated.
Conclusion
While some people have said that subjective probabilities of arbitrary events are not meaningful, there are definitions that make the notion of subjective probability meaningful, though arguably only as an intervals rather than as numbers. Using intervals rather than numbers addresses some of the objections that have been raised.
A large part of the debate about whether one should assign subjective probabilities to arbitrary events is perhaps best conceptualized as a debate about how large the probability intervals that one assigns should be. In Worst Case Scenarios (pg 160) Sunstein wrote
In any given instance, one has the question of how much can be said. If you have a model of the world M that's accurate with probability at least p and M predicts an event E with probability at least q, then the probability of E is at least pq. If p is low, then this doesn't give a good lower bound on the probability of E. But suppose you have 2 independent models M1, and M2, where Mi is accurate with probability at least pi and where Mi predicts E with probability at least qi. Then the probability of E is bounded below by p1q1 + p2q2 - p1q1p2q2. So by using model combination you can get a better lower bound on the probability of E (although in practice the models used may not be fully independent, and if they're positively correlated then the lower bound will be worse).
The ways in which assigning subjective probabilities can be bad for one's epistemology seem to fall under the broad heading "failing to incorporate all of one's knowledge when assigning a probability and then using it uncritically, or forgetting that the probability that you assign to an event does not fully capture your knowledge pertaining to the event." These issues can be at least partially mitigated by keeping them in mind.