Instead of using "optimal general prior", I should have said that I was pessimistic about the existence of a standard for evaluating priors (or, more properly, prior probability distributions) that is optimal in all circumstances, if that's any clearer.
Having thought about the problem some more, though, I think my pessimism may have been premature.
A prior probability distribution is nothing more than a weighted set of hypotheses. A perfect Bayesian would consider every possible hypothesis, which is impossible unless hypotheses are countable, and they aren't; the ideal for Bayesian reasoning as I understand it is thus unattainable, but this doesn't mean that there are benefits to be found in moving toward that ideal.
So, perfect Bayesian or not, we have some set of hypotheses which need to be located before we can consider them and assign them a probabilistic weight. Before we acquire any rational evidence at all, there is necessarily only one factor that we can use to distinguish between hypotheses: how hard they are to locate. If it is also true that hypotheses which are easier to locate make more predictions and that hypotheses which make more predictions are more useful (and while I have not seen proofs of these propositions I'm inclined to suspect that they exist), then we are perfectly justified in assigning a probability to a hypothesis based on it's locate-ability.
This reduces the problem of prior probability evaluation to the problem of locate-ability evaluation, to which it seems maxent and its fellows are proposed answers. It's again possible there is no objectively best way to evaluate locate-ability, but I don't yet see a reason for this to be so.
Again, if I've mis-thought or failed to justify a step in my reasoning, please call me on it.
If it is also true that hypotheses which are easier to locate make more predictions
This doesn't sound right to me. Imagine you're tossing a coin repeatedly. Hypothesis 1 says the coin is fair. Hypothesis 2 says the coin repeats the sequence HTTTHHTHTHTTTT over and over in a loop. The second hypothesis is harder to locate, but makes a stronger prediction.
The proper formalization for your concept of locate-ability is the Solomonoff prior. Unfortunately we can't do inference based on it because it's uncomputable.
Maxent and friends aren't motivated by a des...
This thread is for the discussion of Less Wrong topics that have not appeared in recent posts. If a discussion gets unwieldy, celebrate by turning it into a top-level post.
This thread brought to you by quantum immortality.