While figuring out my error in my solution to the Ultimate Newcomb's Problem, I ran across this (distinct) reformulation that helped me distinguish between what I was doing and what the problem was actually asking.
... but that being said, I'm not sure if my answer to the reformulation is correct either.
The question, cleaned for Discussion, looks like this:
You approach the boxes and lottery, which are exactly as in the UNP. Before reaching it, you come to sign with a flashing red light. The sign reads: "INDEPENDENT SCENARIO BEGIN."
Omega, who has predicted that you will be confused, shows up to explain: "This is considered an artificially independent experiment. Your algorithm for solving this problem will not be used in my simulations of your algorithm for my various other problems. In other words, you are allowed to two-box here but one-box Newcomb's problem, or vice versa."
This is motivated by the realization that I've been making the same mistake as in the original Newcomb's Problem, though this justification does not (I believe) apply to the original. The mistake is simply this: that I assumed that I simply appear in medias res. When solving the UNP, it is (seems to be) important to remember that you may be in some very rare edge case of the main problem, and that you are choosing your algorithm for the problem as a whole.
But if that's not true - if you're allowed to appear in the middle of the problem, and no counterfactual-yous are at risk - it sure seems like two-boxing is justified - as khafra put it, "trying to ambiently control basic arithmetic".
(Speaking of which, is there a write up of ambient decision theory anywhere? For that matter, is there any compilation of decision theories?)
EDIT: (Yes to the first, though not under that name: Controlling Constant Programs.)
Right, which would be silly, so I wouldn't do that.
Oh, I see what's confusing me. The "Interrupted" version of the classic Newcomb's Problem is this: replace Omega with DumbBot that doesn't even try to predict your actions, it just gives you outcomes at random. So you can't affect your counterfactual selves, and don't even bother - just two-box.
This problem - which I should rename to the Interrupted Ultimate Newcomb's Problem - does require Omega. It would look like this: from Omega's end, Omega simulates a jillion people, as you put it, and finds all the people who produce primes or nonprimes (depending on the primality of 1033), and then poses this question only to those people. From your point of view, though, you know neither the primality of 1033 nor your own eventual answer, so it seems like you can ambiently control 1033 to be composite - and the versions of you that didn't make whatever choice you make are never part of the experiment, so who cares?