Suppose we could look into the future of our Everett branch and pick out those sub-branches in which humanity and/or human/moral values have survived past the Singularity in some form. What would we see if we then went backwards in time and look at how that happened? Here's an attempt to answer that question, or in other words to enumerate the not completely disastrous Singularity scenarios that seem to have non-negligible probability. Note that the question I'm asking here is distinct from "In what direction should we try to nudge the future?" (which I think logically ought to come second).
- Uploading first
- Become superintelligent (self-modify or build FAI), then take over the world
- Take over the world as a superorganism
- self-modify or build FAI at leisure
- (Added) stasis
- Competitive upload scenario
- (Added) subsequent singleton formation
- (Added) subsequent AGI intelligence explosion
- no singleton
- IA (intelligence amplification) first
- Clone a million von Neumanns (probably government project)
- Gradual genetic enhancement of offspring (probably market-based)
- Pharmaceutical
- Direct brain/computer interface
- What happens next? Upload or code?
- Code (de novo AI) first
- Scale of project
- International
- National
- Large Corporation
- Small Organization
- Secrecy - spectrum between
- totally open
- totally secret
- Planned Friendliness vs "emergent" non-catastrophe
- If planned, what approach?
- "Normative" - define decision process and utility function manually
- "Meta-ethical" - e.g., CEV
- "Meta-philosophical" - program AI how to do philosophy
- If emergent, why?
- Objective morality
- Convergent evolution of values
- Acausal game theory
- Standard game theory (e.g., Robin's idea that AIs in a competitive scenario will respect human property rights due to standard game theoretic considerations)
- If planned, what approach?
- Competitive vs. local FOOM
- Scale of project
- (Added) Simultaneous/complementary development of IA and AI
Sorry if this is too cryptic or compressed. I'm writing this mostly for my own future reference, but perhaps it could be expanded more if there is interest. And of course I'd welcome any scenarios that may be missing from this list.
Yes, the argument would only remove a reason for seeing this as a strict logical impossibility (for us).
Sufficiently smart AGI precommits to cooperate with every other super-intelligence it meets that has made a similar precommitment. This acausally ensures that a big set of super-minds will cooperate with the AGI if they meet it, thereby producing huge tracts of expected value.
The AGI also precommits to cooperate with some super-minds that don't exist yet, by leaving their potential creators alone -- it won't interfere in the slightest with any species or star system that might produce a super-mind. This protects the AGI from counterfactual interference that would have prevented its existence, and more importantly, protects it from retaliation by hypothetical super-minds that care about protection from counterfactuals. 2.1: It does not precommit to leaving its own creators alone so they have a chance to create paperclip-maximizers in all shapes and sizes. The AGI's simulation of a stronger mind arose before any other super-mind, and knows this holds true for its own planet -- so the sim does not care about the fate of counterfactual future rivals from said planet. Nor does the AGI itself perceive a high expected value to negotiating with people it decided to kill before it could start modelling them.
As for the problem with #2, while I agree that the trap in the linked OP fails, the one in the linked comment seems valid. You still have to bite the bullet and accommodate the whims of parents with unrealistically good predictive abilities, in this hypothetical. (I guess they taught you about TDT for this purpose.) Or let's say that branch-jumping works but the most cheerful interpretation of it does not -- let's say you have to negotiate acausally with a misery-maximizer and a separate joy-minimizer to ensure your existence. I don't know exactly how that bullet would taste, but I don't like the looks of it.
It could make this precommitment before before learning that it was the oldest on its planet. Even if it did not actually make this precommitment, a well-programmed AI should abide by any precommitments it would have made if it had thought of them; otherwise it could lose expected utilons when it faces a problem that it could have made a precomm... (read more)