Eliezer_Yudkowsky comments on Open Thread: February 2010 - Less Wrong

1 Post author: wedrifid 01 February 2010 06:09AM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (738)

You are viewing a single comment's thread.

Comment author: Eliezer_Yudkowsky 01 February 2010 08:30:11AM 6 points [-]

A query to Unknown, with whom I have this bet going:

Unknown: When someone designs a superintelligent AI (it won't be Eliezer), without paying any attention to Friendliness (the first person who does it won't), and the world doesn't end (it won't), it will be interesting to hear Eliezer's excuses.

EY: Unknown, do you expect money to be worth anything to you in that situation? If so, I'll be happy to accept a $10 payment now in exchange for a $1000 inflation-adjusted payment in that scenario you describe.

I recently found within myself a tiny shred of anticipation-worry about actually surviving to pay off the bet. Suppose that the rampant superintelligence proceeds to take over its future light cone but, in the process of dissembling existing humans, stores their mind-state. Some billions of years later, the superintelligence runs across an alien civilization which succeeded on their version of the Friendly AI problem and is at least somewhat "friendly" in the ordinary sense, concerned about other sentient lives; and the superintelligence ransoms us to them in exchange for some amount of negentropy which outweighs our storage costs. The humans alive at the time are restored and live on, possibly having been rescued by the alien values of the Super Happy People or some such, but at least surviving.

In this event, who wins the bet?

Comment author: CannibalSmith 01 February 2010 09:53:51AM 5 points [-]

SIAI: Utopia or hundred times your money back!

Eliezer, would you accept a bet $100 against $10000?

Comment author: Eliezer_Yudkowsky 01 February 2010 10:35:48AM 4 points [-]

On the same problem? I might attach some extra terms and conditions this time around, like "offer void (stakes will be returned) if the AI has the power and desire to use us for paperclips but our lives are ransomed by some other entity with the power to offer the AI more paperclips than it could produce by consuming us", "offer void if the explanation of the Fermi Paradox is a previously existing superintelligence which shuts down any new superintelligences produced", and "offer void if the AI consumes our physical bodies but we continue via the sort of weird anthropic scenario introduced in The Finale of the Ultimate Meta Mega Crossover." With those provisos, my probability drops off the bottom of the chart. I'm still not sure about the bet, though, because I want to keep my total of outstanding bets to something I can honor if they all simultaneously go wrong (no matter how surprising that would be to me), and this would use up $10,000 of that, even if it's on a sure thing - I might be able to get a better price on some other sure thing.

Comment author: Unknowns 01 February 2010 12:06:19PM 0 points [-]

If we survive by an anthropic situation (it's hard to see how that could preserve several persons together, but just in case), then you win the bet, since that would more like a second world than a continuation of this one.

If the AI is shut down before it has had a chance to operate, the bet wouldn't have been settled yet, so you wouldn't have to pay anything.

Anyway, I'm still going to win.

Comment author: ciphergoth 01 February 2010 08:42:23AM 5 points [-]

You definitely win. If I say "you'll get killed doing that" and you are, I shan't expect to pay back my winnings when you're reanimated.

Comment author: pjeby 01 February 2010 08:17:35PM 3 points [-]

Perhaps you've already defined "superintelligent" as meaning "self-directed, motivated, and recursively self-improving" rather than merely "able to provide answers to general questions faster and better than human beings"... but if you haven't, it seems to me that the latter definition of "superintelligent" would have a much higher probability of you losing the bet. (For example, a Hansonian "em" running on faster hardware and perhaps a few software upgrades might fit the latter definition.)

Comment author: Unknowns 01 February 2010 08:38:21AM 0 points [-]

I think I would win the bet. It wouldn't exactly be "the end of the world", but just a very strange future of the world.

Comment author: Eliezer_Yudkowsky 01 February 2010 08:41:42AM 3 points [-]

Really? Huh. To me that seems both pretty world-endy and strongly against the spirit of what was implied by your original statement... would you predict this outcome? Is it something that your model allows to happen? I know it's not something I would feel compelled to make excuses for - more like "I TOLD YOU SO!"

What exactly do you think happens in the scenario described?

Comment author: Unknowns 01 February 2010 10:04:01AM 1 point [-]

Ok, if you're sufficiently worried about the possibility of that outcome, I'll be happy to grant it to your side of the bet... even though at the time, it seemed to me clear that your assertion that the world would end meant that we wouldn't continue as conscious beings.

I definitely wouldn't predict that outcome. I would be very surprised, since I think the world will continue in the usual way. But is it really that likely even on your model?