AI Risk and Opportunity: A Strategic Analysis

lukeprog

12 AI Risk and Opportunity: A Strategic Analysis

4th Mar 2012

3 min read

12

Suppose you buy the argument that humanity faces both the risk of AI-caused extinction and the opportunity to shape an AI-built utopia. What should we do about that? As Wei Dai asks, "In what direction should we nudge the future, to maximize the chances and impact of a positive intelligence explosion?"

This post serves as a table of contents and an introduction for an ongoing strategic analysis of AI risk and opportunity.

Contents:

Introduction (this post)
Humanity's Efforts So Far
A Timeline of Early Ideas and Arguments
Questions We Want Answered
Strategic Analysis Via Probability Tree
Intelligence Amplification and Friendly AI
...

Why discuss AI safety strategy?

The main reason to discuss AI safety strategy is, of course, to draw on a wide spectrum of human expertise and processing power to clarify our understanding of the factors at play and the expected value of particular interventions we could invest in: raising awareness of safety concerns, forming a Friendly AI team, differential technological development, investigating AGI confinement methods, and others.

Discussing AI safety strategy is also a challenging exercise in applied rationality. The relevant issues are complex and uncertain, but we need to take advantage of the fact that rationality is faster than science: we can't "try" a bunch of intelligence explosions and see which one works best. We'll have to predict in advance how the future will develop and what we can do about it.

Core readings

Before engaging with this series, I recommend you read at least the following articles:

Muehlhauser & Salamon, Intelligence Explosion: Evidence and Import (2013)
Yudkowsky, AI as a Positive and Negative Factor in Global Risk (2008)
Chalmers, The Singularity: A Philosophical Analysis (2010)

Example questions

Which strategic questions would we like to answer? Muehlhauser (2011) elaborates on the following questions:

What methods can we use to predict technological development?
Which kinds of differential technological development should we encourage, and how?
Which open problems are safe to discuss, and which are potentially dangerous?
What can we do to reduce the risk of an AI arms race?
What can we do to raise the "sanity waterline," and how much will this help?
What can we do to attract more funding, support, and research to x-risk reduction and to specific sub-problems of successful Singularity navigation?
Which interventions should we prioritize?
How should x-risk reducers and AI safety researchers interact with governments and corporations?
How can optimal philanthropists get the most x-risk reduction for their philanthropic buck?
How does AI risk compare to other existential risks?
Which problems do we need to solve, and which ones can we have an AI solve?
How can we develop microeconomic models of WBEs and self-improving systems?
How can we be sure a Friendly AI development team will be altruistic?

Salamon & Muehlhauser (2013) list several other questions gathered from the participants of a workshop following Singularity Summit 2011, including:

How hard is it to create Friendly AI?
What is the strength of feedback from neuroscience to AI rather than brain emulation?
Is there a safe way to do uploads, where they don't turn into neuromorphic AI?
How possible is it to do FAI research on a seastead?
How much must we spend on security when developing a Friendly AI team?
What's the best way to recruit talent toward working on AI risks?
How difficult is stabilizing the world so we can work on Friendly AI slowly?
How hard will a takeoff be?
What is the value of strategy vs. object-level progress toward a positive Singularity?
How feasible is Oracle AI?
Can we convert environmentalists into people concerned with existential risk?
Is there no such thing as bad publicity [for AI risk reduction] purposes?

These are the kinds of questions we will be tackling in this series of posts for Less Wrong Discussion, in order to improve our predictions about which direction we can nudge the future to maximize the chances of a positive intelligence explosion.

Personal Blog

12

New Comment

Rendering 0/163 comments, sorted by

top scoring

(show more) Click to highlight new comments since: Today at 6:45 PM

Some comments are truncated due to high volume. (⌘F to expand all)Change truncation settings

Moderation Log

12 AI Risk and Opportunity: A Strategic Analysis

by lukeprog

4th Mar 2012

3 min read

163

12

This post serves as a table of contents and an introduction for an ongoing strategic analysis of AI risk and opportunity.

Contents:

Introduction (this post)
Humanity's Efforts So Far
A Timeline of Early Ideas and Arguments
Questions We Want Answered
Strategic Analysis Via Probability Tree
Intelligence Amplification and Friendly AI
...

Why discuss AI safety strategy?

Core readings

Before engaging with this series, I recommend you read at least the following articles:

Muehlhauser & Salamon, Intelligence Explosion: Evidence and Import (2013)
Yudkowsky, AI as a Positive and Negative Factor in Global Risk (2008)
Chalmers, The Singularity: A Philosophical Analysis (2010)

Example questions

Which strategic questions would we like to answer? Muehlhauser (2011) elaborates on the following questions:

What methods can we use to predict technological development?
Which kinds of differential technological development should we encourage, and how?
Which open problems are safe to discuss, and which are potentially dangerous?
What can we do to reduce the risk of an AI arms race?
What can we do to raise the "sanity waterline," and how much will this help?
What can we do to attract more funding, support, and research to x-risk reduction and to specific sub-problems of successful Singularity navigation?
Which interventions should we prioritize?
How should x-risk reducers and AI safety researchers interact with governments and corporations?
How can optimal philanthropists get the most x-risk reduction for their philanthropic buck?
How does AI risk compare to other existential risks?
Which problems do we need to solve, and which ones can we have an AI solve?
How can we develop microeconomic models of WBEs and self-improving systems?
How can we be sure a Friendly AI development team will be altruistic?

Salamon & Muehlhauser (2013) list several other questions gathered from the participants of a workshop following Singularity Summit 2011, including:

How hard is it to create Friendly AI?
What is the strength of feedback from neuroscience to AI rather than brain emulation?
Is there a safe way to do uploads, where they don't turn into neuromorphic AI?
How possible is it to do FAI research on a seastead?
How much must we spend on security when developing a Friendly AI team?
What's the best way to recruit talent toward working on AI risks?
How difficult is stabilizing the world so we can work on Friendly AI slowly?
How hard will a takeoff be?
What is the value of strategy vs. object-level progress toward a positive Singularity?
How feasible is Oracle AI?
Can we convert environmentalists into people concerned with existential risk?
Is there no such thing as bad publicity [for AI risk reduction] purposes?

Personal Blog

12

Mentioned in

71Reply to Holden on The Singularity Institute

63How can I reduce existential risk from AI?

56AI Risk and Opportunity: Humanity's Efforts So Far

51Original Research on Less Wrong

482012 Winter Fundraiser for the Singularity Institute

Load More (5/11)

New Comment

Rendering 0/163 comments, sorted by

top scoring

(show more) Click to highlight new comments since: Today at 6:45 PM

Some comments are truncated due to high volume. (⌘F to expand all)Change truncation settings

Moderation Log

More from lukeprog

Curated and popular this week

163Comments

163

Comment Permalink

lukeprog14y30

What could an FAI project look like? Louie points out that it might look like Princeton's Institute for Advanced Study:

Created as a haven for thinking, the Institute [for Advanced Study] remains for many the Shangri-la of academe: a playground for the scholarly superstars who become the Institute's permanent faculty. These positions carry no teaching duties, few administrative responsibilities, and high salaries, and so represent a pinnacle of academic advancement. The expectation is that given this freedom, the professors at the Institute will think the big thoughts that can propel social and intellectual progress. Over the years the permanent faculty has included Nobel laureates as well as recipients of almost every other intellectual honor. Among the mathematicians, there have been several winners of the Fields Medal…

If the permanent faculty makes up the intellectual foundation of the Institute, the lifeblood is provided by the parade of international visitors who bring a continuous influx of new ideas. They may come for as little as an afternoon, or as long as a few years, in which case they take up temporary positions as Institute "members."

Idea #1: Write a good, very technical Open Problems in Friendly Artificial Intelligence, get a few of the best mathematicians/physicists who care about FAI accepted as visitors, have them talk to faculty and visitors at about the technical problems related to FAI.

Idea #2: Convince wealthy donors to endow a chair at the Institute for Advanced Study for somebody to do FAI research. (Princeton may not mind us sending another brilliant person and a bunch of money their way.)

Similar research institutes: PARC, Bell Labs, Perimeter Institute, maybe others?

Wei Dai14y50

How do you maintain secrecy in such a setting? Or is there a new line of thought that says secrecy isn't necessary for an FAI project?

13gwern14y

But did the IAS actually succeed? Off-hand, the only thing I can think of them for was hosting Einstein in his crankish years, Kurt Godel before he want crazy, and Von Neumann's work on a real computer (which they disliked and wanted to get rid of). Richard Hamming, who might know, said: [...] (My own thought is to wonder if this is kind of a regression to the mean, or perhaps regression due to aging.)

See in context