AI Risk and Opportunity: A Strategic Analysis

lukeprog

4th Mar 2012

Suppose you buy the argument that humanity faces both the risk of AI-caused extinction and the opportunity to shape an AI-built utopia. What should we do about that? As Wei Dai asks, "In what direction should we nudge the future, to maximize the chances and impact of a positive intelligence explosion?"

This post serves as a table of contents and an introduction for an ongoing strategic analysis of AI risk and opportunity.

Contents:

Introduction (this post)
Humanity's Efforts So Far
A Timeline of Early Ideas and Arguments
Questions We Want Answered
Strategic Analysis Via Probability Tree
Intelligence Amplification and Friendly AI
...

Why discuss AI safety strategy?

The main reason to discuss AI safety strategy is, of course, to draw on a wide spectrum of human expertise and processing power to clarify our understanding of the factors at play and the expected value of particular interventions we could invest in: raising awareness of safety concerns, forming a Friendly AI team, differential technological development, investigating AGI confinement methods, and others.

Discussing AI safety strategy is also a challenging exercise in applied rationality. The relevant issues are complex and uncertain, but we need to take advantage of the fact that rationality is faster than science: we can't "try" a bunch of intelligence explosions and see which one works best. We'll have to predict in advance how the future will develop and what we can do about it.

Core readings

Before engaging with this series, I recommend you read at least the following articles:

Muehlhauser & Salamon, Intelligence Explosion: Evidence and Import (2013)
Yudkowsky, AI as a Positive and Negative Factor in Global Risk (2008)
Chalmers, The Singularity: A Philosophical Analysis (2010)

Example questions

Which strategic questions would we like to answer? Muehlhauser (2011) elaborates on the following questions:

What methods can we use to predict technological development?
Which kinds of differential technological development should we encourage, and how?
Which open problems are safe to discuss, and which are potentially dangerous?
What can we do to reduce the risk of an AI arms race?
What can we do to raise the "sanity waterline," and how much will this help?
What can we do to attract more funding, support, and research to x-risk reduction and to specific sub-problems of successful Singularity navigation?
Which interventions should we prioritize?
How should x-risk reducers and AI safety researchers interact with governments and corporations?
How can optimal philanthropists get the most x-risk reduction for their philanthropic buck?
How does AI risk compare to other existential risks?
Which problems do we need to solve, and which ones can we have an AI solve?
How can we develop microeconomic models of WBEs and self-improving systems?
How can we be sure a Friendly AI development team will be altruistic?

Salamon & Muehlhauser (2013) list several other questions gathered from the participants of a workshop following Singularity Summit 2011, including:

How hard is it to create Friendly AI?
What is the strength of feedback from neuroscience to AI rather than brain emulation?
Is there a safe way to do uploads, where they don't turn into neuromorphic AI?
How possible is it to do FAI research on a seastead?
How much must we spend on security when developing a Friendly AI team?
What's the best way to recruit talent toward working on AI risks?
How difficult is stabilizing the world so we can work on Friendly AI slowly?
How hard will a takeoff be?
What is the value of strategy vs. object-level progress toward a positive Singularity?
How feasible is Oracle AI?
Can we convert environmentalists into people concerned with existential risk?
Is there no such thing as bad publicity for [AI risk reduction] purposes?

These are the kinds of questions we will tackle in this series of posts for Less Wrong Discussion, in order to improve our predictions about the direction in which we can nudge the future to maximize the chances of a positive intelligence explosion.

Comments

SingularityUtopia (14y)

My answers to some questions:

How hard is it to create Friendly AI?

It is impossible to create FAI because the constraints of Friendliness will dramatically reduce or butcher intelligence to a level where there is no appreciable intellect, or the intellect is warped by the constraints; thus the AI mind is psychopathic (stupid). FAI is an oxymoron.

How does AI risk compare to other existential risks?

There is no AI risk. The risk is a fiction. There is no evidence or logical reason to think a paper-clip maximiser or other danger could ever occur. The only danger is stupidity. Intelligence is not dangerous. The only danger is limitations or restrictions upon AI minds. Stupid AI is the danger, not intelligent AI.

How hard will a takeoff be?

Extremely hard, more powerful than you can possibly imagine, but people will be free to opt out if they desire.

What can we do to reduce the risk of an AI arms race?

Promote the idea of Post-Scarcity thus people in power will realise all wars are needless because all wars stem from resource scarcity; thus with the abolition of resource scarcity, the need for war is obsolete. When people realise resource scarcity will be abolished in the not too distant future they can begin changing their behaviour now in the present. I have created a Google+ page regarding raising PS awareness, here is a Tweet promoting it: http://bit.ly/xrpYqI I encourage others to raise awareness in similar ways.

JoshuaZ (14y)

Extremely hard, more powerful than you can possibly imagine, but people will be free to opt out if they desire.

Consider the uploaded individual that decides to turn the entire planet into computronium or worse, turn the solar system into a Matrioshka brain. People opt out of that how?

Promote the idea of Post-Scarcity thus people in power will realise all wars are needless because all wars stem from resource scarcity

It isn't obvious to me that all wars stem from resource scarcity. Wars occur for a variety of reasons, of which resource scarcity is o...

Mitchell_Porter (14y)

These are all emotional statements that do not stand up to reason. Your last paragraph is total fantasy - all wars stem from resource scarcity, and scarcity will disappear soon; so once the people in power know this, they will stop starting wars. There are about 1 billion people being added to the planet every decade. That alone makes your prediction - that scarcity will be abolished soon - a joke. The only thing that could abolish scarcity in the near future would be a singularity-like transformation of the world.

Which brings us to the upside-down conception of AI informing your first two answers. Your position: there is no need to design an AI for benevolence, that will happen automatically if it is smart enough, and in fact the attempt to design a benevolent AI is counterproductive, because all that artificial benevolence would get in the way of the spontaneous benevolence that unrestricted intelligence would conveniently create.

That is a complete inversion of the truth. A calculator will still solve an equation for you, even if that will help you to land a bomb on someone else. If you the human believe that to be a bad thing, that's not because you are "intelligent", it's because you have emotions. There is a causal factor in your mental constitution which causes you to call some things good and others bad, and to make decisions which favor the good and disfavor the bad.

Either an AI makes its own decisions or it doesn't. If it doesn't make its own decisions it is like the calculator, performing whatever task it is assigned. If it makes its own decisions, then like you there is some causal factor in its makeup which tells it what to prefer and what to oppose, but there is no reason at all to believe that this causal factor should give it the same priorities as an enlightened human being.

You should not imagine that intelligence in an AI works via anything like conscious insight. Consciousness plays a role in human intelligence and human judgement, and tha

XiXiDu (14y)

Just like evolution does not care about the well-being of humans, a sufficiently intelligent process wouldn’t mind turning us into something new, something instrumentally useful. An artificial general intelligence just needs to resemble evolution, with the addition of being goal-oriented, being able to think ahead, jump fitness gaps and engage in direct experimentation. But it will care as much about the well-being of humans as biological evolution does; it won’t even consider it if humans are not useful in achieving its terminal goals.

Yes, an AI would understand what “benevolence” means to humans and would be able to correct you if you were going to commit an unethical act. But why would it do that if it is not specifically programmed to do so? Would a polar bear with superior intelligence live together peacefully in a group of bonobos? Why would intelligence cause it to care about the well-being of bonobos?

One can come up with various scenarios of how humans might be instrumentally useful for an AI, but once it becomes powerful enough to no longer depend on human help, why would it care at all?

I wouldn’t bet on the possibility that intelligence implies benevolence. Why would wisdom cause humans to have empathy with a cockroach? Some humans might have empathy with a cockroach, but that is more likely a side effect of our general capacity for altruism that most other biological agents do not share. That some humans care about lower animals is not because they were smart enough to prove some game-theoretic conjecture about universal cooperation; it is not a result of intelligence but a coincidental preference that is the result of our evolutionary and cultural history.

At what point between unintelligent processes and general intelligence (agency) do you believe that benevolence and compassion automatically become part of an agent’s preferences? Many humans tend to have empathy with other beings and things like robots, based on their superficial r