Will the world's elites navigate the creation of AI just fine?

lukeprog

One open question in AI risk strategy is: Can we trust the world's elite decision-makers (hereafter "elites") to navigate the creation of human-level AI (and beyond) just fine, without the kinds of special efforts that e.g. Bostrom and Yudkowsky think are needed?

Some reasons for concern include:

Otherwise smart people say unreasonable things about AI safety.
Many people who believed AI was around the corner didn't take safety very seriously.
Elites have failed to navigate many important issues wisely (2008 financial crisis, climate change, Iraq War, etc.), for a variety of reasons.
AI may arrive rather suddenly, leaving little time for preparation.

But if you were trying to argue for hope, you might argue along these lines (presented for the sake of argument; I don't actually endorse this argument):

If AI is preceded by visible signals, elites are likely to take safety measures. Effective measures were taken to address asteroid risk. Large resources are devoted to mitigating climate change risks. Personal and tribal selfishness align with AI risk-reduction in a way they may not align on climate change. Availability of information is increasing over time.
AI is likely to be preceded by visible signals. Conceptual insights often take years of incremental tweaking. In vision, speech, games, compression, robotics, and other fields, performance curves are mostly smooth. "Human-level performance at X" benchmarks influence perceptions and should be more exhaustive and come more rapidly as AI approaches. Recursive self-improvement capabilities could be charted, and are likely to be AI-complete. If AI succeeds, it will likely succeed for reasons comprehensible by the AI researchers of the time.
Therefore, safety measures will likely be taken.
If safety measures are taken, then elites will navigate the creation of AI just fine. Corporate and government leaders can use simple heuristics (e.g. Nobel prizes) to access the upper end of expert opinion. AI designs with easily tailored tendency to act may be the easiest to build. The use of early AIs to solve AI safety problems creates an attractor for "safe, powerful AI." Arms races are not insurmountable.

The basic structure of this 'argument for hope' is due to Carl Shulman, though he doesn't necessarily endorse the details. (Also, it's just a rough argument, and as stated is not deductively valid.)

Personally, I am not very comforted by this argument because:

Elites often fail to take effective action despite plenty of warning.
I think there's a >10% chance AI will not be preceded by visible signals.
I think the elites' safety measures will likely be insufficient.

Obviously, there's a lot more for me to spell out here, and some of it may be unclear. The reason I'm posting these thoughts in such a rough state is so that MIRI can get some help on our research into this question.

In particular, I'd like to know:

Which historical events are analogous to AI risk in some important ways? Possibilities include: nuclear weapons, climate change, recombinant DNA, nanotechnology, chlorofluorocarbons, asteroids, cyberterrorism, Spanish flu, the 2008 financial crisis, and large wars.
What are some good resources (e.g. books) for investigating the relevance of these analogies to AI risk (for the purposes of illuminating elites' likely response to AI risk)?
What are some good studies on elites' decision-making abilities in general?
Has the increasing availability of information in the past century noticeably improved elite decision-making?

More (#3) from Chaos:

Hubbard began using a computer to do what the orthodox techniques had not done. The computer would prove nothing. But at least it might unveil the truth so that a mathematician could know what it was he should try to prove. So Hubbard began to experiment. He treated Newton’s method not as a way of solving problems but as a problem in itself. Hubbard considered the simplest example of a degree-three polynomial, the equation x³ – 1 = 0. That is, find the cube root of 1. In real numbers, of course, there is just the trivial solution: 1. But the polynomial also has two complex solutions: –½ + i√3/2, and –½ – i√3/2. Plotted in the complex plane, these three roots mark an equilateral triangle, with one point at three o’clock, one at seven o’clock, and one at eleven o’clock. Given any complex number as a starting point, the question was to see which of the three solutions Newton’s method would lead to. It was as if Newton’s method were a dynamical system and the three solutions were three attractors. Or it was as if the complex plane were a smooth surface sloping down toward three deep valleys. A marble starting from anywhere on the plane should roll into one of the valleys—but which?

Hubbard set about sampling the infinitude of points that make up the plane. He had his computer sweep from point to point, calculating the flow of Newton’s method for each one, and color-coding the results. Starting points that led to one solution were all colored blue. Points that led to the second solution were red, and points that led to the third were green. In the crudest approximation, he found, the dynamics of Newton’s method did indeed divide the plane into three pie wedges. Generally the points near a particular solution led quickly into that solution. But systematic computer exploration showed complicated underlying organization that could never have been seen by earlier mathematicians, able only to calculate a point here and a point there. While some starting guesses converged quickly to a root, others bounced around seemingly at random before finally converging to a solution. Sometimes it seemed that a point could fall into a cycle that would repeat itself forever—a periodic cycle—without ever reaching one of the three solutions.

As Hubbard pushed his computer to explore the space in finer and finer detail, he and his students were bewildered by the picture that began to emerge. Instead of a neat ridge between the blue and red valleys, for example, he saw blotches of green, strung together like jewels. It was as if a marble, caught between the conflicting tugs of two nearby valleys, would end up in the third and most distant valley instead. A boundary between two colors never quite forms. On even closer inspection, the line between a green blotch and the blue valley proved to have patches of red. And so on—the boundary finally revealed to Hubbard a peculiar property that would seem bewildering even to someone familiar with Mandelbrot’s monstrous fractals: no point serves as a boundary between just two colors. Wherever two colors try to come together, the third always inserts itself, with a series of new, self-similar intrusions. Impossibly, every boundary point borders a region of each of the three colors.
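For anyone who wants to poke at this themselves, here is a minimal sketch of the kind of experiment the passage describes: run Newton's method on z³ – 1 = 0 from a grid of starting points and label each point by the root it reaches. It is not Hubbard's actual program. The function names (`newton_basin`, `render`), the grid size, the iteration cap, the tolerance, and the assignment of the letters B, R, G to particular roots are all my own assumptions for illustration.

```python
# The three cube roots of 1, i.e. the solutions of z**3 - 1 = 0.
ROOTS = [
    complex(1.0, 0.0),
    complex(-0.5, 3 ** 0.5 / 2),
    complex(-0.5, -(3 ** 0.5) / 2),
]


def newton_basin(z, max_iter=50, tol=1e-9):
    """Return the index (0, 1, 2) of the root that Newton's method reaches
    from starting point z, or None if it does not settle within max_iter
    steps. max_iter and tol are arbitrary illustrative choices."""
    for _ in range(max_iter):
        derivative = 3 * z * z
        if derivative == 0:
            return None  # Newton step is undefined where f'(z) = 0
        # Newton update for f(z) = z^3 - 1: z <- z - f(z) / f'(z)
        z = z - (z * z * z - 1) / derivative
        for index, root in enumerate(ROOTS):
            if abs(z - root) < tol:
                return index
    return None


def render(width=61, height=31, span=2.0):
    """Return a list of text rows covering the square [-span, span]^2.
    Each character marks which root the starting point led to:
    B, R, G for roots 0, 1, 2 (an assumed color assignment), ? for none."""
    symbols = "BRG"
    rows = []
    for j in range(height):
        y = span - 2 * span * j / (height - 1)  # top row has the largest y
        row = []
        for k in range(width):
            x = -span + 2 * span * k / (width - 1)
            index = newton_basin(complex(x, y))
            row.append("?" if index is None else symbols[index])
        rows.append("".join(row))
    return rows


if __name__ == "__main__":
    print("\n".join(render()))
```

Even at this coarse text resolution, the three rough pie wedges should be visible, with stray letters of the third kind appearing along the lines where two wedges meet. Increasing `width` and `height` (or shrinking `span` around a boundary point) is the crude analogue of Hubbard pushing his computer to finer and finer detail.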

And:

For... Peitgen the study of complexity provided a chance to create new traditions in science instead of just solving problems. “In a brand new area like this one, you can start thinking today and if you are a good scientist you might be able to come up with interesting solutions in a few days or a week or a month,” Peitgen said. The subject is unstructured.

“In a structured subject, it is known what is known, what is unknown, what people have already tried and doesn’t lead anywhere. There you have to work on a problem which is known to be a problem, otherwise you get lost. But a problem which is known to be a problem must be hard, otherwise it would already have been solved.”

Peitgen shared little of the mathematicians’ unease with the use of computers to conduct experiments. Granted, every result must eventually be made rigorous by the standard methods of proof, or it would not be mathematics. To see an image on a graphics screen does not guarantee its existence in the language of theorem and proof. But the very availability of that image was enough to change the evolution of mathematics. Computer exploration was giving mathematicians the freedom to take a more natural path, Peitgen believed. Temporarily, for the moment, a mathematician could suspend the requirement of rigorous proof. He could go wherever experiments might lead him, just as a physicist could. The numerical power of computation and the visual cues to intuition would suggest promising avenues and spare the mathematician blind alleys. Then, new paths having been found and new objects isolated, a mathematician could return to standard proofs. “Rigor is the strength of mathematics,” Peitgen said. “That we can continue a line of thought which is absolutely guaranteed — mathematicians never want to give that up. But you can look at situations that can be understood partially now and with rigor perhaps in future generations. Rigor, yes, but not to the extent that I drop something just because I can’t do it now.”
