RSI capabilities could be charted, and are likely to be AI-complete.
What does RSI stand for?
Lately I've been listening to audiobooks (at 2x speed) in my down time, especially ones that seem likely to have passages relevant to the question of how well policy-makers will deal with AGI, basically continuing this project but only doing the "collection" stage, not the "analysis" stage.
I'll post quotes from the audiobooks I listen to as replies to this comment.
More (#3) from Better Angels of Our Nature:
...let’s have a look at political discourse, which most people believe has been getting dumb and dumber. There’s no such thing as the IQ of a speech, but Tetlock and other political psychologists have identified a variable called integrative complexity that captures a sense of intellectual balance, nuance, and sophistication. A passage that is low in integrative complexity stakes out an opinion and relentlessly hammers it home, without nuance or qualification. Its minimal complexity can be quantified by counting words like absolutely, always, certainly, definitively, entirely, forever, indisputable, irrefutable, undoubtedly, and unquestionably. A passage gets credit for some degree of integrative complexity if it shows a touch of subtlety with words like usually, almost, but, however, and maybe. It is rated higher if it acknowledges two points of view, higher still if it discusses connections, tradeoffs, or compromises between them, and highest of all if it explains these relationships by reference to a higher principle or system. The integrative complexity of a passage is not the same as the intelligence of the person who wrote it, but the
Okay. In this comment I'll keep an updated list of audiobooks I've heard since Sept. 2013, for those who are interested. All audiobooks are available via iTunes/Audible unless otherwise noted.
Outstanding:
Worthwhile if you care about the subject matter:
A process for turning ebooks into audiobooks for personal use, at least on Mac:
Personal and tribal selfishness align with AI risk-reduction in a way they may not align on climate change.
This seems obviously false. Local expenditures - of money, pride, possibility of not being the first to publish, etc. - are still local, global penalties are still global. Incentives are misaligned in exactly the same way as for climate change.
RSI capabilities could be charted, and are likely to be AI-complete.
This is to be taken as an arguendo, not as the author's opinion, right? See IEM on the minimal conditions for takeoff. Albeit if &q...
(I don't have answers to your specific questions, but here are some thoughts about the general problem.)
I agree with most of you said. I also assign significant probability mass to most parts of the argument for hope (but haven't thought about this enough to put numbers on this), though I too am not comforted on these parts because I also assign non-small chance to them going wrong. E.g., I have hope for "if AI is visible [and, I add, AI risk is understood] then authorities/elites will be taking safety measures".
That said, there are some steps in...
I personally am optimistic about the world's elites navigating AI risk as well as possible subject to inherent human limitations that I would expect everybody to have, and the inherent risk. Some points:
I've been surprised by people's ability to avert bad outcomes. Only two nuclear weapons have been used since nuclear weapons were developed, despite the fact that there are 10,000+ nuclear weapons around the world. Political leaders are assassinated very infrequently relative to how often one might expect a priori.
AI risk is a Global Catastrophic Risk i
The argument from hope or towards hope or anything but despair and grit is misplaced when dealing with risks of this magnitude.
Don't trust God (or semi-competent world leaders) to make everything magically turn out all right. The temptation to do so is either a rationalization of wanting to do nothing, or based on a profoundly miscalibrated optimism for how the world works.
/doom
I think there's a >15% chance AI will not be preceded by visible signals.
Aren't we seeing "visible signals" already? Machines are better than humans at lots of intelligence-related tasks today.
Which historical events are analogous to AI risk in some important ways? Possibilities include: nuclear weapons, climate change, recombinant DNA, nanotechnology, chloroflourocarbons, asteroids, cyberterrorism, Spanish flu, the 2008 financial crisis, and large wars.
Cryptography and cryptanalysis are obvious precursors of supposedly-dangerous tech within IT.
Looking at their story, we can plausibly expect governments to attempt to delay the development of "weaponizable" technology by others.
These days, cryptography facilitates international trade. It seems like a mostly-positive force overall.
One question is whether AI is like CFCs, or like CO2, or like hacking.
With CFCs, the solution was simple: ban CFCs. The cost was relatively low, and the benefit relatively high.
With CO2, the solution is equally simple: cap and trade. It's just not politically palatable, because the problem is slower-moving, and the cost would be much, much greater (perhaps great enough to really mess up the world economy). So, we're left with the second-best solution: do nothing. People will die, but the economy will keep growing, which might balance that out, because ...
Here are my reasons for pessimism:
There are likely to be effective methods of controlling AIs that are of subhuman or even roughly human-level intelligence which do not scale up to superhuman intelligence. These include for example reinforcement by reward/punishment, mutually beneficial trading, legal institutions. Controlling superhuman intelligence will likely require qualitatively different methods, such as having the superintelligence share our values. Unfortunately the existence of effective but unscalable methods of AI control will probably lull el
Congress' non-responsiveness to risks to critical infrastructure from geomagnetic storms, despite scientific consensus on the issue, is also worrying.
Even if one organization navigates the creation of friendly AI successfully, won't we still have to worry about preventing anyone from ever creating an unsafe AI?
Unlike nuclear weapons, a single AI might have world ending consequences, and an AI requires no special resources. Theoretically a seed AI could be uploaded to Pirate Bay, from where anyone could download and compile it.
The use of early AIs to solve AI safety problems creates an attractor for "safe, powerful AI."
What kind of "AI safety problems" are we talking about here? If they are like the "FAI Open Problems" that Eliezer has been posting, they would require philosophers of the highest (perhaps even super-human) caliber to solve. How could "early AIs" be of much help?
If "AI safety problems" here do not refer to FAI problems, then how do those problems get solved, according to this argument?
@Lukeprog, can you
(1) update us on your working answers the posed questions in brief? (2) your current confidence (and if you would like to, by proxy, MIRI's as an organisation's confidence in each of the 3:
Elites often fail to take effective action despite plenty of warning.
I think there's a >10% chance AI will not be preceded by visible signals.
I think the elites' safety measures will likely be insufficient.
Thank you for your diligence.
There's another reason for hope in this above global warming: The idea of a dangerous AI is already common in the public eye as "things we need to be careful about." A big problem the global warming movement had, and is still having, is convincing the public that it's a threat in the first place.
Who do you mean by "elites". Keep in mind that major disruptive technical progress of the type likely to precede the creation of a full AGI tends to cause the type of social change that shakes up the social hierarchy.
Combining the beginning and the end of your questions reveals an answer.
Can we trust the world's elite decision-makers (hereafter "elites") to navigate the creation of [nuclear weapons, climate change, recombinant DNA, nanotechnology, chloroflourocarbons, asteroids, cyberterrorism, Spanish flu, the 2008 financial crisis, and large wars] just fine?
Answer how just fine any of these are any you have analogous answers.
You might also clarify whether you are interested in what is just fine for everyone, or just fine for the elites, or just fine for the AI in question. The answer will change accordingly.
More (#3) from Chaos:
Hubbard began using a computer to do what the orthodox techniques had not done. The computer would prove nothing. But at least it might unveil the truth so that a mathematician could know what it was he should try to prove. So Hubbard began to experiment. He treated Newton’s method not as a way of solving problems but as a problem in itself. Hubbard considered the simplest example of a degree-three polynomial, the equation x3– 1 =0. That is, find the cube root of 1. In real numbers, of course, there is just the trivial solution: 1. But the polynomial also has two complex solutions: –½ + i√3/2, and –½ – i√3/2. Plotted in the complex plane, these three roots mark an equilateral triangle, with one point at three o’clock, one at seven o’clock, and one at eleven o’clock. Given any complex number as a starting point, the question was to see which of the three solutions Newton’s method would lead to. It was as if Newton’s method were a dynamical system and the three solutions were three attractors. Or it was as if the complex plane were a smooth surface sloping down toward three deep valleys. A marble starting from anywhere on the plane should roll into one of the valleys—but which?
Hubbard set about sampling the infinitude of points that make up the plane. He had his computer sweep from point to point, calculating the flow of Newton’s method for each one, and color-coding the results. Starting points that led to one solution were all colored blue. Points that led to the second solution were red, and points that led to the third were green. In the crudest approximation, he found, the dynamics of Newton’s method did indeed divide the plane into three pie wedges. Generally the points near a particular solution led quickly into that solution. But systematic computer exploration showed complicated underlying organization that could never have been seen by earlier mathematicians, able only to calculate a point here and a point there. While some starting guesses converged quickly to a root, others bounced around seemingly at random before finally converging to a solution. Sometimes it seemed that a point could fall into a cycle that would repeat itself forever—a periodic cycle—without ever reaching one of the three solutions.
As Hubbard pushed his computer to explore the space in finer and finer detail, he and his students were bewildered by the picture that began to emerge. Instead of a neat ridge between the blue and red valleys, for example, he saw blotches of green, strung together like jewels. It was as if a marble, caught between the conflicting tugs of two nearby valleys, would end up in the third and most distant valley instead. A boundary between two colors never quite forms. On even closer inspection, the line between a green blotch and the blue valley proved to have patches of red. And so on—the boundary finally revealed to Hubbard a peculiar property that would seem bewildering even to someone familiar with Mandelbrot’s monstrous fractals: no point serves as a boundary between just two colors. Wherever two colors try to come together, the third always inserts itself, with a series of new, self-similar intrusions. Impossibly, every boundary point borders a region of each of the three colors.
And:
For... Peitgen the study of complexity provided a chance to create new traditions in science instead of just solving problems. “In a brand new area like this one, you can start thinking today and if you are a good scientist you might be able to come up with interesting solutions in a few days or a week or a month,” Peitgen said. The subject is unstructured.
“In a structured subject, it is known what is known, what is unknown, what people have already tried and doesn’t lead anywhere. There you have to work on a problem which is known to be a problem, otherwise you get lost. But a problem which is known to be a problem must be hard, otherwise it would already have been solved.”
Peitgen shared little of the mathematicians’ unease with the use of computers to conduct experiments. Granted, every result must eventually be made rigorous by the standard methods of proof, or it would not be mathematics. To see an image on a graphics screen does not guarantee its existence in the language of theorem and proof. But the very availability of that image was enough to change the evolution of mathematics. Computer exploration was giving mathematicians the freedom to take a more natural path, Peitgen believed. Temporarily, for the moment, a mathematician could suspend the requirement of rigorous proof. He could go wherever experiments might lead him, just as a physicist could. The numerical power of computation and the visual cues to intuition would suggest promising avenues and spare the mathematician blind alleys. Then, new paths having been found and new objects isolated, a mathematician could return to standard proofs. “Rigor is the strength of mathematics,” Peitgen said. “That we can continue a line of thought which is absolutely guaranteed — mathematicians never want to give that up. But you can look at situations that can be understood partially now and with rigor perhaps in future generations. Rigor, yes, but not to the extent that I drop something just because I can’t do it now.”
One open question in AI risk strategy is: Can we trust the world's elite decision-makers (hereafter "elites") to navigate the creation of human-level AI (and beyond) just fine, without the kinds of special efforts that e.g. Bostrom and Yudkowsky think are needed?
Some reasons for concern include:
But if you were trying to argue for hope, you might argue along these lines (presented for the sake of argument; I don't actually endorse this argument):
The basic structure of this 'argument for hope' is due to Carl Shulman, though he doesn't necessarily endorse the details. (Also, it's just a rough argument, and as stated is not deductively valid.)
Personally, I am not very comforted by this argument because:
Obviously, there's a lot more for me to spell out here, and some of it may be unclear. The reason I'm posting these thoughts in such a rough state is so that MIRI can get some help on our research into this question.
In particular, I'd like to know: