<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:media="http://search.yahoo.com/mrss/">
<channel>
<title>
Articles Tagged ‘siai’ - Less Wrong
</title> <link>http://lesswrong.com/</link>
<description></description>
<item>
<title>SI's Summer 2012 Matching Drive Ends July 31st</title>
<link>http://lesswrong.com/lw/dp8/sis_summer_2012_matching_drive_ends_july_31st/</link>
<guid isPermaLink="true">http://lesswrong.com/lw/dp8/sis_summer_2012_matching_drive_ends_july_31st/</guid>
<pubDate>Fri, 20 Jul 2012 15:48:12 +1000</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/lukeprog"&gt;lukeprog&lt;/a&gt;
&amp;bull;
13 votes
&amp;bull;
&lt;a href="http://lesswrong.com/lw/dp8/sis_summer_2012_matching_drive_ends_july_31st/#comments"&gt;17 comments&lt;/a&gt;
&lt;div&gt;&lt;p&gt;The Singularity Institute's &lt;a href=&quot;http://singularity.org/blog/2012/07/03/summer-challenge/&quot;&gt;summer 2012 matching drive&lt;/a&gt; ends on July 31st! Donate by the end of the month to have your gift matched, dollar for dollar.&lt;/p&gt;
&lt;p&gt;As of this posting, SI has raised $70,000 of the $150,000 goal.&lt;/p&gt;
&lt;p&gt;The announcement &lt;a href=&quot;http://singularity.org/blog/2012/07/03/summer-challenge/&quot;&gt;says&lt;/a&gt;:&lt;/p&gt;
&lt;blockquote&gt;Since we published our &lt;a href=&quot;/files/strategicplan20112.pdf&quot;&gt;strategic plan&lt;/a&gt; in August 2011, we have &lt;a href=&quot;/r/discussion/lw/dm9/revisiting_sis_2011_strategic_plan_how_are_we/#summary&quot;&gt;achieved most of the near-term goals outlined therein&lt;/a&gt;... &lt;ul&gt;
&lt;/ul&gt;
In the coming year, the &lt;strong&gt;Singularity Institute plans to do the following&lt;/strong&gt;: &lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Hold our annual &lt;a href=&quot;http://singularitysummit.com/&quot;&gt;Singularity Summit&lt;/a&gt;&lt;/strong&gt;, this year in San Francisco!&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Spin off the &lt;a href=&quot;http://www.appliedrationality.org/&quot;&gt;Center for Applied Rationality&lt;/a&gt;&lt;/strong&gt; as a separate organization focused on rationality training, so that the Singularity Institute can be focused more exclusively on Singularity research and outreach.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Publish additional &lt;a href=&quot;/research/&quot;&gt;research&lt;/a&gt;&lt;/strong&gt; on AI risk and Friendly AI.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Eliezer will write an &quot;Open Problems in Friendly AI&quot; sequence&lt;/strong&gt; for &lt;em&gt;Less Wrong&lt;/em&gt;. &lt;small&gt;(For news on his rationality books, see &lt;a href=&quot;/lw/d06/intellectual_insularity_and_productivity/6swt&quot;&gt;here&lt;/a&gt;.)&lt;/small&gt;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Finish &lt;em&gt;&lt;a href=&quot;http://facingthesingularity.com/&quot;&gt;Facing the Singularity&lt;/a&gt;&lt;/em&gt;&lt;/strong&gt; and publish ebook versions of &lt;em&gt;Facing the Singularity&lt;/em&gt; and &lt;em&gt;&lt;a href=&quot;http://wiki.lesswrong.com/wiki/Sequences&quot;&gt;The Sequences, 2006-2009&lt;/a&gt;&lt;/em&gt;.&lt;/li&gt;
&lt;li&gt;And much more! For details on what we might do with additional funding, see &lt;a href=&quot;/lw/cs6/how_to_purchase_ai_risk_reduction/&quot;&gt;How to Purchase AI Risk Reduction&lt;/a&gt;.&lt;/li&gt;
&lt;/ul&gt;
If you're planning to earmark your donation to CFAR (Center for Applied Rationality), here's a preview of &lt;strong&gt;what CFAR plans to do in the next year&lt;/strong&gt;: &lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Develop additional lessons&lt;/strong&gt; teaching the most important and useful parts of rationality. CFAR has already developed and tested &lt;em&gt;over 18 hours of lessons&lt;/em&gt; so far, including classes on how to evaluate evidence using Bayesianism, how to make more accurate predictions, how to be more efficient using economics, how to use thought experiments to better understand your own motivations, and much more.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Run immersive rationality retreats&lt;/strong&gt; to teach from our curriculum and to connect aspiring rationalists with each other. CFAR ran pilot retreats in May and June. Participants in the May retreat called it &amp;#x201C;transformative&amp;#x201D; and &amp;#x201C;astonishing,&amp;#x201D; and the average response on the survey question, &amp;#x201C;Are you glad you came? (1-10)&amp;#x201D; was a 9.4. (We don't have the June data yet, but people were similarly enthusiastic about that one.)&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Run SPARC, a camp on the advanced math of rationality&lt;/strong&gt; for mathematically gifted high school students. CFAR has a stellar first-year class for SPARC 2012; most students admitted to the program placed in the top 50 on the USA Math Olympiad (or performed equivalently in a similar contest).&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Collect longitudinal data on the effects of rationality training&lt;/strong&gt;, to improve our curriculum and to generate promising hypotheses to test and publish, in collaboration with other researchers. CFAR has already launched a one-year randomized controlled study tracking reasoning ability and various metrics of life success, using participants in our June minicamp and a control group.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Develop apps and games about rationality&lt;/strong&gt;, with the dual goals of (a) helping aspiring rationalists practice essential skills, and (b) making rationality fun and intriguing to a much wider audience. CFAR has two apps in beta testing: one training players to update their own beliefs the right amount after hearing other people&amp;#x2019;s beliefs, and another training players to calibrate their level of confidence in their own beliefs. CFAR is working with a developer on several more games training people to avoid cognitive biases.&lt;/li&gt;
&lt;li&gt;And more!&lt;a id=&quot;more&quot;&gt;&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/blockquote&gt;
&lt;p&gt;In &lt;a href=&quot;/lw/dm9/revisiting_sis_2011_strategic_plan_how_are_we/&quot;&gt;another post&lt;/a&gt;, I compared the goals in our &lt;a href=&quot;http://singularity.org/files/strategicplan20112.pdf&quot;&gt;August 2011 strategic plan&lt;/a&gt; to our current situation, summarizing:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;That's it for the main list! Now let's check in on what we said &lt;strong&gt;our top priorities for 2011-2012&lt;/strong&gt;&amp;#xA0;were:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;&lt;em&gt;Public-facing research on creating a positive singularity&lt;/em&gt;. Check.&amp;#xA0;&lt;a href=&quot;/lw/axr/three_new_papers_on_ai_risk/627o&quot;&gt;SI has more peer-reviewed publications in 2012 than in all past years combined&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Outreach / education / fundraising&lt;/em&gt;. Check. Especially, through &lt;a href=&quot;http://appliedrationality.org/&quot;&gt;CFAR&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Improved organizational effectiveness&lt;/em&gt;. Check. &lt;a href=&quot;/lw/cbs/thoughts_on_the_singularity_institute_si/6jzn&quot;&gt;Lots of good progress&lt;/a&gt; on this.&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Singularity Summit&lt;/em&gt;. &lt;a href=&quot;http://singularitysummit.com/&quot;&gt;Check&lt;/a&gt;.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;In summary, I think SI is a bit behind where I hoped we'd be by now, though this is largely because we've poured so much into launching &lt;a href=&quot;http://appliedrationality.org/&quot;&gt;CFAR&lt;/a&gt;, and as a result, CFAR has turned out to be significantly more cool at launch than I had anticipated.&lt;/p&gt;
&lt;p&gt;Fundraising has been a challenge. One donor failed to actually give their $46,000 pledge despite repeated reminders and requests, and our support base is (understandably) anxious to see a shift from movement-building work to FAI research, a shift I have been fighting for since I was made Executive Director. (Note that spinning off rationality work to CFAR is a substantial part of trimming SI down into being primarily an FAI research institute.)&lt;/p&gt;
&lt;p&gt;Reforming SI into a more efficient, effective organization has been my greatest challenge. Frankly, SI was in &lt;a href=&quot;/lw/cbs/thoughts_on_the_singularity_institute_si/6l4h&quot;&gt;pretty bad shape&lt;/a&gt; when Louie and I arrived as interns in April 2011, and there have been an incredible number of holes to dig SI out of &amp;#x2014; and several more remain. (In contrast, it has been a &lt;em&gt;joy&lt;/em&gt;&amp;#xA0;to help set up CFAR properly &lt;em&gt;from the very beginning&lt;/em&gt;, with all the right organizational tools and processes in place.) Reforming SI presents a fundraising problem, because reforming SI is time consuming and sometimes costly, but is generally unexciting to donors.&lt;/p&gt;
&lt;p&gt;I can see the light at the end of the tunnel, though. We won't reach it if we can't improve our fundraising success in the next 3-6 months, but it's close enough that I can see it. SI's path forward, from my point of view, looks like this:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;We finish launching CFAR, which takes over the rationality work SI was doing. (Before January 2013.)&lt;/li&gt;
&lt;li&gt;We change how the Singularity Summit is planned and run so that it pulls our core staff away from core mission work to a lesser degree. (Before January 2013.)&lt;/li&gt;
&lt;li&gt;Eliezer writes the &quot;Open Problems in Friendly AI&quot; sequence. (Before January 2013.)&lt;/li&gt;
&lt;li&gt;We hire 1-2 researchers to produce technical write-ups from &lt;a href=&quot;http://singularity.org/files/TDT.pdf&quot;&gt;Eliezer's TDT article&lt;/a&gt; and from his &quot;Open Problems in Friendly AI&quot; sequence. (Beginning September 2012, except that right now we don't have the cash to hire the 1-2 people who I know who&amp;#xA0;&lt;em&gt;could&lt;/em&gt;&amp;#xA0;do this and who &lt;em&gt;want&lt;/em&gt;&amp;#xA0;to do this as soon as we have the money to hire them.)&lt;/li&gt;
&lt;li&gt;With the &quot;Open FAI Problems&quot; sequence and the technical write-ups in hand, we greatly expand our efforts to show math/compsci researchers that there is a tractable, technical research program in FAI theory, and as a result some researchers work on the sexiest of these problems from their departments, and some other math researchers take more seriously the prospect of being &lt;em&gt;hired&lt;/em&gt; by SI to do technical research in FAI theory. (Beginning, roughly, in April 2013.)&amp;#xA0;Also: There won't be classes on x-risk at &lt;a href=&quot;http://appliedrationality.org/sparc.html&quot;&gt;SPARC&lt;/a&gt;&amp;#xA0;(rationality camp for young elite math talent), but some SPARC students might end up being interested in FAI stuff by osmosis.&amp;#xA0;&lt;/li&gt;
&lt;li&gt;With a more tightly honed SI, improved fundraising practices, and visible mission-central research happening, SI is able to attract more funding and hire even more FAI researchers. (Beginning, roughly, in September 2013.)&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;If you want to help us make this happen, please &lt;a href=&quot;http://singularity.org/blog/2012/07/03/summer-challenge/&quot;&gt;donate during our July matching drive!&lt;/a&gt;&lt;/p&gt;
&lt;/blockquote&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/lw/dp8/sis_summer_2012_matching_drive_ends_july_31st/#comments"&gt;17 comments&lt;/a&gt;
</description>
</item>
<item>
<title>Reply to Holden on The Singularity Institute</title>
<link>http://lesswrong.com/lw/di4/reply_to_holden_on_the_singularity_institute/</link>
<guid isPermaLink="true">http://lesswrong.com/lw/di4/reply_to_holden_on_the_singularity_institute/</guid>
<pubDate>Wed, 11 Jul 2012 09:20:18 +1000</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/lukeprog"&gt;lukeprog&lt;/a&gt;
&amp;bull;
43 votes
&amp;bull;
&lt;a href="http://lesswrong.com/lw/di4/reply_to_holden_on_the_singularity_institute/#comments"&gt;211 comments&lt;/a&gt;
&lt;div&gt;&lt;p&gt;Holden Karnofsky of &lt;a href=&quot;http://www.givewell.org/&quot;&gt;GiveWell&lt;/a&gt; has &lt;a href=&quot;/lw/cbs/thoughts_on_the_singularity_institute_si/&quot;&gt;objected&lt;/a&gt; to the Singularity Institute (SI) as a target for &lt;a href=&quot;/lw/3gj/efficient_charity_do_unto_others/&quot;&gt;optimal&lt;/a&gt; &lt;a href=&quot;/lw/6py/optimal_philanthropy_for_human_beings/&quot;&gt;philanthropy&lt;/a&gt;. As someone who thinks that existential risk reduction is really important and also that the Singularity Institute is an important target of optimal philanthropy, I would like to explain why I disagree with Holden on these subjects. (I am also SI's Executive Director.)&lt;/p&gt;
&lt;p&gt;Mostly, I'd like to explain my views to a broad audience. But I'd also like to explain my views to Holden himself. I value Holden's work, I enjoy interacting with him, and I think he is both intelligent &lt;em&gt;and&lt;/em&gt; capable of changing his mind about Big Things like this. Hopefully Holden and I can continue to work through the arguments together, though of course we are both busy with many other things.&lt;/p&gt;
&lt;p&gt;I appreciate the clarity and substance of &lt;a href=&quot;/lw/cbs/thoughts_on_the_singularity_institute_si/&quot;&gt;Holden's objections&lt;/a&gt;, and I hope to reply in kind. I begin with an overview of some basic points that may be familiar to most &lt;em&gt;Less Wrong&lt;/em&gt; veterans, and then I reply point-by-point to Holden's post. In the final section, I summarize my reply to Holden.&lt;/p&gt;
&lt;p&gt;Holden raised &lt;em&gt;many&lt;/em&gt; different issues, so unfortunately this post needed to be &lt;em&gt;long&lt;/em&gt;. My apologies to Holden if I have misinterpreted him at any point.&lt;/p&gt;
&lt;h2 id=&quot;contents&quot;&gt;&lt;br&gt;&lt;/h2&gt;
&lt;h2&gt;Contents&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Existential risk reduction is a critical concern for many people, given their values and given many plausible models of the future. &lt;a href=&quot;/lw/di4/reply_to_holden_on_the_singularity_institute/#x-risk&quot;&gt;Details here&lt;/a&gt;. &lt;/li&gt;
&lt;li&gt;Among existential risks, AI risk is probably the most important. &lt;a href=&quot;/lw/di4/reply_to_holden_on_the_singularity_institute/#ai-risk&quot;&gt;Details here&lt;/a&gt;. &lt;/li&gt;
&lt;li&gt;SI can purchase many kinds of AI risk reduction more efficiently than other groups can. &lt;a href=&quot;/lw/di4/reply_to_holden_on_the_singularity_institute/#si&quot;&gt;Details here&lt;/a&gt;. &lt;/li&gt;
&lt;li&gt;These points and many others weigh against many of Holden's claims and conclusions. &lt;a href=&quot;/lw/di4/reply_to_holden_on_the_singularity_institute/#Holden&quot;&gt;Details here&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;/lw/di4/reply_to_holden_on_the_singularity_institute/#summary&quot;&gt;Summary of my reply to Holden&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&quot;comments&quot;&gt;&lt;a id=&quot;more&quot;&gt;&lt;/a&gt;&lt;br&gt;&lt;/h2&gt;
&lt;h2&gt;Comments&lt;/h2&gt;
&lt;p&gt;I must be brief, so while reading this post I am sure many objections will leap to your mind. To encourage &lt;em&gt;constructive&lt;/em&gt; discussion on this post, &lt;strong&gt;each question (posted as a comment on this page) that follows the template described below will receive a reply from myself or another SI representative.&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Please word your question as clearly and succinctly as possible, and don't assume your readers will have read this post before reading your question (because: the conversations here may be used as source material for a comprehensive FAQ).&lt;/p&gt;
&lt;p&gt;Here's an example of how you could word the first paragraph of your question: &quot;You claimed that [insert direct quote here], and also that [insert another direct quote here]. That seems to imply that [something something]. But that doesn't seem to take into account that [blah blah blah]. What do you think of that?&quot;&lt;/p&gt;
&lt;p&gt;If your question needs more explaining, leave the details to &lt;em&gt;subsequent&lt;/em&gt; paragraphs in your comment. Please post multiple questions as multiple comments, so they can be voted upon and replied to individually. If you don't follow these rules, I can't guarantee SI will have time to give you a reply. (We probably &lt;em&gt;won't&lt;/em&gt;.)&lt;/p&gt;
&lt;h2 id=&quot;x-risk&quot;&gt;&lt;br&gt;&lt;/h2&gt;
&lt;h2&gt;Why many people care greatly about existential risk reduction&lt;/h2&gt;
&lt;p&gt;Why do many people consider existential risk reduction to be humanity's most important task? I can't say it much better than &lt;a href=&quot;http://www.existential-risk.org/concept.pdf&quot;&gt;Nick Bostrom does&lt;/a&gt;, so I'll just quote him:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;An existential risk is one that threatens the premature extinction of Earth-originating intelligent life or the permanent and drastic destruction of its potential for desirable future development. Although it is often difficult to assess the probability of existential risks, there are many reasons to suppose that the total such risk confronting humanity over the next few centuries is significant...&lt;/p&gt;
&lt;p&gt;Humanity has survived what we might call &lt;em&gt;natural existential risks&lt;/em&gt; [asteroid impacts, gamma ray bursts, etc.] for hundreds of thousands of years; thus it is prima facie unlikely that any of them will do us in within the next hundred...&lt;/p&gt;
&lt;p&gt;In contrast, our species is introducing entirely new kinds of existential risk&amp;#x2014;threats we have no track record of surviving... In particular, most of the biggest existential risks seem to be linked to potential future technological breakthroughs that may radically expand our ability to manipulate the external world or our own biology. As our powers expand, so will the scale of their potential consequences&amp;#x2014;intended and unintended, positive and negative. For example, there appear to be significant existential risks in some of the advanced forms of biotechnology, molecular nanotechnology, and machine intelligence that might be developed in the decades ahead.&lt;/p&gt;
&lt;p&gt;What makes existential catastrophes especially bad is not that they would [cause] a precipitous drop in world population or average quality of life. Instead, their significance lies primarily in the fact that they would destroy the future... To calculate the loss associated with an existential catastrophe, we must consider how much value would come to exist in its absence. It turns out that the ultimate potential for Earth-originating intelligent life is literally astronomical.&lt;/p&gt;
&lt;p&gt;One gets a large number even if one confines one&amp;#x2019;s consideration to the potential for biological human beings living on Earth. If we suppose... that our planet will remain habitable for at least another billion years, and we assume that at least one billion people could live on it sustainably, then the potential exist for at least 10&lt;sup&gt;18&lt;/sup&gt; human lives. [The numbers get &lt;em&gt;way&lt;/em&gt; bigger if you consider the expansion of posthuman civilization to the rest of the galaxy or the prospect of &lt;a href=&quot;http://en.wikipedia.org/wiki/Whole_brain_emulation&quot;&gt;mind uploading&lt;/a&gt;.]&lt;/p&gt;
&lt;p&gt;Even if we use the most conservative of these estimates, which entirely ignores the possibility of space colonization and software minds, we find that the expected loss of an existential catastrophe is greater than the value of 10&lt;sup&gt;16&lt;/sup&gt; human lives...&lt;/p&gt;
&lt;p&gt;These considerations suggest that the loss in expected value resulting from an existential catastrophe is so enormous that the objective of reducing existential risks should be a dominant consideration whenever we act out of an impersonal concern for humankind as a whole.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;I refer the reader to &lt;a href=&quot;http://www.existential-risk.org/concept.pdf&quot;&gt;Bostrom's paper&lt;/a&gt; for further details and additional arguments, but neither his paper nor this post can answer every objection one might think of.&lt;/p&gt;
&lt;p&gt;Nor can I summarize all the arguments and evidence related to estimating the severity and time horizon of every proposed existential risk. Even the 500+ pages of Oxford University Press' &lt;em&gt;&lt;a href=&quot;http://www.amazon.com/Global-Catastrophic-Risks-Nick-Bostrom/dp/0199606501/&quot;&gt;Global Catastrophic Risks&lt;/a&gt;&lt;/em&gt; can barely scratch the surface of this enormous topic. As explained in &lt;a href=&quot;http://singularity.org/files/IE-EI.pdf&quot;&gt;Intelligence Explosion: Evidence and Import&lt;/a&gt;, predicting long-term technological progress is &lt;em&gt;hard&lt;/em&gt;. Thus, we must&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;examine convergent outcomes that&amp;#x2014;like the evolution of eyes or the emergence of markets&amp;#x2014;can come about through any of several different paths and can gather momentum once they begin.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;I'll say more about convergent outcomes later, but for now I'd just like to suggest that:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;Many humans living today value both current and future people enough that &lt;em&gt;if&lt;/em&gt; existential catastrophe is plausible this century, then upon reflection (e.g. after counteracting their unconscious, default &lt;a href=&quot;/lw/hw/scope_insensitivity/&quot;&gt;scope insensitivity&lt;/a&gt;) they would conclude that reducing the risk of existential catastrophe is the most valuable thing they can do &amp;#x2014; whether through direct work or by &lt;a href=&quot;http://singularity.org/donate/&quot;&gt;donating&lt;/a&gt; to support direct work. It is to &lt;em&gt;these&lt;/em&gt; people I appeal. (I also have much to say to people who e.g. &lt;em&gt;don't&lt;/em&gt; care about future people, but it is too much to say here and now.)&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;As it turns out, we &lt;em&gt;do&lt;/em&gt; have good reason to believe that existential catastrophe is plausible this century.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;I don't have the space here to discuss the likelihood of different kinds of existential catastrophe that could plausibly occur this century (see &lt;em&gt;&lt;a href=&quot;http://www.amazon.com/Global-Catastrophic-Risks-Nick-Bostrom/dp/0199606501/&quot;&gt;GCR&lt;/a&gt;&lt;/em&gt; for more details), so instead I'll talk about just one of them: an AI catastrophe.&lt;/p&gt;
&lt;h2 id=&quot;ai-risk&quot;&gt;&lt;br&gt;&lt;/h2&gt;
&lt;h2&gt;AI risk: the most important existential risk&lt;/h2&gt;
&lt;p&gt;There are two primary reasons I think AI is the most important existential risk:&lt;/p&gt;
&lt;p&gt;Reason 1: &lt;strong&gt;Mitigating AI risk could mitigate all other existential risks, but not vice-versa.&lt;/strong&gt; There is an asymmetry between AI risk and other existential risks. If we mitigate the risks from (say) synthetic biology and nanotechnology (without building Friendly AI), this only means we have bought a few years or decades for ourselves before we must face yet another existential risk from powerful new technologies. But if we manage &lt;em&gt;AI risk&lt;/em&gt; well enough (i.e. if we build a &lt;a href=&quot;http://en.wikipedia.org/wiki/Friendly_artificial_intelligence&quot;&gt;Friendly AI&lt;/a&gt; or &quot;FAI&quot;), we may be able to &quot;permanently&quot; (for several billion years) secure a desirable future. Machine superintelligence working in the service of humane goals could use its intelligence and resources to prevent all other existential catastrophes. (&lt;a href=&quot;http://singularity.org/files/CEV.pdf&quot;&gt;Eliezer&lt;/a&gt;: &quot;I distinguish 'human', that which we &lt;em&gt;are&lt;/em&gt;, from 'humane'&amp;#x2014;that which, being human, we &lt;em&gt;wish&lt;/em&gt; we were.&quot;)&lt;/p&gt;
&lt;p&gt;Reason 2: &lt;strong&gt;AI is probably the first existential risk we must face&lt;/strong&gt; (given &lt;em&gt;my&lt;/em&gt; evidence, only the tiniest fraction of which I can share in a blog post).&lt;/p&gt;
&lt;p&gt;One reason AI may be the most urgent existential risk is that it's more likely for AI (compared to other sources of catastrophic risk) to be a full-blown &lt;em&gt;existential&lt;/em&gt; catastrophe (as opposed to a merely &lt;em&gt;billions dead&lt;/em&gt; catastrophe). Humans are smart and adaptable; we are &lt;a href=&quot;http://reflectivedisequilibrium.blogspot.com/2012/05/what-to-eat-during-impact-winter.html&quot;&gt;already set up&lt;/a&gt; for a species-preserving number of humans to survive (e.g. in underground bunkers with stockpiled food, water, and medicine) major catastrophes from nuclear war, superviruses, supervolcano eruption, and many cases of asteroid impact or nanotechnological &lt;a href=&quot;http://en.wikipedia.org/wiki/Ecophagy&quot;&gt;ecophagy&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;Machine superintelligences, however, could intelligently seek out and neutralize humans which they (correctly) recognize as threats to the maximal realization of their goals. Humans are surprisingly easy to kill if an intelligent process is &lt;em&gt;trying&lt;/em&gt; to do so. Cut off John's access to air for a few minutes, or cut off his water supply for a few days, or poke him with a &lt;a href=&quot;http://en.wikipedia.org/wiki/Sword&quot;&gt;sharp stick&lt;/a&gt;, and he dies. &lt;em&gt;Forever&lt;/em&gt;. (Post-humans might shudder at this absurdity like we shudder at the idea that people used to die from their &lt;em&gt;teeth&lt;/em&gt;.)&lt;/p&gt;
&lt;p&gt;Why think AI is coming anytime soon? This is too complicated a topic to breach here. See &lt;a href=&quot;http://singularity.org/files/IE-EI.pdf&quot;&gt;Intelligence Explosion: Evidence and Import&lt;/a&gt; for a brief analysis of AI timelines. Or try &lt;a href=&quot;http://www.theuncertainfuture.com/&quot;&gt;The Uncertain Future&lt;/a&gt;, which outputs an estimated timeline for human-level AI based on &lt;em&gt;your&lt;/em&gt; predictions of various technological developments. (SI is currently collaborating with the &lt;a href=&quot;http://www.fhi.ox.ac.uk/&quot;&gt;Future of Humanity Institute&lt;/a&gt; to write another paper on this subject.)&lt;/p&gt;
&lt;p&gt;It's also important to mention that the case for caring about AI risk is less conjunctive that many seem to think, which I discuss in more detail &lt;a href=&quot;#disjunctive&quot;&gt;here&lt;/a&gt;.&lt;/p&gt;
&lt;h2 id=&quot;si&quot;&gt;&lt;br&gt;&lt;/h2&gt;
&lt;h2&gt;SI can purchase several kinds of AI risk reduction more efficiently than others can&lt;/h2&gt;
&lt;p&gt;The two organizations working most directly to reduce AI risk are the &lt;a href=&quot;http://singularity.org/&quot;&gt;Singularity Institute&lt;/a&gt; and the &lt;a href=&quot;http://www.fhi.ox.ac.uk/&quot;&gt;Future of Humanity Institute&lt;/a&gt; (FHI). Luckily, these organizations complement each other well, as I &lt;a href=&quot;/lw/7sc/siai_vs_fhi_achievements_20082010/4w6v&quot;&gt;pointed out&lt;/a&gt; back before I was running SI:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;FHI is part of Oxford, and thus can bring credibility to existential risk reduction. Resulting output: lots of peer-reviewed papers, books from OUP like &lt;em&gt;Global Catastrophic Risks&lt;/em&gt;, conferences, media appearances, etc.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;SI is independent and is less constrained by conservatism or the university system. Resulting output: Very novel (and, to the mainstream, &quot;weird&quot;) research on Friendly AI, and the ability to do unusual things that are nevertheless quite effective at finding/creating lots of new people interested in rationality and existential risk reduction: (1) &lt;a href=&quot;http://wiki.lesswrong.com/wiki/Sequences&quot;&gt;The Sequences&lt;/a&gt;, the best tool I know for creating aspiring rationalists, (2) &lt;em&gt;&lt;a href=&quot;http://hpmor.com/&quot;&gt;Harry Potter and the Methods of Rationality&lt;/a&gt;&lt;/em&gt;, a surprisingly successful tool for grabbing the attention of mathematicians and computer scientists around the world, and (3) the &lt;a href=&quot;http://singularitysummit.com/&quot;&gt;Singularity Summit&lt;/a&gt;, a mainstream-aimed conference that brings in people who end up making significant contributions to the movement &amp;#x2014; e.g. Tomer Kagan (an SI donor and board member) and David Chalmers (author of&amp;#xA0;&lt;a href=&quot;http://www.consc.net/papers/singularityjcs.pdf&quot;&gt;The Singularity: A Philosophical Analysis&lt;/a&gt;&amp;#xA0;and &lt;a href=&quot;http://consc.net/papers/singreply.pdf&quot;&gt;The Singularity: A Reply&lt;/a&gt;).&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;A few weeks later, Nick Bostrom (Director of FHI) &lt;a href=&quot;/lw/8fw/new_qa_by_nick_bostrom/597e&quot;&gt;said the same things&lt;/a&gt; (as far as I know, &lt;em&gt;without&lt;/em&gt; having read &lt;em&gt;my&lt;/em&gt; comment):&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;I think there is a sense that both organizations are synergistic. If one were about to go under... that would probably be the one [to donate to]. If both were doing well... different people will have different opinions. We work quite closely with the folks from [the Singularity Institute]...&lt;/p&gt;
&lt;p&gt;There is an advantage to having one academic platform and one outside academia. There are different things these types of organizations give us. If you wanna get academics to pay more attention to this, to get postdocs to work on this, that's much easier to do within academia; also to get the ear of policy-makers and media... On the other hand, for [SI] there might be things that are easier for them to do. More flexibility, they're not embedded in a big bureaucracy. So they can more easily hire people with non-standard backgrounds... and also more grass-roots stuff like &lt;em&gt;Less Wrong&lt;/em&gt;...&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;FHI is, despite its small size, a highly productive philosophy department. More importantly, FHI has focused its research work on AI risk issues for the past 9 months, and plans to continue on that path for at least another 12 months. This is important work that should be supported. (Note that FHI recently hired SI research associate &lt;a href=&quot;http://www.danieldewey.net/&quot;&gt;Daniel Dewey&lt;/a&gt;.)&lt;/p&gt;
&lt;p&gt;SI lacks FHI's publishing productivity and its university credibility, but as an organization SI is &lt;a href=&quot;/lw/cbs/thoughts_on_the_singularity_institute_si/6jzn&quot;&gt;improving quickly&lt;/a&gt;, and it can seize &lt;a href=&quot;/lw/cs6/how_to_purchase_ai_risk_reduction/&quot;&gt;many opportunities for AI risk reduction&lt;/a&gt; that FHI is not well-positioned to seize. (&lt;em&gt;New&lt;/em&gt; organizations will &lt;em&gt;also&lt;/em&gt; tend to be less capable of seizing these opportunities than SI, due to the financial and human capital already concentrated at SI and FHI.)&lt;/p&gt;
&lt;p&gt;Here are some examples of projects that SI is probably better able to carry out than FHI, given its greater flexibility (and assuming sufficient funding):&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;A &lt;a href=&quot;/lw/cnj/a_scholarly_ai_risk_wiki/&quot;&gt;scholarly AI risk wiki&lt;/a&gt; written and maintained by dozens of part-time researchers from around the world.&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;/r/discussion/lw/cs7/reaching_young_mathcompsci_talent/&quot;&gt;Reaching young math/compsci talent&lt;/a&gt; in unusual ways, e.g. &lt;em&gt;&lt;a href=&quot;http://hpmor.com/&quot;&gt;HPMoR&lt;/a&gt;&lt;/em&gt;.&lt;/li&gt;
&lt;li&gt;Writing &lt;em&gt;&lt;a href=&quot;/lw/cr7/proposal_for_open_problems_in_friendly_ai/&quot;&gt;Open Problems in Friendly AI&lt;/a&gt;&lt;/em&gt; (Eliezer has spent far more time working on the mathy sub-problems of FAI than anyone else).&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&quot;Holden&quot;&gt;&lt;br&gt;&lt;/h2&gt;
&lt;h2&gt;My replies to Holden, point by point&lt;/h2&gt;
&lt;p&gt;Holden's post makes so &lt;em&gt;many&lt;/em&gt; claims that I'll just have to work through his post from beginning to end, and then summarize where I think we stand at the end.&lt;/p&gt;
&lt;h4 id=&quot;labs&quot;&gt;&lt;br&gt;&lt;/h4&gt;
&lt;h4&gt;GiveWell Labs&lt;/h4&gt;
&lt;p&gt;Holden opened &quot;&lt;a href=&quot;/lw/cbs/thoughts_on_the_singularity_institute_si/&quot;&gt;Thoughts on the Singularity Institute&lt;/a&gt;&quot; by noting that SI was previously outside Givewell's scope, since GiveWell was focused on specific domains like poverty reduction. With the launch of &lt;a href=&quot;http://www.givewell.org/about/labs&quot;&gt;GiveWell Labs&lt;/a&gt;, GiveWell is now open to evaluating &lt;em&gt;any&lt;/em&gt; giving opportunity, including SI.&lt;/p&gt;
&lt;p&gt;I admire this move. I'm sure people have been bugging GiveWell to do this for a long time, but almost none of those people appreciate how hard it is to launch broad new initiatives like this with the limited budget of an organization like Givewell or the Singularity Institute. Most of them &lt;em&gt;also&lt;/em&gt; do not understand how much work is required to write something like &quot;&lt;a href=&quot;/lw/cbs/thoughts_on_the_singularity_institute_si/&quot;&gt;Thoughts on the Singularity Institute&lt;/a&gt;&quot;, &quot;&lt;a href=&quot;/lw/cze/reply_to_holden_on_tool_ai/&quot;&gt;Reply to Holden on Tool AI&lt;/a&gt;&quot;, or &lt;em&gt;this&lt;/em&gt; post.&lt;/p&gt;
&lt;h4 id=&quot;outcomes&quot;&gt;&lt;br&gt;&lt;/h4&gt;
&lt;h4&gt;Three possible outcomes&lt;/h4&gt;
&lt;p&gt;Next, Holden wrote:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;[I hope] that one of these three things (or some combination) will happen:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;New arguments are raised that cause me to change my mind and recognize SI as an outstanding giving opportunity. If this happens I will likely attempt to raise more money for SI (most likely by discussing it with other GiveWell staff and collectively considering a GiveWell Labs recommendation).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;SI concedes that my objections are valid and increases its determination to address them. A few years from now, SI is a better organization and more effective in its mission.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;SI can't or won't make changes, and SI's supporters feel my objections are valid, so SI loses some support, freeing up resources for other approaches to doing good.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;&lt;/blockquote&gt;
&lt;p&gt;As explained at the top of Holden's post, I had already conceded that many of Holden's objections (especially concerning past organizational competence) are valid, and had been working to address them, even &lt;em&gt;before&lt;/em&gt; Holden's post was published. So outcome #2 is already true in part.&lt;/p&gt;
&lt;p&gt;I hope for outcome #1, too, but I don't expect Holden to change his opinion overnight. There are too many possible objections to which Holden has not yet heard a good response. But hopefully this post and its comment threads will successfully address &lt;em&gt;some&lt;/em&gt; of Holden's (and others') objections.&lt;/p&gt;
&lt;p&gt;Outcome #3 is unlikely since SI is &lt;em&gt;already&lt;/em&gt; making changes, though of course it's possible we will be unable to raise sufficient funding for SI &lt;em&gt;despite&lt;/em&gt; making these changes, or even &lt;em&gt;because of&lt;/em&gt; our efforts to make these changes. (Improving general organizational effectiveness is important but it costs money and is not exciting to donors.)&lt;/p&gt;
&lt;h4 id=&quot;mission&quot;&gt;&lt;br&gt;&lt;/h4&gt;
&lt;h4&gt;SI's mission is more important than SI as an organization&lt;/h4&gt;
&lt;p&gt;Holden said:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;whatever happens as a result of my post will be positive for SI's mission, whether or not it is positive for SI as an organization. I believe that most of SI's supporters and advocates care more about the former than about the latter, and that this attitude is far too rare in the nonprofit world.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Clearly, SI's mission is more important than SI as an organization. If somebody launches an organization more effective (at AI risk reduction) than SI but just as flexible, then SI should probably fold itself and try to move its donor base, support community, and the best of its human capital to that new organization.&lt;/p&gt;
&lt;p&gt;That said, it's probably easier to reform SI into a more effective organization than it is to launch a new one, since SI has successfully concentrated lots of attention, donor support, and human capital. Also, SI has learned many lessons about how to run a very tricky kind of organization. AI risk reduction is a mission that (1) is beyond most people's time horizons for caring, (2) is hard to understand and visualize, (3) pattern-matches to science fiction and apocalyptic religion, (4) suffers under complicated and &lt;em&gt;necessarily&lt;/em&gt; uncertain &lt;a href=&quot;/lw/ajm/ai_risk_and_opportunity_a_strategic_analysis/&quot;&gt;strategic considerations&lt;/a&gt; (compare to the simplicity of &lt;a href=&quot;http://www.givewell.org/international/top-charities/AMF&quot;&gt;bed nets&lt;/a&gt;), (5) has a very small pool of people from which to recruit researchers, etc. SI has lots of experience with these issues; experience that probably takes a long time and lots of money to acquire.&lt;/p&gt;
&lt;p&gt;(On the other hand, SI has also concentrated some bad reputation which a new organization could launch without. But I still think the weight of the arguments is in favor of reforming SI.)&lt;/p&gt;
&lt;h4 id=&quot;clarity&quot;&gt;&lt;br&gt;&lt;/h4&gt;
&lt;h4&gt;SI's arguments need to be clearer&lt;/h4&gt;
&lt;p&gt;Holden:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;I do not believe that [my objections to SI's apparent views] constitute a sharp/tight case for the idea that SI's work has low/negative value; I believe, instead, that SI's own arguments are too vague for such a rebuttal to be possible. There are many possible responses to my objections, but SI's public arguments (and the private arguments) do not make clear which possible response (if any) SI would choose to take up and defend. Hopefully the dialogue following this post will clarify what SI believes and why.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;I agree that SI's arguments are often vague. For example, Chris Hallquist &lt;a href=&quot;/lw/cbs/thoughts_on_the_singularity_institute_si/6k30&quot;&gt;reported&lt;/a&gt;:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;I've been trying to write something about Eliezer's debate with Robin Hanson, but the problem I keep running up against is that Eliezer's points are not clearly articulated at all. Even making my best educated guesses about what's supposed to go in the gaps in his arguments, I still ended up with very little.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;I know the feeling! That's why I've tried to write as many clarifying documents as I can, including the &lt;a href=&quot;http://singularity.org/singularity-faq/&quot;&gt;Singularity FAQ&lt;/a&gt;, &lt;a href=&quot;http://singularity.org/files/IE-EI.pdf&quot;&gt;Intelligence Explosion: Evidence and Import&lt;/a&gt;, &lt;a href=&quot;http://singularity.org/files/SaME.pdf&quot;&gt;The Singularity and Machine Ethics&lt;/a&gt;, &lt;em&gt;&lt;a href=&quot;http://facingthesingularity.com/&quot;&gt;Facing the Singularity&lt;/a&gt;&lt;/em&gt;, &lt;a href=&quot;http://lukeprog.com/SaveTheWorld.html&quot;&gt;So You Want to Save the World&lt;/a&gt;, and &lt;a href=&quot;/r/discussion/lw/cs6/how_to_purchase_ai_risk_reduction/&quot;&gt;How to Purchase AI Risk Reduction&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;Unfortunately, it takes lots of &lt;a href=&quot;http://singularity.org/donate/&quot;&gt;resources&lt;/a&gt; to write up hundreds of arguments and responses to objections in clear and precise language, and we're &lt;a href=&quot;http://singularity.org/research/&quot;&gt;working on it&lt;/a&gt;. (For comparison, Nick Bostrom's forthcoming book on machine superintelligence will barely scratch the surface of the things SI and FHI researchers have worked out in conversation, and it will probably take him 2+ years to write in total, and Bostrom is &lt;em&gt;already&lt;/em&gt; an unusually prolific writer.) Hopefully SI's responses to Holden's post have helped to clarify our positions already.&lt;/p&gt;
&lt;h4 id=&quot;first_objection&quot;&gt;&lt;br&gt;&lt;/h4&gt;
&lt;h4&gt;Holden's objection #1 punts to objection #2&lt;/h4&gt;
&lt;p&gt;The first objection on Holden's numbered list was:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;it seems to me that any AGI that was set to maximize a &quot;Friendly&quot; utility function would be extraordinarily dangerous.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;I'm glad Holden agrees with us that successful Friendly AI is &lt;em&gt;very hard&lt;/em&gt;. SI has spent much of its effort trying to show people that the first 20 solutions they come up with all fail. See: &lt;a href=&quot;http://singularity.org/files/AIRisk.pdf&quot;&gt;AI as a Positive and Negative Factor in Global Risk&lt;/a&gt;, &lt;a href=&quot;http://singularity.org/files/SaME.pdf&quot;&gt;The Singularity and Machine Ethics&lt;/a&gt;, &lt;a href=&quot;http://singularity.org/files/ComplexValues.pdf&quot;&gt;Complex Value Systems are Required to Realize Valuable Futures&lt;/a&gt;, etc. Holden mentions the standard SI worry about the &lt;a href=&quot;/lw/ld/the_hidden_complexity_of_wishes/&quot;&gt;hidden complexity of wishes&lt;/a&gt;, and the one about a friendly utility function still causing havoc because the AI's priors are wrong (problem 3.6 from my &lt;a href=&quot;http://lukeprog.com/SaveTheWorld.html&quot;&gt;list of open problems in AI risk research&lt;/a&gt;).&lt;/p&gt;
&lt;p&gt;There are reasons to think FAI is harder &lt;em&gt;still&lt;/em&gt;. What if we get the utility function right and we get the priors right but the AI's values &lt;em&gt;change&lt;/em&gt; for the worse when it &lt;a href=&quot;http://singularity.org/files/OntologicalCrises.pdf&quot;&gt;updates its ontology&lt;/a&gt;? What if the smartest, most careful, most insanely safety-conscious AI researchers humanity can produce &lt;em&gt;just aren't smart enough&lt;/em&gt; to solve the problem? What if &lt;em&gt;no&lt;/em&gt; humans are altruistic enough to choose to build FAI over an AI that will make them king of the universe? What if the idea of FAI is incoherent? (The human brain is an existence proof for the possibility of general intelligence, but we have &lt;em&gt;no&lt;/em&gt; existence proof for the possibility of a decision theoretic agent which stably optimizes the world according to a set of preferences over states of affairs.)&lt;/p&gt;
&lt;p&gt;So, yeah. Friendly AI is &lt;em&gt;hard&lt;/em&gt;. But as I said &lt;a href=&quot;/lw/cck/holden_karnofskys_singularity_institute_objection/&quot;&gt;elsewhere&lt;/a&gt;:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;The point is that &lt;em&gt;not&lt;/em&gt; trying as hard as you can to build Friendly AI is even &lt;em&gt;worse&lt;/em&gt;, because then you &lt;em&gt;almost certainly get uFAI&lt;/em&gt;. At least by &lt;em&gt;trying&lt;/em&gt; to build FAI, we've got some chance of winning.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;So Holden's objection #1 objection really just punts to objection #2, about tool-AGI, as the last paragraph in this section of Holden's post seems to indicate:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;So far, all I have argued is that the development of &quot;Friendliness&quot; theory can achieve at best only a limited reduction in the probability of an unfavorable outcome. However, as I argue in the next section, I believe there is at least one concept - the &quot;tool-agent&quot; distinction - that has more potential to reduce risks, and that SI appears to ignore this concept entirely.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;So if Holden's objection #2 doesn't work, then objection #1 ends up reducing to &quot;the development of Friendliness theory can achieve at best a reduction in AI risk,&quot; which is what SI has been saying all along.&lt;/p&gt;
&lt;h4 id=&quot;toolAI&quot;&gt;&lt;br&gt;&lt;/h4&gt;
&lt;h4&gt;Tool AI&lt;/h4&gt;
&lt;p&gt;Holden's second numbered objection was:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;SI appears to neglect the potentially important distinction between &quot;tool&quot; and &quot;agent&quot; AI.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Eliezer wrote a whole post about this &lt;a href=&quot;/lw/cze/reply_to_holden_on_tool_ai/&quot;&gt;here&lt;/a&gt;. To sum up:&lt;/p&gt;
&lt;p&gt;(1) &lt;strong&gt;Whether you're working with Tool AI or Agent AI, you need the &quot;Friendly AI&quot; domain experts that SI is trying to recruit&lt;/strong&gt;:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;A &quot;Friendly AI programmer&quot; is somebody who specializes in seeing the correspondence of mathematical structures to What Happens in the Real World. It's somebody who looks at Hutter's specification of AIXI and reads the actual equations - actually stares at the Greek symbols and not just the accompanying English text - and sees, &quot;Oh, this AI will try to gain control of its reward channel,&quot; as well as numerous subtler issues like, &quot;This AI presumes a Cartesian boundary separating itself from the environment; it may drop an anvil on its own head.&quot; Similarly, working on TDT means e.g. looking at a mathematical specification of decision theory, and seeing &quot;Oh, this is vulnerable to blackmail&quot; and coming up with a mathematical counter-specification of an AI that isn't so vulnerable to blackmail.&lt;/p&gt;
&lt;p&gt;Holden's post seems to imply that if you're building a non-self-modifying planning Oracle (aka 'tool AI') rather than an acting-in-the-world agent, you don't need a Friendly AI programmer because FAI programmers only work on agents. But this isn't how the engineering skills are split up. Inside the AI, whether an agent AI or a planning Oracle, there would be similar AGI-challenges like &quot;build a predictive model of the world&quot;, and similar FAI-conjugates of those challenges like finding the 'user' inside an AI-created model of the universe. The insides would look a lot more similar than the outsides. An analogy would be supposing that a machine learning professional who does sales optimization for an orange company couldn't possibly do sales optimization for a banana company, because their skills must be about oranges rather than bananas.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;(2) &lt;strong&gt;Tool AI isn't that much safer than Agent AI, because Tool AIs have lots of hidden &quot;gotchas&quot; that cause havoc, too&lt;/strong&gt;. (See &lt;a href=&quot;/lw/cze/reply_to_holden_on_tool_ai/&quot;&gt;Eliezer's post&lt;/a&gt; for examples.)&lt;/p&gt;
&lt;p&gt;These points illustrate something else Eliezer wrote:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;What the human species needs from an x-risk perspective is experts on This Whole Damn Problem [of AI risk], who will acquire whatever skills are needed to that end. The Singularity Institute exists to host such people and enable their research&amp;#x2014;once we have enough funding to find and recruit them.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Indeed. We need places for experts who specialize in seeing the consequences of mathematical objects for things humans value (e.g. the Singularity Institute) just like we need places for experts on efficient charity (e.g. &lt;a href=&quot;http://www.givewell.org/&quot;&gt;Givewell&lt;/a&gt;).&lt;/p&gt;
&lt;p&gt;Anyway, it's worth pointing out that Holden did &lt;em&gt;not&lt;/em&gt; make the common (and mistaken) argument that &quot;We should just build Tool AIs instead of Agent AIs and then we'll be fine.&quot; This is wrong for many reasons, but one obvious point is that there are incentives to build Agent AIs (because they're powerful), so even if the first 6 teams are careful enough to build only Tool AIs, the 7th team could still build Agent AI and destroy the world.&lt;/p&gt;
&lt;p&gt;Instead, Holden pointed out that &lt;em&gt;you could use Tool AI to increase your chances of successfully building agenty FAI&lt;/em&gt;:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;if developing &quot;Friendly AI&quot; is what we seek, a tool-AGI could likely be helpful enough in thinking through this problem as to render any previous work on &quot;Friendliness theory&quot; moot. Among other things, a tool-AGI would allow transparent views into the AGI's reasoning and predictions without any reason to fear being purposefully misled, and would facilitate safe experimental testing of any utility function that one wished to eventually plug into an &quot;agent.&quot;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;After reading &lt;a href=&quot;/lw/cze/reply_to_holden_on_tool_ai/&quot;&gt;Eliezer's reply&lt;/a&gt;, however, you can probably guess my replies to this paragraph:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;Tool AI isn't as safe as Holden thinks.&lt;/li&gt;
&lt;li&gt;But yeah, a Friendly AI team may very well use &quot;Tool AI&quot; to aid Friendliness research if it can figure out a safe way to do that. This doesn't obviate the need for Friendly AI researchers; it's &lt;em&gt;part&lt;/em&gt; of their research toolbox.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;So Holden's Objection #2 doesn't work, which (as explained earlier) means that his Objection #1 (as stated) doesn't work either.&lt;/p&gt;
&lt;h4 id=&quot;disjunctive&quot;&gt;&lt;br&gt;&lt;/h4&gt;
&lt;h4&gt;SI's mission assumes a scenario that is far &lt;em&gt;less&lt;/em&gt; conjunctive than it initially appears.&lt;/h4&gt;
&lt;p&gt;Holden's objection #3 is:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;SI's envisioned scenario is far more specific and conjunctive than it appears at first glance, and I believe this scenario to be highly unlikely.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;His main concern here seemed to be that technological developments and other factors would render earlier FAI work irrelevant. But Eliezer's &lt;a href=&quot;/lw/cze/reply_to_holden_on_tool_ai/&quot;&gt;clarifications&lt;/a&gt; about what we mean by &quot;FAI team&quot; render this objection moot, at least as it is currently stated. The &lt;em&gt;purpose&lt;/em&gt; of an FAI team is not to blindly develop one particular approach to Friendly AI without checking to see whether this work will be obsoleted by future developments. Instead, the purpose of an FAI team is to develop highly specialized expertise on, among other things, which kinds of research are more and less likely to be relevant given future developments.&lt;/p&gt;
&lt;p&gt;Holden's confusion about what SI means by &quot;FAI team&quot; is common and understandable, and it is one reason that SI's mission assumes a scenario that is far &lt;em&gt;less&lt;/em&gt; conjunctive than it appears to many. We aren't saying we need an FAI team because we know lots of specific things about how AGI will be built 30 years from now. We're saying you need experts on &quot;the consequences of mathematical objects for things humans value&quot; (an FAI team) because AGIs are mathematical objects and will have big consequences. That's pretty disjunctive.&lt;/p&gt;
&lt;p&gt;Similarly, many people think SI's mission is predicated on &lt;a href=&quot;http://wiki.lesswrong.com/wiki/Hard_takeoff&quot;&gt;hard takeoff&lt;/a&gt;. After all, we call ourselves the &quot;Singularity Institute,&quot; Eliezer has spent a lot of time arguing for hard takeoff, and our &lt;a href=&quot;http://singularity.org/summary/&quot;&gt;current research summary&lt;/a&gt; frames AI risk in terms of &lt;a href=&quot;http://wiki.lesswrong.com/wiki/Recursive_self-improvement&quot;&gt;recursive self-improvement&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;But the case for AI as a global risk, and thus the need for dedicated experts on AI risk and &quot;the consequences of mathematical objects for things humans value&quot;, isn't predicated on hard takeoff. Instead, it looks something like this:&lt;/p&gt;
&lt;p&gt;(1) &lt;strong&gt;Eventually, most tasks are performed by machine intelligences.&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The improved flexibility, copyability, and modifiability of machine intelligences make them economically dominant even without other advantages (&lt;a href=&quot;http://www.amazon.com/Race-Against-The-Machine-ebook/dp/B005WTR4ZI/&quot;&gt;Brynjolfsson &amp;amp; McAfee 2011&lt;/a&gt;; &lt;a href=&quot;http://maven.smith.edu/~thiebaut/research/singularity/ieee_spectrum__economics_of_the_singularity.pdf&quot;&gt;Hanson 2008&lt;/a&gt;). In addition, there is &lt;a href=&quot;http://facingthesingularity.com/2011/plenty-of-room-above-us/&quot;&gt;plenty of room&lt;/a&gt; &quot;above&quot; the human brain in terms of hardware and software for general intelligence (&lt;a href=&quot;http://singularity.org/files/IE-EI.pdf&quot;&gt;Muehlhauser &amp;amp; Salamon 2012&lt;/a&gt;; &lt;a href=&quot;http://www.xuenay.net/Papers/DigitalAdvantages.pdf&quot;&gt;Sotala 2012&lt;/a&gt;; &lt;a href=&quot;http://www.amazon.com/The-Singularity-Is-Near-Transcend/dp/0143037889/&quot;&gt;Kurzweil 2005&lt;/a&gt;).&lt;/p&gt;
&lt;p&gt;(2) &lt;strong&gt;Machine intelligences don't necessarily do things we like.&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;We don't necessarily control AIs, since advanced intelligences may be inherently goal-oriented (&lt;a href=&quot;http://selfawaresystems.files.wordpress.com/2008/01/nature_of_self_improving_ai.pdf&quot;&gt;Omohundro 2007&lt;/a&gt;), and even if we build advanced &quot;Tool AIs,&quot; these aren't necessarily safe either (&lt;a href=&quot;/lw/cze/reply_to_holden_on_tool_ai/&quot;&gt;Yudkowsky 2012&lt;/a&gt;) and there will be significant economic incentives to transform them into autonomous agents (&lt;a href=&quot;http://www.amazon.com/Race-Against-The-Machine-ebook/dp/B005WTR4ZI/&quot;&gt;Brynjolfsson &amp;amp; McAfee 2011&lt;/a&gt;). We don't value most possible futures, but it's &lt;em&gt;very&lt;/em&gt; hard to get an autonomous AI to do exactly what you want (&lt;a href=&quot;http://singularity.org/files/AIRisk.pdf&quot;&gt;Yudkowsky 2008&lt;/a&gt;, &lt;a href=&quot;http://singularity.org/files/ComplexValues.pdf&quot;&gt;2011&lt;/a&gt;; &lt;a href=&quot;http://singularity.org/files/SaME.pdf&quot;&gt;Muehlhauser &amp;amp; Helm 2012&lt;/a&gt;; &lt;a href=&quot;http://www.amazon.com/Governing-Lethal-Behavior-Autonomous-Robots/dp/1420085948/&quot;&gt;Arkin 2009&lt;/a&gt;).&lt;/p&gt;
&lt;p&gt;(3) &lt;strong&gt;There are things we can do to increase the probability that machine intelligences do things we like.&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Further research can clarify (1) the nature and severity of the risk, (2) how to engineer goal-oriented systems safely, (3) how to increase safety with &lt;a href=&quot;http://en.wikipedia.org/wiki/Differential_technological_development&quot;&gt;differential technological development&lt;/a&gt;, (4) how to limit and control machine intelligences (&lt;a href=&quot;http://www.aleph.se/papers/oracleAI.pdf&quot;&gt;Armstrong et al. 2012&lt;/a&gt;; &lt;a href=&quot;http://dl.dropbox.com/u/5317066/2012-yampolskiy.pdf&quot;&gt;Yampolskiy 2012&lt;/a&gt;), (5) solutions to AI development coordination problems, and more.&lt;/p&gt;
&lt;p&gt;(4) &lt;strong&gt;We should do those things now.&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;People aren't doing much about these issues now. We could wait until we understand better (e.g.) what kind of AI is likely, but: (1) it might take a long time to resolve the core issues, including difficult technical subproblems that require time-consuming mathematical breakthroughs, (2) incentives &lt;a href=&quot;http://singularity.org/files/ArmsControl.pdf&quot;&gt;may be badly aligned&lt;/a&gt; (e.g. there seem to be strong economic incentives to build AI, but not to take into account social and global risks for AI), (3) AI may not be that far away (&lt;a href=&quot;http://singularity.org/files/IE-EI.pdf&quot;&gt;Muehlhauser &amp;amp; Salamon 2012&lt;/a&gt;), and (4) the transition to machine dominance may be surprisingly rapid due to (e.g.) intelligence explosion (&lt;a href=&quot;http://consc.net/papers/singularityjcs.pdf&quot;&gt;Chalmers 2010&lt;/a&gt;, &lt;a href=&quot;http://consc.net/papers/singreply.pdf&quot;&gt;2012&lt;/a&gt;; &lt;a href=&quot;http://singularity.org/files/IE-EI.pdf&quot;&gt;Muehlhauser &amp;amp; Salamon 2012&lt;/a&gt;) or &lt;a href=&quot;http://wiki.lesswrong.com/wiki/Computing_overhang&quot;&gt;computing overhang&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;What do I mean by &quot;computing overhang&quot;? We may get the hardware needed for AI long before we get the software, such that once software for general intelligence is figured out, there is tons of computing hardware sitting around for running AIs (a &quot;computing overhang&quot;). Thus we could switch from a world with one autonomous AI to a world with 10 billion autonomous AIs at the speed of copying software, and thereby transition rapidly from human dominance to AI dominance even without an intelligence explosion. (This is one of the many, many things we haven't yet written up in detail up due to lack of resources.)&lt;/p&gt;
&lt;p&gt;&lt;small&gt;(This broad argument is greatly compressed from a paper outline developed by &lt;a href=&quot;http://ordinaryideas.wordpress.com/&quot;&gt;Paul Christiano&lt;/a&gt;, &lt;a href=&quot;/lw/7ob/timeline_of_carl_shulman_publications/&quot;&gt;Carl Shulman&lt;/a&gt;, &lt;a href=&quot;https://sites.google.com/site/nbeckstead/&quot;&gt;Nick Beckstead&lt;/a&gt;, and myself. We'd love to write the paper at some point, but haven't had the resources to do so. The fuller version of this argument is of course more detailed.)&lt;/small&gt;&lt;/p&gt;
&lt;h4 id=&quot;argumentation&quot;&gt;&lt;br&gt;&lt;/h4&gt;
&lt;h4&gt;SI's public argumentation&lt;/h4&gt;
&lt;p&gt;Next, Holden turned to the topic of SI's organizational effectiveness:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;when evaluating a group such as SI, I can't avoid placing a heavy weight on (my read on) the general competence, capability and &quot;intangibles&quot; of the people and organization, because SI's mission is not about repeating activities that have worked in the past...&lt;/p&gt;
&lt;p&gt;There are several reasons that I currently have a negative impression of SI's general competence, capability and &quot;intangibles.&quot;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;The first reason Holden gave for his negative impression of SI is:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;SI has produced enormous quantities of public argumentation... Yet I have never seen a clear response to any of the three basic objections I listed in the previous section. One of SI's major goals is to raise awareness of AI-related risks; given this, the fact that it has not advanced clear/concise/compelling arguments speaks, in my view, to its general competence.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;I agree &lt;em&gt;in part&lt;/em&gt;. Here's what I think:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;SI &lt;em&gt;hasn't&lt;/em&gt; made its arguments as clear, concise, and compelling as I would like. We're &lt;a href=&quot;http://singularity.org/research/&quot;&gt;working on that&lt;/a&gt;. It takes time, money, and people who are (1) smart and capable enough to do AI risk research work and yet somehow (2) willing to work for non-profit salaries and (3) willing to &lt;em&gt;not&lt;/em&gt; advance their careers like they would if they chose instead to work at a university.&lt;/li&gt;
&lt;li&gt;There are a &lt;em&gt;huge&lt;/em&gt; number of possible objections to SI's arguments, and we haven't had the resources to write up clear and compelling replies to &lt;em&gt;all&lt;/em&gt; of them. (See &lt;a href=&quot;http://consc.net/papers/singreply.pdf&quot;&gt;Chalmers 2012&lt;/a&gt; for quick rebuttals to many objections to intelligence explosion, but what he covers in that paper barely scratches the surface.) As &lt;a href=&quot;/lw/cze/reply_to_holden_on_tool_ai/&quot;&gt;Eliezer wrote&lt;/a&gt;, Holden's complaint that SI hasn't addressed &lt;em&gt;his&lt;/em&gt; particular objections &quot;seems to lack perspective on how &lt;em&gt;many&lt;/em&gt; different things various people see as the &lt;em&gt;one obvious solution&lt;/em&gt; to Friendly AI. Tool AI wasn't the obvious solution to John McCarthy, I.J. Good, or Marvin Minsky. Today's leading AI textbook, &lt;em&gt;Artificial Intelligence: A Modern Approach&lt;/em&gt;... discusses Friendly AI and AI risk for 3.5 pages but doesn't mention tool AI as an obvious solution. For Ray Kurzweil, the obvious solution is merging humans and AIs. For Jurgen Schmidhuber, the obvious solution is AIs that value a certain complicated definition of complexity in their sensory inputs. Ben Goertzel, J. Storrs Hall, and Bill Hibbard, among others, have all written about how silly Singinst is to pursue Friendly AI when the solution is obviously X, for various different X. Among current leading people working on serious AGI programs labeled as such, neither Demis Hassabis (VC-funded to the tune of several million dollars) nor Moshe Looks (head of AGI research at Google) nor Henry Markram (Blue Brain at IBM) think that the obvious answer is Tool AI. Vernor Vinge, Isaac Asimov, and any number of other SF writers with technical backgrounds who spent serious time thinking about these issues didn't converge on that solution.&quot;&lt;/li&gt;
&lt;li&gt;SI &lt;em&gt;has&lt;/em&gt; done a decent job of raising awareness of AI risk, I think. Writing &lt;a href=&quot;http://wiki.lesswrong.com/wiki/Sequences&quot;&gt;The Sequences&lt;/a&gt; and &lt;em&gt;&lt;a href=&quot;http://hpmor.com/&quot;&gt;HPMoR&lt;/a&gt;&lt;/em&gt; have (indirectly) raised more awareness for AI risk that one can normally expect from, say, writing a bunch of clear and precise academic papers about a subject. (At least, it seems that way to me.)&lt;/li&gt;
&lt;/ul&gt;
&lt;h4 id=&quot;endorsements&quot;&gt;&lt;br&gt;&lt;/h4&gt;
&lt;h4&gt;SI's endorsements&lt;/h4&gt;
&lt;p&gt;The second reason Holden gave for his negative impression of SI is &quot;a lack of impressive endorsements.&quot; This one is generally true, despite the three &quot;celebrity endorsements&quot; on &lt;a href=&quot;http://singularity.org/donate/&quot;&gt;our new donate page&lt;/a&gt;. More impressive than these is the fact that, as Eliezer mentioned, the latest edition of the leading AI textbook spend &lt;a href=&quot;/lw/4rx/singularity_and_friendly_ai_in_the_dominant_ai/&quot;&gt;several pages&lt;/a&gt; talking about AI risk and Friendly AI, and discusses the work of SI-associated researchers like &lt;a href=&quot;http://yudkowsky.net/&quot;&gt;Eliezer Yudkowsky&lt;/a&gt; and &lt;a href=&quot;http://steveomohundro.com/&quot;&gt;Steve Omohundro&lt;/a&gt; while &lt;em&gt;completely ignoring&lt;/em&gt; the existence of the older, more prestigious, and vastly larger mainstream academic field of &quot;&lt;a href=&quot;http://en.wikipedia.org/wiki/Machine_ethics&quot;&gt;machine ethics&lt;/a&gt;.&quot;&lt;/p&gt;
&lt;p&gt;Why don't we have impressive endorsements? To my knowledge, SI hasn't tried very hard to get them. That's &lt;a href=&quot;/lw/cbs/thoughts_on_the_singularity_institute_si/6jzn&quot;&gt;another thing&lt;/a&gt; we're in the process of changing.&lt;/p&gt;
&lt;h4 id=&quot;feedback&quot;&gt;&lt;br&gt;&lt;/h4&gt;
&lt;h4&gt;SI and feedback loops&lt;/h4&gt;
&lt;p&gt;The third reason Holden gave for his negative impression of SI is:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;SI seems to have passed up opportunities to test itself and its own rationality by e.g. aiming for objectively impressive accomplishments... Pursuing more impressive endorsements and developing benign but objectively recognizable innovations (particularly commercially viable ones) are two possible ways to impose more demanding feedback loops.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;We have thought many times about commercially viable innovations we could develop, but these would generally be large distractions from the work of our core mission. (The &lt;a href=&quot;http://appliedrationality.org&quot;&gt;Center for Applied Rationality&lt;/a&gt;, in contrast, has &lt;em&gt;many&lt;/em&gt; opportunities to develop commercially viable innovations in line with its core mission.)&lt;/p&gt;
&lt;p&gt;Still, I &lt;em&gt;do&lt;/em&gt; think it's important for the Singularity Institute to test itself with tight feedback loops wherever feasible. This is particularly difficult to do for a research organization doing a &lt;a href=&quot;http://www.nickbostrom.com/old/predict.html&quot;&gt;philosophy of long-term forecasting&lt;/a&gt; (30 years is not a &quot;tight&quot; feedback loop in the slightest), but that's what &lt;a href=&quot;http://www.fhi.ox.ac.uk/&quot;&gt;FHI&lt;/a&gt; does and they have more &quot;objectively impressive&quot; (that is, &quot;externally proclaimed&quot;) accomplishments: lots of peer-reviewed publications, some major awards for its top researcher Nick Bostrom, etc.&lt;/p&gt;
&lt;h4 id=&quot;rationality&quot;&gt;&lt;br&gt;&lt;/h4&gt;
&lt;h4&gt;SI and rationality&lt;/h4&gt;
&lt;p&gt;Holden's fourth concern about SI is that it is overconfident about the level of its own rationality, and that this seems to show itself in (e.g.) &quot;insufficient self-skepticism&quot; and &quot;being too selective (in terms of looking for people who share its preconceptions) when determining whom to hire and whose feedback to take seriously.&quot;&lt;/p&gt;
&lt;p&gt;What would provide good evidence of rationality? Holden explains:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;I endorse &lt;a href=&quot;/lw/nc/newcombs_problem_and_regret_of_rationality/&quot;&gt;Eliezer Yudkowsky's statement&lt;/a&gt;, &quot;Be careful &amp;#x2026; any time you find yourself defining the [rationalist] as someone other than the agent who is currently smiling from on top of a giant heap of utility.&quot; To me, the best evidence of superior general rationality (or of insight into it) would be objectively impressive achievements (successful commercial ventures, highly prestigious awards, clear innovations, etc.) and/or accumulation of wealth and power. As mentioned above, SI staff/supporters/advocates do not seem particularly impressive on these fronts...&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Unfortunately, this seems to misunderstand the term &quot;rationality&quot; as it is meant in cognitive science. As I explained &lt;a href=&quot;/lw/c7g/rationality_and_winning/&quot;&gt;elsewhere&lt;/a&gt;:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Like intelligence and money, rationality is only a ceteris paribus predictor of success.&lt;/p&gt;
&lt;p&gt;So while it's empirically true (&lt;a href=&quot;http://www.amazon.com/Rationality-Reflective-Mind-Keith-Stanovich/dp/0195341147/&quot;&gt;Stanovich 2010&lt;/a&gt;) that rationality is a predictor of life success, it's a weak one. (At least, it's a weak predictor of success at the levels of human rationality we are capable of &lt;a href=&quot;/lw/b98/minicamps_on_rationality_and_awesomeness_may_1113/&quot;&gt;training&lt;/a&gt; &lt;em&gt;today&lt;/em&gt;.) If you want to more reliably achieve life success, I recommend inheriting a billion dollars or, failing that, being born+raised to have an excellent work ethic and low akrasia.&lt;/p&gt;
&lt;p&gt;The reason you should &quot;be careful&amp;#x2026; any time you find yourself defining the [rationalist] as someone other than the agent who is currently smiling from on top of a giant heap of utility&quot; is because you should &quot;&lt;a href=&quot;/lw/nc/newcombs_problem_and_regret_of_rationality/&quot;&gt;never end up envying someone else's mere &lt;em&gt;choices&lt;/em&gt;&lt;/a&gt;.&quot; You are still allowed to envy their resources, intelligence, work ethic, mastery over akrasia, and other predictors of success.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;But I don't mean to dodge the key issue. I think SIers &lt;em&gt;are&lt;/em&gt; generally more rational than most people (and so are LWers, &lt;a href=&quot;http://commonsenseatheism.com/?p=16199&quot;&gt;it seems&lt;/a&gt;), but I think SIers &lt;em&gt;have&lt;/em&gt; often overestimated their own rationality, myself included. Certainly, I think SI's leaders have been &lt;a href=&quot;/lw/cbs/thoughts_on_the_singularity_institute_si/6l4h&quot;&gt;pretty irrational&lt;/a&gt; about &lt;em&gt;organizational development&lt;/em&gt; at many times in the past. In internal communications about why SI should help launch &lt;a href=&quot;http://appliedrationality.org&quot;&gt;CFAR&lt;/a&gt;, one reason on my list has been: &quot;We need to improve our own rationality, and figure out how to create better rationalists than exist today.&quot;&lt;/p&gt;
&lt;h4 id=&quot;activities&quot;&gt;&lt;br&gt;&lt;/h4&gt;
&lt;h4&gt;SI's goals and activities&lt;/h4&gt;
&lt;p&gt;Holden's fifth concern about SI is the apparent disconnect between SI's goals and its activities:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;SI seeks to build FAI and/or to develop and promote &quot;Friendliness theory&quot; that can be useful to others in building FAI. Yet it seems that most of its time goes to activities other than developing AI or theory.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;This one is pretty easy to answer. We've focused mostly on movement-building rather than direct research because, until very recently, there wasn't enough community interest or funding to seriously begin to form an FAI team. To do that you need (1) at least a few million dollars a year, and (2) enough smart, altruistic people to care about AI risk that there exist some potential &lt;a href=&quot;/r/discussion/lw/cv9/building_toward_a_friendly_ai_team/&quot;&gt;superhero mathematicians&lt;/a&gt; for the FAI team. And to get those two things, you've got to do &lt;em&gt;mostly&lt;/em&gt; movement-building, e.g. &lt;em&gt;&lt;a href=&quot;/&quot;&gt;Less Wrong&lt;/a&gt;&lt;/em&gt;, &lt;em&gt;&lt;a href=&quot;http://hpmor.com/&quot;&gt;HPMoR&lt;/a&gt;&lt;/em&gt;, the &lt;a href=&quot;http://singularitysummit.com/&quot;&gt;Singularity Summit&lt;/a&gt;, etc.&lt;/p&gt;
&lt;h4 id=&quot;theft&quot;&gt;&lt;br&gt;&lt;/h4&gt;
&lt;h4&gt;Theft&lt;/h4&gt;
&lt;p&gt;And of course, Holden is (rightly) concerned about the &lt;a href=&quot;/lw/5il/siai_an_examination/&quot;&gt;2009 theft of $118,000&lt;/a&gt; from SI, and the lack of public statements from SI on the matter.&lt;/p&gt;
&lt;p&gt;Briefly:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Two former employees stole $118,000 from SI. Earlier this year we finally won stipulated judgments against both individuals, forcing them to pay back the full amounts they stole. We have already recovered several thousand dollars of this.&lt;/li&gt;
&lt;li&gt;We do have much better financial controls now. We consolidated our accounts so there are fewer accounts to watch, and at least three staff members check them regularly, as does our treasurer, who is &lt;em&gt;not&lt;/em&gt; an SI staff member or board member.&lt;/li&gt;
&lt;/ul&gt;
&lt;h4 id=&quot;mugging&quot;&gt;&lt;br&gt;&lt;/h4&gt;
&lt;h4&gt;Pascal's Mugging&lt;/h4&gt;
&lt;p&gt;In another section, Holden wrote:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;A common argument that SI supporters raise with me is along the lines of, &quot;Even if SI's arguments are weak and its staff isn't as capable as one would like to see, their goal is so important that they would be a good investment even at a tiny probability of success.&quot;&lt;/p&gt;
&lt;p&gt;I believe this argument to be a form of &lt;a href=&quot;/lw/kd/pascals_mugging_tiny_probabilities_of_vast/&quot;&gt;Pascal's Mugging&lt;/a&gt; and I have outlined the reasons I believe it to be invalid...&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Some problems with Holden's &lt;a href=&quot;http://blog.givewell.org/2011/08/18/why-we-cant-take-expected-value-estimates-literally-even-when-theyre-unbiased/&quot;&gt;two&lt;/a&gt; &lt;a href=&quot;http://blog.givewell.org/2011/11/10/maximizing-cost-effectiveness-via-critical-inquiry/&quot;&gt;posts&lt;/a&gt;&amp;#xA0;on this subject will be explained in a forthcoming post by Steven Kaas. But as Holden notes, some SI principals like &lt;a href=&quot;/lw/745/why_we_cant_take_expected_value_estimates/4nzy&quot;&gt;Eliezer&lt;/a&gt; don't use &quot;small probability of large impact&quot; arguments, anyway. We in fact argue that the probability of a large impact is &lt;em&gt;not&lt;/em&gt; tiny.&lt;/p&gt;
&lt;h2 id=&quot;summary&quot;&gt;&lt;br&gt;&lt;/h2&gt;
&lt;h2&gt;Summary of my reply to Holden&lt;/h2&gt;
&lt;p&gt;Now that I have addressed so many details, let us return to the big picture. My summarized reply to Holden goes like this:&lt;/p&gt;
&lt;p&gt;Holden's first two objections can be summarized as arguing that developing the Friendly AI approach is more dangerous than developing non-agent &quot;Tool&quot; AI. &lt;a href=&quot;/lw/cze/reply_to_holden_on_tool_ai/&quot;&gt;Eliezer's post&lt;/a&gt; points out that &quot;Friendly AI&quot; domain experts are what you need whether you're working with Tool AI or Agent AI, because (1) both of these approaches require FAI experts (experts in seeing the consequences of mathematical objects for what humans value), and because (2) Tool AI isn't necessarily much safer than Agent AI, because Tool AIs have lots of hidden gotchas, too. Thus, &quot;What the human species needs from an x-risk perspective is experts on This Whole Damn Problem [of AI risk], who will acquire whatever skills are needed to that end. The Singularity Institute exists to host such people and enable their research &amp;#x2014; once we have enough funding to find and recruit them.&quot;&lt;/p&gt;
&lt;p&gt;Holden's third objection was that the argument behind SI's mission is more conjunctive than it seems. I &lt;a href=&quot;#disjunctive&quot;&gt;replied&lt;/a&gt; that the argument behind SI's mission is actually &lt;em&gt;less&lt;/em&gt; conjunctive than it often seems, because an &quot;FAI team&quot; works on a broader set of problems than Holden had realized, and because the case for AI risk is more disjunctive than many people realize. These confusions are understandable, however, and they probably are a result of insufficient clear argumentative writing from SI on these matters &amp;#x2014; a problem we am trying to fix with several recent and forthcoming &lt;a href=&quot;http://singularity.org/research/&quot;&gt;papers&lt;/a&gt; and other communications (like this one).&lt;/p&gt;
&lt;p&gt;Holden's next objection concerned SI as an organization: &quot;SI has, or has had, multiple properties that I associate with ineffective organizations.&quot; I acknowledged these problems before Holden published his post, and have since outlined the &lt;a href=&quot;/lw/cbs/thoughts_on_the_singularity_institute_si/6jzn&quot;&gt;many improvements&lt;/a&gt; we've made to organizational effectiveness since I was made Executive Director. I addressed several of Holden's specific worries &lt;a href=&quot;#argumentation&quot;&gt;here&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;Finally, Holden recommended giving to a donor-advised fund rather than to SI:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;I don't think that &quot;Cause X is the one I care about and Organization Y is the only one working on it&quot; to be a good reason to support Organization Y. For donors determined to donate within this cause, I encourage you to consider donating to a donor-advised fund while making it clear that you intend to grant out the funds to existential-risk-reduction-related organizations in the future....&lt;/p&gt;
&lt;p&gt;For one who accepts my arguments about SI, I believe withholding funds in this way is likely to be better for SI's mission than donating to SI&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;By now I've called into question most of Holden's arguments about SI, but I will still address the issue of donating to SI vs. donating to a &lt;a href=&quot;http://en.wikipedia.org/wiki/Donor_advised_fund&quot;&gt;donor-advised fund&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;First: Which public charity would administer the donor-advised fund? Remember also that in the U.S., the administering charity need not spend from the donor-advised fund as the donor wishes, though they often do.&lt;/p&gt;
&lt;p&gt;Second: As I said &lt;a href=&quot;#mission&quot;&gt;earlier&lt;/a&gt;,&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;it's probably easier to reform SI into a more effective organization than it is to launch a new one, since SI has successfully concentrated lots of attention, donor support, and human capital. Also, SI has learned many lessons about how to run a very tricky kind of organization. AI risk reduction is a mission that (1) is beyond most people's time horizons for caring, (2) is hard to understand and visualize, (3) pattern-matches to science fiction and apocalyptic religion, (4) suffers under complicated and &lt;em&gt;necessarily&lt;/em&gt; uncertain &lt;a href=&quot;/lw/ajm/ai_risk_and_opportunity_a_strategic_analysis/&quot;&gt;strategic considerations&lt;/a&gt; (compare to the simplicity of &lt;a href=&quot;http://www.givewell.org/international/top-charities/AMF&quot;&gt;bed nets&lt;/a&gt;), (5) has a very small pool of people from which to recruit researchers, etc. SI has lots of experience with these issues; experience that probably takes a long time and lots of money to acquire.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;The case for funding improvements and growth at SI (as opposed to starving SI as Holden suggests) is bolstered by the fact that &lt;a href=&quot;/lw/cbs/thoughts_on_the_singularity_institute_si/6jzn&quot;&gt;SI's productivity and effectiveness have been improving rapidly&lt;/a&gt; of late, and many other improvements (and &lt;a href=&quot;/lw/cs6/how_to_purchase_ai_risk_reduction/&quot;&gt;exciting projects&lt;/a&gt;) are on our &quot;to-do&quot; list if we can raise sufficient funding to implement them.&lt;/p&gt;
&lt;p&gt;Holden even seems to share some of this optimism:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Luke's... recognition of the problems I raise... increases my estimate of the likelihood that SI will work to address them...&lt;/p&gt;
&lt;p&gt;I'm aware that SI has relatively new leadership that is attempting to address the issues behind some of my complaints. I have a generally positive impression of the new leadership; I believe the Executive Director and Development Director, in particular, to represent a step forward in terms of being interested in transparency and in testing their own general rationality. So I will not be surprised if there is some improvement in the coming years...&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h4 id=&quot;conclusion&quot;&gt;&lt;br&gt;&lt;/h4&gt;
&lt;h4&gt;Conclusion&lt;/h4&gt;
&lt;p&gt;For brevity's sake I have skipped many important details. I may also have misinterpreted Holden somewhere. And surely, Holden and other readers have follow-up questions and objections. This is not the end of the conversation; it is closer to the beginning. I invite you to leave your comments, preferably in accordance with &lt;a href=&quot;#comments&quot;&gt;these guidelines&lt;/a&gt; (for improved discussion clarity).&lt;/p&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/lw/di4/reply_to_holden_on_the_singularity_institute/#comments"&gt;211 comments&lt;/a&gt;
</description>
</item>
<item>
<title>Reply to Holden on 'Tool AI'</title>
<link>http://lesswrong.com/lw/cze/reply_to_holden_on_tool_ai/</link>
<guid isPermaLink="true">http://lesswrong.com/lw/cze/reply_to_holden_on_tool_ai/</guid>
<pubDate>Wed, 13 Jun 2012 04:00:51 +1000</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/Eliezer_Yudkowsky"&gt;Eliezer_Yudkowsky&lt;/a&gt;
&amp;bull;
91 votes
&amp;bull;
&lt;a href="http://lesswrong.com/lw/cze/reply_to_holden_on_tool_ai/#comments"&gt;320 comments&lt;/a&gt;
&lt;div&gt;&lt;p&gt;I begin by thanking&amp;#xA0;Holden Karnofsky of &lt;a href=&quot;http://www.givewell.org/&quot;&gt;Givewell&lt;/a&gt; for his rare gift of his detailed, engaged, and helpfully-meant critical article &lt;a href=&quot;/lw/cbs/thoughts_on_the_singularity_institute_si/&quot;&gt;Thoughts on the Singularity Institute (SI)&lt;/a&gt;. In this reply I will engage with only one of the &lt;em&gt;many&lt;/em&gt; subjects raised therein, the topic of, as I would term them, non-self-modifying planning Oracles, a.k.a. 'Google Maps AGI' a.k.a. 'tool AI', this being the topic that requires me personally to answer. &amp;#xA0;I hope that my reply will be accepted as addressing the most important central points, though I did not have time to explore every avenue. &amp;#xA0;I certainly do not wish to be logically rude, and if I have failed, please remember with compassion that it's not always obvious to one person what another person will think was the central point.&lt;/p&gt;
&lt;p&gt;Luke Mueulhauser and Carl Shulman contributed to this article, but the final edit was my own, likewise any flaws.&lt;/p&gt;
&lt;h3&gt;Summary:&lt;/h3&gt;
&lt;p&gt;Holden's concern is that &quot;SI appears to neglect the potentially important distinction between 'tool' and 'agent' AI.&quot; His archetypal example is &lt;a href=&quot;https://maps.google.com/&quot;&gt;Google Maps&lt;/a&gt;:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Google Maps is not an &lt;em&gt;agent&lt;/em&gt;, taking actions in order to maximize a utility parameter. It is a &lt;em&gt;tool&lt;/em&gt;, generating information and then displaying it in a user-friendly manner for me to consider, use and export or discard as I wish.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;The reply breaks down into four heavily interrelated points:&lt;/p&gt;
&lt;p&gt;First, Holden seems to think (and Jaan Tallinn doesn't apparently object to, in their exchange) that if a non-self-modifying planning Oracle is indeed the best strategy, then all of SIAI's past and intended future work is wasted. &amp;#xA0;To me it looks like there's a huge amount of overlap in underlying processes in the AI that would have to be built and the insights required to build it, and I would be trying to assemble mostly - though not quite exactly - the same kind of&amp;#xA0;&lt;em&gt;team &lt;/em&gt;if I was trying to build a non-self-modifying planning Oracle, with the same initial mix of talents and skills.&lt;/p&gt;
&lt;p&gt;Second, a non-self-modifying planning Oracle doesn't sound nearly as safe once you stop saying human-English phrases like &quot;describe the consequences of an action to the user&quot; and start trying to come up with math that says scary dangerous things like (he translated into English) &quot;increase the correspondence between the user's belief about relevant consequences and reality&quot;. &amp;#xA0;Hence&amp;#xA0;why&amp;#xA0;the people on the team would have to solve the same sorts of problems.&lt;/p&gt;
&lt;p&gt;Appreciating the force of the third point is a lot easier if one appreciates the difficulties discussed in points 1 and 2, but is actually empirically verifiable independently: &amp;#xA0;Whether or not a non-self-modifying planning Oracle is the &lt;em&gt;best&lt;/em&gt; solution in the end, it's not such an &lt;em&gt;obvious&lt;/em&gt;&amp;#xA0;privileged-point-in-solution-space that someone should be alarmed at SIAI not discussing it. &amp;#xA0;This is empirically verifiable in the sense that 'tool AI' wasn't the obvious solution to e.g. John McCarthy, Marvin Minsky, I. J. Good, Peter Norvig, Vernor Vinge, or for that matter Isaac Asimov. &amp;#xA0;At one point, Holden says:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;One of the things that bothers me most about SI is that there is practically no public content, as far as I can tell, explicitly addressing the idea of a &quot;tool&quot; and giving arguments for why AGI is likely to work only as an &quot;agent.&quot;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;If I take literally that this is one of the things that bothers Holden &lt;em&gt;most...&lt;/em&gt;&amp;#xA0;I think I'd start stacking&amp;#xA0;up some of the literature on the number of different things that &lt;em&gt;just respectable academics&lt;/em&gt;&amp;#xA0;have suggested as the &lt;em&gt;obvious solution&lt;/em&gt;&amp;#xA0;to what-to-do-about-AI - none of which would be about non-self-modifying smarter-than-human planning Oracles - and beg him to have some compassion on us for what we &lt;em&gt;haven't addressed yet&lt;/em&gt;. &amp;#xA0;It might be the right suggestion, but it's not so obviously right that our failure to prioritize discussing it reflects negligence.&lt;/p&gt;
&lt;p&gt;The final point at the end is looking over all the preceding discussion and realizing that, yes, you want to have people specializing in Friendly AI who know this stuff, but as all that preceding discussion is actually the following discussion at this point, I shall reserve it for later.&lt;a id=&quot;more&quot;&gt;&lt;/a&gt;&lt;/p&gt;
&lt;h3&gt;1. &amp;#xA0;The math of optimization, and the similar parts of a planning Oracle.&lt;/h3&gt;
&lt;p&gt;What does it take to build a smarter-than-human intelligence, of whatever sort, and have it go well?&lt;/p&gt;
&lt;p&gt;A &quot;Friendly AI programmer&quot; is somebody who specializes in seeing the correspondence of mathematical structures to What Happens in the Real World. It's somebody who looks at Hutter's specification of &lt;a href=&quot;http://www.amazon.com/Universal-Artificial-Intelligence-Algorithmic-Probability/dp/3642060528/&quot;&gt;AIXI&lt;/a&gt; and reads the actual equations - actually stares at the Greek symbols and not just the accompanying English text -&amp;#xA0;and sees, &quot;Oh, this AI will try to gain control of its reward channel,&quot; as well as numerous subtler issues like, &quot;This AI presumes a Cartesian boundary separating itself from the environment; it may drop an anvil on its own head.&quot; Similarly, working on &lt;a href=&quot;http://wiki.lesswrong.com/wiki/Timeless_decision_theory&quot;&gt;TDT&lt;/a&gt; means e.g. looking at a mathematical specification of decision theory, and seeing &quot;Oh, this is vulnerable to blackmail&quot; and coming up with a mathematical counter-specification of an AI that isn't so vulnerable to blackmail.&lt;/p&gt;
&lt;p&gt;Holden's post seems to imply that if you're building a non-self-modifying planning Oracle (aka 'tool AI') rather than an acting-in-the-world agent, you don't need a Friendly AI programmer because FAI programmers only work on agents. But this isn't how the engineering skills are split up. Inside the AI, whether an agent AI or a planning Oracle, there would be similar AGI-challenges like &quot;build a predictive model of the world&quot;, and similar FAI-conjugates of those challenges like finding the 'user' inside an AI-created model of the universe. &amp;#xA0;The insides would look a lot more similar than the outsides. &amp;#xA0;An analogy would be supposing that a machine learning professional who does sales optimization for an orange company couldn't possibly do sales optimization for a banana company, because their skills must be about oranges rather than bananas.&lt;/p&gt;
&lt;p&gt;Admittedly, if it turns out to be possible to use a human understanding of cognitive algorithms to build and run a smarter-than-human Oracle without it being self-improving - this seems unlikely, but not impossible - then you wouldn't have to solve problems that arise with self-modification. &amp;#xA0;But this eliminates only one dimension of the work. &amp;#xA0;And on an even more meta level, it seems like you would call upon almost identical &lt;em&gt;talents and skills&lt;/em&gt;&amp;#xA0;to come up with whatever insights were required&amp;#xA0;- though if it were predictable in advance that we'd abjure self-modification, then, yes, we'd place less emphasis on e.g. finding a team member with past experience in reflective math, and wouldn't waste (additional) time specializing in reflection. &amp;#xA0;But if you wanted math inside the planning Oracle that &lt;em&gt;operated the way you thought it did&lt;/em&gt;, and you wanted somebody who &lt;em&gt;understood what could possibly go wrong&lt;/em&gt;&amp;#xA0;and how to avoid it, you would need to make a function call to the same sort of talents and skills&amp;#xA0;to build an agent AI, or an Oracle that &lt;em&gt;was&lt;/em&gt;&amp;#xA0;self-modifying, etc.&lt;/p&gt;
&lt;h3&gt;2. &amp;#xA0;Yes, planning Oracles have hidden gotchas too.&lt;/h3&gt;
&lt;p&gt;&quot;Tool AI&quot; may sound simple in English, a short sentence in the language of empathically-modeled agents &amp;#x2014; it's just &quot;a thingy that shows you plans instead of a thingy that goes and does things.&quot; If you want to know whether this hypothetical entity does X, you just check whether the outcome of X sounds like &quot;showing someone a plan&quot; or &quot;going and doing things&quot;, and you've got your answer. &amp;#xA0;It starts sounding much scarier once you try to say something more formal and internally-causal like &quot;Model the user and the universe, predict the degree of correspondence between the user's model and the universe, and select from among possible explanation-actions on this basis.&quot;&lt;/p&gt;
&lt;p&gt;Holden, in &lt;a href=&quot;http://commonsenseatheism.com/wp-content/uploads/2012/05/Tallinn-Karnofsky-2011.pdf&quot;&gt;his dialogue with Jaan Tallinn&lt;/a&gt;, writes out this attempt at formalizing:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Here's how I picture the Google Maps AGI ...&lt;/p&gt;
&lt;p style=&quot;padding-left: 30px; &quot;&gt;utility_function = construct_utility_function(process_user_input());&lt;/p&gt;
&lt;p style=&quot;padding-left: 30px; &quot;&gt;foreach $action in $all_possible_actions {&lt;/p&gt;
&lt;p style=&quot;padding-left: 60px; &quot;&gt;$action_outcome = prediction_function($action,$data);&lt;/p&gt;
&lt;p style=&quot;padding-left: 60px; &quot;&gt;$utility = utility_function($action_outcome);&lt;/p&gt;
&lt;p style=&quot;padding-left: 60px; &quot;&gt;if ($utility &amp;gt; $leading_utility) { $leading_utility = $utility;&lt;/p&gt;
&lt;p style=&quot;padding-left: 60px; &quot;&gt;$leading_action = $action; }&lt;/p&gt;
&lt;p style=&quot;padding-left: 30px; &quot;&gt;}&lt;/p&gt;
&lt;p style=&quot;padding-left: 30px; &quot;&gt;report($leading_action);&lt;/p&gt;
&lt;p&gt;construct_utility_function(process_user_input()) is just a human-quality function for understanding what the speaker wants. prediction_function is an implementation of a human-quality data-&amp;gt;prediction function in superior hardware. $data is fixed (it's a dataset larger than any human can process); same with $all_possible_actions. report($leading_action) calls a Google Maps-like interface for understanding the consequences of $leading_action; it basically breaks the action into component parts and displays predictions for different times and conditional on different parameters.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Google Maps doesn't check all possible routes. If I wanted to design Google Maps, I would start out by throwing out a standard planning technique on a connected graph where each edge has a cost function and there's a good heuristic measure of the distance, e.g. &lt;a href=&quot;http://en.wikipedia.org/wiki/A*_search_algorithm&quot;&gt;A* search&lt;/a&gt;. If that was too slow, I'd next try some more efficient version like weighted A* (or bidirectional weighted memory-bounded A*, which I expect I could also get off-the-shelf somewhere).&amp;#xA0;Once you introduce weighted A*, you no longer have a guarantee that you're selecting the optimal path. &amp;#xA0;You have a guarantee to within a known factor of the cost of the optimal path &amp;#x2014; but the actual path selected wouldn't be quite optimal. The suggestion produced would be an approximation whose exact steps depended on the exact algorithm you used. That's true even if you can predict the exact cost &amp;#x2014; exact utility &amp;#x2014; of any particular path you actually look at; and even if you have a heuristic that never overestimates the cost.&lt;/p&gt;
&lt;p&gt;The reason we don't have &lt;a href=&quot;http://en.wikipedia.org/wiki/Optimal_solutions_for_Rubik's_Cube&quot;&gt;God's Algorithm for solving the Rubik's Cube &lt;/a&gt; is that there's no perfect way of measuring the distance between any two Rubik's Cube positions &amp;#x2014; you can't look at two Rubik's cube positions, and figure out the minimum number of moves required to get from one to another. It took 15 years to prove that there was a position requiring at least 20 moves to solve, and then another 15 years to come up with a computer algorithm that could solve any position in at most 20 moves, but we still can't compute the actual, minimum solution to all Cubes (&quot;God's Algorithm&quot;). This, even though we can exactly calculate the cost and consequence of any actual Rubik's-solution-path we consider.&lt;/p&gt;
&lt;p&gt;When it comes to AGI &amp;#x2014; solving general cross-domain &quot;Figure out how to do X&quot; problems &amp;#x2014; you're not going to get anywhere near the one, true, optimal answer. You're going to &amp;#x2014; at best, if everything works right &amp;#x2014; get &lt;em&gt;a&lt;/em&gt; good answer that's a cross-product of the &quot;utility function&quot; and all the other algorithmic properties that determine what sort of answer the AI finds easy to invent (i.e. can be invented using bounded computing time).&lt;/p&gt;
&lt;p&gt;As for the notion that this AGI runs on a &quot;human predictive algorithm&quot; that we got off of neuroscience and then implemented using more computing power, without knowing how it works or being able to enhance it further: It took 30 years of multiple computer scientists doing basic math research, and inventing code, and running that code on a computer cluster, for them to come up with a 20-move solution to the Rubik's Cube. If a planning Oracle is going to produce better solutions than humanity has yet managed to the Rubik's Cube, it needs to be capable of doing original computer science research and writing its own code. You can't get a 20-move solution out of a human brain, using the native human planning algorithm. Humanity can do it, but only by exploiting the ability of humans to explicitly comprehend the deep structure of the domain (not just rely on intuition) and then inventing an artifact, a new design, running code which uses a different and superior cognitive algorithm, to solve that Rubik's Cube in 20 moves. We do all that without being &lt;em&gt;self&lt;/em&gt;-modifying, but it's still a capability to respect.&lt;/p&gt;
&lt;p&gt;And I'm not even going into what it would take for a planning Oracle to out-strategize any human, come up with a plan for persuading someone, solve original scientific problems by looking over experimental data (like Einstein did), design a nanomachine, and so on.&lt;/p&gt;
&lt;p&gt;Talking like there's this one simple &quot;predictive algorithm&quot; that we can read out of the brain using neuroscience and overpower to produce better plans... doesn't seem quite congruous with what humanity actually does to produce its predictions and plans.&lt;/p&gt;
&lt;p&gt;If we take the concept of the Google Maps AGI at face value, then it actually has four key magical components. &amp;#xA0;(In this case, &quot;magical&quot; isn't to be taken as prejudicial, it's a term of art that means we haven't said how the component works yet.) &amp;#xA0;There's a magical comprehension of the user's utility function, a magical world-model that GMAGI uses to comprehend the consequences of actions, a magical planning element that selects a &lt;em&gt;non-optimal&lt;/em&gt; path using some method &lt;em&gt;other&lt;/em&gt; than exploring all possible actions, and a magical explain-to-the-user function.&lt;/p&gt;
&lt;p&gt;report($leading_action) isn't exactly a trivial step either. Deep Blue tells you to move your pawn or you'll lose the game. You ask &quot;Why?&quot; and the answer is a gigantic search tree of billions of possible move-sequences, leafing at positions which are heuristically rated using a static-position evaluation algorithm trained on millions of games. Or the planning Oracle tells you that a certain DNA sequence will produce a protein that cures cancer, you ask &quot;Why?&quot;, and then humans aren't even capable of verifying, for themselves, the assertion that the peptide sequence will fold into the protein the planning Oracle says it does.&lt;/p&gt;
&lt;p&gt;&quot;So,&quot; you say, after the first dozen times you ask the Oracle a question and it returns an answer that you'd have to take on faith, &quot;we'll just specify in the utility function that the plan should be understandable.&quot;&lt;/p&gt;
&lt;p&gt;Whereupon other things start going wrong. Viliam_Bur, in the comments thread, gave this example, which I've slightly simplified:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Example question: &quot;How should I get rid of my disease most cheaply?&quot; Example answer: &quot;You won't. You will die soon, unavoidably. This report is 99.999% reliable&quot;. Predicted human reaction: Decides to kill self and get it over with. Success rate: 100%, the disease is gone. Costs of cure: zero. Mission completed.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Bur is trying to give an example of how things might go wrong if the preference function is over the accuracy of the predictions explained to the human&amp;#x2014; rather than &lt;em&gt;just&lt;/em&gt; the human's 'goodness' of the outcome. And if the preference function &lt;em&gt;was&lt;/em&gt; just over the human's 'goodness' of the end result, rather than the accuracy of the human's understanding of the predictions, the AI might tell you something that was predictively false but whose implementation would lead you to what the AI defines as a 'good' outcome. And if we ask how happy the human is, the resulting decision procedure would exert optimization pressure to convince the human to take drugs, and so on.&lt;/p&gt;
&lt;p&gt;I'm not saying any particular failure is 100% certain to occur; rather I'm trying to explain - as handicapped by the need to describe the AI in the native human agent-description language, using empathy to simulate a spirit-in-a-box instead of trying to think in mathematical structures like A* search or Bayesian updating - how, even so, one can still see that the issue is a tad more fraught than it sounds on an immediate examination.&lt;/p&gt;
&lt;p&gt;If you see the world just in terms of math, it's even worse; you've got some program with inputs from a USB cable connecting to a webcam, output to a computer monitor, and optimization criteria expressed over some combination of the monitor, the humans looking at the monitor, and the rest of the world. It's a whole lot easier to call what's inside a 'planning Oracle' or some other English phrase than to write a program that does the optimization safely without serious unintended consequences. Show me any attempted specification, and I'll point to the vague parts and ask for clarification in more formal and mathematical terms, and as soon as the design is clarified enough to be a hundred light years from implementation instead of a thousand light years, I'll show a neutral judge how that math would go wrong. (Experience shows that if you try to explain to would-be AGI designers how their design goes wrong, in most cases they just say &quot;Oh, but of course that's not what I meant.&quot; Marcus Hutter is a rare exception who specified his AGI in such unambiguous mathematical terms that he actually succeeded at realizing, after some discussion with SIAI personnel, that AIXI would kill off its users and seize control of its reward button. But based on past sad experience with many other would-be designers, I say &quot;Explain to a neutral judge how the math kills&quot; and not &quot;Explain to the person who invented that math and likes it.&quot;)&lt;/p&gt;
&lt;p&gt;Just as the gigantic gap between smart-sounding English instructions and actually smart algorithms is the main source of difficulty in AI, there's a gap between benevolent-sounding English and actually benevolent algorithms which is the source of difficulty in FAI. &amp;#xA0;&quot;Just make suggestions - don't&amp;#xA0;&lt;em&gt;do&lt;/em&gt;&amp;#xA0;anything!&quot; is, in the end, just more English.&lt;/p&gt;
&lt;h3&gt;3. &amp;#xA0;Why we haven't already discussed Holden's suggestion&lt;/h3&gt;
&lt;blockquote&gt;
&lt;p&gt;One of the things that bothers me most about SI is that there is practically no public content, as far as I can tell, explicitly addressing the idea of a &quot;tool&quot; and giving arguments for why AGI is likely to work only as an &quot;agent.&quot;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;The above&amp;#xA0;statement seems to lack perspective on how&amp;#xA0;&lt;em&gt;many&lt;/em&gt;&amp;#xA0;different things various people see as&amp;#xA0;&lt;em&gt;the one obvious solution&lt;/em&gt;&amp;#xA0;to Friendly AI. Tool AI wasn't the obvious solution to John McCarthy,&amp;#xA0;&lt;a href=&quot;http://commonsenseatheism.com/wp-content/uploads/2012/05/Good-Some-future-social-repurcussions-of-computers.pdf&quot;&gt;I.J. Good&lt;/a&gt;, or&amp;#xA0;&lt;a href=&quot;http://web.media.mit.edu/~minsky/papers/TrueNames.Afterword.html&quot;&gt;Marvin Minsky&lt;/a&gt;. Today's leading AI textbook,&amp;#xA0;&lt;em&gt;Artificial Intelligence: A Modern Approach&lt;/em&gt;&amp;#xA0;- where you can learn all about A* search, by the way - discusses Friendly AI and AI risk for 3.5 pages but doesn't mention tool AI as an obvious solution. For&amp;#xA0;&lt;a href=&quot;http://www.amazon.com/The-Singularity-Is-Near-Transcend/dp/0143037889/&quot;&gt;Ray Kurzweil&lt;/a&gt;, the obvious solution is merging humans and AIs. For&amp;#xA0;&lt;a href=&quot;http://www.idsia.ch/~juergen/&quot;&gt;Jurgen Schmidhuber&lt;/a&gt;, the obvious solution is AIs that value a certain complicated definition of complexity in their sensory inputs. Ben Goertzel, J. Storrs Hall, and Bill Hibbard, among others, have all written about how silly Singinst is to pursue Friendly AI when the solution is obviously X, for various different X. Among current leading people working on serious AGI programs labeled as such, neither&amp;#xA0;&lt;a href=&quot;http://en.wikipedia.org/wiki/Demis_Hassabis&quot;&gt;Demis Hassabis&lt;/a&gt;&amp;#xA0;(VC-funded to the tune of several million dollars) nor&amp;#xA0;&lt;a href=&quot;http://metacog.org/doc.html&quot;&gt;Moshe Looks&lt;/a&gt;&amp;#xA0;(head of AGI research at Google) nor&amp;#xA0;&lt;a href=&quot;http://metacog.org/doc.html&quot;&gt;Henry Markram&lt;/a&gt;&amp;#xA0;(Blue Brain at IBM) think that the obvious answer is Tool AI. Vernor Vinge, Isaac Asimov, and any number of other SF writers with technical backgrounds who spent serious time thinking about these issues didn't converge on that solution.&lt;/p&gt;
&lt;p&gt;Obviously I'm not saying that nobody should be allowed to propose solutions because someone else would propose a different solution. I have been known to advocate for particular developmental pathways for Friendly AI myself. But I haven't, for example, told Peter Norvig that deterministic self-modification is such an obvious solution to Friendly AI that I would mistrust his whole AI textbook if he didn't spend time discussing it.&lt;/p&gt;
&lt;p&gt;At one point in his conversation with Tallinn, Holden argues that AI will inevitably be developed along planning-Oracle lines, because making suggestions to humans is the natural course that most software takes. Searching for counterexamples instead of positive examples makes it clear that most lines of code don't do this. &amp;#xA0;Your computer, when it reallocates RAM, doesn't pop up a button asking you if it's okay to reallocate RAM in such-and-such a fashion. Your car doesn't pop up a suggestion when it wants to change the fuel mix or apply dynamic stability control. Factory robots don't operate as human-worn bracelets whose blinking lights suggest motion. High-frequency trading programs execute stock orders on a microsecond timescale. Software that does happen to interface with humans is selectively visible and salient to humans, especially the tiny part of the software that does the interfacing; but this is a special case of a general cost/benefit tradeoff which, more often than not, turns out to swing the other way, because human advice is either too costly or doesn't provide enough benefit. Modern AI programmers are generally more interested in e.g. pushing the technological envelope to allow self-driving cars than to &quot;just&quot; do Google Maps. Branches of AI that invoke human aid, like hybrid chess-playing algorithms designed to incorporate human advice, are a field of study; but they're the exception rather than the rule, and occur primarily where AIs can't yet do something humans do, e.g. humans acting as oracles for theorem-provers, where the humans suggest a route to a proof and the AI actually follows that route. This is another reason why planning Oracles were not a uniquely obvious solution to the various academic AI researchers, would-be AI-creators, SF writers, etcetera, listed above.&amp;#xA0;Again, regardless of whether a planning Oracle is actually the best solution, Holden seems to be empirically-demonstrably overestimating the degree to which other people will automatically have his preferred solution come up first in their search ordering.&lt;/p&gt;
&lt;h3&gt;4. &amp;#xA0;Why we should have full-time Friendly AI specialists just like we have trained professionals doing anything else mathy that somebody actually cares about getting right, like pricing interest-rate options or something&lt;/h3&gt;
&lt;p&gt;I hope that the preceding discussion has made, by example instead of mere argument, what's probably the most important point: If you want to have a sensible discussion about which AI designs are safer, there are specialized skills you can apply to that discussion, as built up over years of study and practice by someone who specializes in answering that sort of question.&lt;/p&gt;
&lt;p&gt;This isn't meant as an argument from authority. It's not meant as an attempt to say that only experts should be allowed to contribute to the conversation. But it is meant to say that there is (and ought to be) room in the world for Friendly AI specialists, just like there's room in the world for specialists on optimal philanthropy (e.g. Holden).&lt;/p&gt;
&lt;p&gt;The decision to build a non-self-modifying planning Oracle would be properly made by someone who: understood the risk gradient for self-modifying vs. non-self-modifying programs; understood the risk gradient for having the AI thinking about the thought processes of the human watcher and trying to come up with plans implementable by the human watcher in the service of locally absorbed utility functions, vs. trying to implement its own plans in the service of more globally descriptive utility functions; and who, above all, understood on a technical level what exactly gets &lt;em&gt;accomplished &lt;/em&gt;by having the plans routed through a human. I've given substantial previous thought to describing more precisely what happens &amp;#x2014; what is being gained, and how much is being gained &amp;#x2014; when a human &quot;approves a suggestion&quot; made by an AI. But that would be another a different topic, plus I haven't made too much progress on saying it precisely anyway.&lt;/p&gt;
&lt;p&gt;In the transcript of Holden's conversation with Jaan Tallinn, it looked like Tallinn didn't deny the assertion that Friendly AI skills would be inapplicable if we're building a Google Maps AGI. I would deny that assertion and emphasize that denial, because to me it seems that it is exactly Friendly AI programmers who would be able to tell you if the risk gradient for non-self-modification vs. self-modification, the risk gradient for routing plans through humans vs. acting as an agent, the risk gradient for requiring human approval vs. unapproved action, and the actual feasibility of directly constructing transhuman modeling-prediction-and-planning algorithms through directly design of sheerly better computations than are presently run by the human brain, had the right combination of properties to imply that you ought to go construct a non-self-modifying planning Oracle. Similarly if you wanted an AI that took a limited set of actions in the world with human approval, or if you wanted an AI that &quot;just answered questions instead of making plans&quot;.&lt;/p&gt;
&lt;p&gt;It is similarly implied that a &quot;philosophical AI&quot; might obsolete Friendly AI programmers. If we're talking about PAI that can start with a human's terrible decision theory and come up with a good decision theory, or PAI that can start from a human talking about bad metaethics and then construct a good metaethics... I don't want to say &quot;impossible&quot;, because, after all, that's just what human philosophers do. But we are not talking about a trivial invention here. Constructing a &quot;philosophical AI&quot; is a Holy Grail precisely because it's FAI-complete (just ask it &quot;What AI should we build?&quot;), and has been discussed (e.g. with and by Wei Dai) over the years on the old SL4 mailing list and the modern Less Wrong. But it's really not at all clear how you could write an algorithm which would knowably produce the correct answer to the entire puzzle of anthropic reasoning, without being in possession of that correct answer yourself (in the same way that we can have Deep Blue win chess games without knowing the exact moves, but understanding exactly what abstract work Deep Blue is doing to solve the problem).&lt;/p&gt;
&lt;p&gt;Holden's post presents a restrictive view of what &quot;Friendly AI&quot; people are supposed to learn and know &amp;#x2014; that it's about machine learning for optimizing orange sales but not apple sales, or about producing an &quot;agent&quot; that implements CEV &amp;#x2014; which is something of a straw view, much weaker than the view that a Friendly AI programmer takes of Friendly AI programming. What the human species needs from an x-risk perspective is experts on This Whole Damn Problem, who will acquire whatever skills are needed to that end. The Singularity Institute exists to host such people and enable their research&amp;#x2014;once we have enough funding to find and recruit them. &amp;#xA0;See also,&amp;#xA0;&lt;a href=&quot;/r/discussion/lw/cs6/how_to_purchase_ai_risk_reduction/&quot;&gt;How to Purchase AI Risk Reduction&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;I'm pretty sure Holden has met people who think that having a whole institute to rate the efficiency of charities is pointless overhead, especially people who think that their own charity-solution is too obviously good to have to contend with busybodies pretending to specialize in thinking about 'marginal utility'. &amp;#xA0;Which Holden knows about, I would guess, from being paid quite well to think about that economic details when he was a hedge fundie, and learning from books written by professional researchers before then; and the really key point is that people who haven't studied all that stuff don't even realize what they're missing by trying to wing it. &amp;#xA0;If you don't know, you don't know &lt;em&gt;what &lt;/em&gt;you don't know, or the cost of not knowing. &amp;#xA0;Is there a problem of figuring out who might know something you don't, if Holden insists that there's this strange new stuff called 'marginal utility' you ought to learn about? &amp;#xA0;Yes, there is. &amp;#xA0;But is someone who trusts their philanthropic dollars to be steered just by the warm fuzzies of their heart, doing something wrong? &amp;#xA0;Yes, they are. &amp;#xA0;It's one thing to say that SIAI isn't known-to-you to be doing it right - another thing still to say that SIAI is known-to-you to be doing it wrong - and then quite another thing entirely to say that there's no need for Friendly AI programmers &lt;em&gt;and you know it,&lt;/em&gt;&amp;#xA0;that anyone can see it without resorting to math or cracking a copy of AI: A Modern Approach. &amp;#xA0;I do wish that Holden would at least credit that the task SIAI is taking on contains at least as many gotchas, relative to the instinctive approach, as optimal philanthropy compared to instinctive philanthropy, and might likewise benefit from some full-time professionally specialized attention, just as our society creates trained professionals to handle any other problem that someone actually cares about getting right.&lt;/p&gt;
&lt;p&gt;On the other side of things, Holden says that &lt;em&gt;even if&lt;/em&gt; Friendly AI is proven and checked:&lt;/p&gt;
&lt;blockquote&gt;&quot;I believe that the probability of an unfavorable outcome - by which I mean an outcome essentially equivalent to what a UFAI would bring about - exceeds 90% in such a scenario.&quot;&lt;/blockquote&gt;
&lt;p&gt;It's nice that this appreciates that the problem is hard. &amp;#xA0;Associating all of the difficulty with agenty proposals and thinking that it goes away as soon as you invoke tooliness is, well, of this I've already spoken.&amp;#xA0;I'm not sure whether this irreducible-90%-doom assessment is based on a common straw version of FAI where all the work of the FAI programmer goes into &quot;proving&quot; something and doing this carefully checked proof which then - alas, poor Spock! - turns out to be no more relevant than proving that the underlying CPU does floating-point arithmetic correctly if the transistors work as stated. I've repeatedly said that the idea behind proving determinism of self-modification isn't that this guarantees safety, but that if you prove the self-modification stable the AI &lt;em&gt;might &lt;/em&gt;work, whereas if you try to get by with no proofs at all, doom is &lt;em&gt;guaranteed&lt;/em&gt;. My mind keeps turning up Ben Goertzel as the one who invented this caricature - &quot;Don't you understand, poor fool Eliezer, life is full of uncertainty, your attempt to flee from it by refuge in 'mathematical proof' is doomed&quot; - but I'm not sure he was actually the inventor. In any case, the burden of safety isn't carried just by the proof, it's carried mostly by proving the right thing. If Holden is assuming that we're just running away from the inherent uncertainty of life by taking refuge in mathematical proof, then, yes, 90% probability of doom is an understatement, the vast majority of plausible-on-first-glance goal criteria you can prove stable will also kill you.&lt;/p&gt;
&lt;p&gt;If Holden's assessment does take into account a great effort to select the right theorem to prove - and attempts to incorporate the difficult but finitely difficult feature of meta-level error-detection, as it appears in e.g. the CEV proposal - and he is still assessing 90% doom probability, then I must ask, &quot;What do you think you know and how do you think you know it?&quot; The complexity of the human mind is finite; there's only so many things we want or would-want. Why would someone claim to know that proving the right thing is beyond human ability, even if &quot;100 of the world's most intelligent and relevantly experienced people&quot; (Holden's terms) check it over? There's hidden complexity of wishes, but not infinite complexity of wishes or unlearnable complexity of wishes. There are deep and subtle gotchas but not an unending number of them. And if that &lt;em&gt;were&lt;/em&gt;&amp;#xA0;the setting of the hidden variables&amp;#xA0;- how would you end up knowing that with 90% probability in advance? I don't mean to wield my own ignorance as a sword or engage in motivated uncertainty - I hate it when people argue that if they don't know something, nobody else is allowed to know either - so please note that I'm also counterarguing from positive facts pointing the other way:&amp;#xA0;the human brain is complicated but not infinitely complicated, there are hundreds or thousands of cytoarchitecturally distinct brain areas but not trillions or googols. &amp;#xA0;If humanity had two hundred years to solve FAI using human-level intelligence&lt;em&gt;&amp;#xA0;and there was no penalty for guessing wrong&lt;/em&gt;&amp;#xA0;I would be pretty relaxed about the outcome. &amp;#xA0;If&amp;#xA0;Holden says there's 90% doom probability left over no matter what sane intelligent people do (all of which goes away if you just build Google Maps AGI, but leave that aside for now) I would ask him what he knows now, in advance, that all those sane intelligent people will miss. &amp;#xA0;I don't see how you could (well-justifiedly) access that epistemic state.&lt;/p&gt;
&lt;p&gt;I acknowledge that there are points in Holden's post which are not addressed in this reply, acknowledge that these points are also deserving of reply, and hope that other SIAI personnel will be able to reply to them.&lt;/p&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/lw/cze/reply_to_holden_on_tool_ai/#comments"&gt;320 comments&lt;/a&gt;
</description>
</item>
<item>
<title>Help Fund Lukeprog at SIAI</title>
<link>http://lesswrong.com/lw/78s/help_fund_lukeprog_at_siai/</link>
<guid isPermaLink="true">http://lesswrong.com/lw/78s/help_fund_lukeprog_at_siai/</guid>
<pubDate>Wed, 24 Aug 2011 17:16:45 +1000</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/Eliezer_Yudkowsky"&gt;Eliezer_Yudkowsky&lt;/a&gt;
&amp;bull;
40 votes
&amp;bull;
&lt;a href="http://lesswrong.com/lw/78s/help_fund_lukeprog_at_siai/#comments"&gt;276 comments&lt;/a&gt;
&lt;div&gt;&lt;p&gt;Singularity Institute desperately needs someone who is not me who can write cognitive-science-based material. Someone smart, energetic, able to speak to popular audiences, and with an excellent command of the science. If you&amp;#x2019;ve been reading Less Wrong for the last few months, you probably just thought the same thing I did: &amp;#x201C;SIAI should hire Lukeprog!&amp;#x201D; To support Luke Muelhauser becoming a full-time Singularity Institute employee, please &lt;a href=&quot;http://singinst.org/donate/&quot;&gt;donate&lt;/a&gt; and mention Luke (e.g. &amp;#x201C;Yay for Luke!&amp;#x201D;) in the check memo or the comment field of your donation - or if you donate by a method that doesn&amp;#x2019;t allow you to leave a comment, tell Louie Helm (louie.helm@singinst.org) your donation was to help fund Luke.&lt;/p&gt;
&lt;p&gt;Note that the &lt;a href=&quot;http://singinst.org/2011summerchallenge&quot;&gt;Summer Challenge&lt;/a&gt; that doubles all donations will run until August 31st. (We're currently at $31,000 of $125,000.)&lt;a id=&quot;more&quot;&gt;&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;During his stint as a Singularity Institute Visiting Fellow, Luke has already:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Co-organized and taught sessions for a well-received one-week &lt;a href=&quot;/lw/5ec/minicamp_on_rationality_awesomeness_and/&quot;&gt;Rationality Minicamp&lt;/a&gt;, and taught sessions for the nine-week Rationality Boot Camp.&lt;/li&gt;
&lt;li&gt;Written many &lt;a href=&quot;http://commonsenseatheism.com/?p=13954&quot;&gt;helpful and well-researched&lt;/a&gt; articles for Less Wrong on metaethics, rationality theory, and rationality practice, including the 20-page tutorial &lt;em&gt;&lt;a href=&quot;/lw/71x/a_crash_course_in_the_neuroscience_of_human/&quot;&gt;A Crash Course in the Neuroscience of Human Motivation&lt;/a&gt;&lt;/em&gt;.&lt;/li&gt;
&lt;li&gt;Written a new &lt;a href=&quot;http://singinst.org/singularityfaq&quot;&gt;Singularity FAQ&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;Published an intelligence explosion &lt;a href=&quot;http://intelligenceexplosion.com/&quot;&gt;website&lt;/a&gt; for academics.&lt;/li&gt;
&lt;li&gt;...and completed many smaller projects.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;As a full-time Singularity Institute employee, Luke could:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Author and co-author research papers and outreach papers, including &lt;ul&gt;
&lt;li&gt;A chapter already accepted to Springer&amp;#x2019;s &lt;em&gt;&lt;a href=&quot;http://singularityhypothesis.blogspot.com/&quot;&gt;The Singularity Hypothesis&lt;/a&gt;&lt;/em&gt; volume (co-authored with Louie Helm).&lt;/li&gt;
&lt;li&gt;A paper on existential risk and optimal philanthropy, co-authored with a Columbia University researcher.&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;Continue to write articles for Less Wrong on the theory and practice of rationality.&lt;/li&gt;
&lt;li&gt;Write a report that summarizes unsolved problems related to Friendly AI.&lt;/li&gt;
&lt;li&gt;Continue to develop his &lt;a href=&quot;http://wiki.lesswrong.com/wiki/No-Nonsense_Metaethics&quot;&gt;metaethics sequence&lt;/a&gt;, the conclusion of which will be a sort of &lt;a href=&quot;http://en.wikipedia.org/wiki/Timothy_Gowers#Polymath_Project&quot;&gt;Polymath Project&lt;/a&gt; for collaboratively solving open problems in metaethics relevant to FAI development.&lt;/li&gt;
&lt;li&gt;Teach courses on rationality and social effectiveness, as he has been doing for the Singularity Institute&amp;#x2019;s &lt;a href=&quot;/lw/5ec/minicamp_on_rationality_awesomeness_and/&quot;&gt;Rationality Minicamp&lt;/a&gt; and &lt;a href=&quot;/lw/4wm/rationality_boot_camp/&quot;&gt;Rationality Boot Camp&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;Produce introductory materials to help bridge &lt;a href=&quot;http://wiki.lesswrong.com/wiki/Inferential_distance&quot;&gt;inferential gaps&lt;/a&gt;, as he did with the &lt;a href=&quot;http://singinst.org/singularityfaq&quot;&gt;Singularity FAQ&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;Raise awareness of AI risk and the uses of rationality by giving talks at universities and technology companies, as he recently did at &lt;a href=&quot;http://halcyonmolecular.com/&quot;&gt;Halcyon Molecular&lt;/a&gt;.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;If you&amp;#x2019;d like to help us fund Luke Muehlhauser to do all that and probably more, please &lt;a href=&quot;http://singinst.org/donate/&quot;&gt;donate now&lt;/a&gt; and include the word &amp;#x201C;Luke&amp;#x201D; in the comment field. &amp;#xA0;And if you donate before August 31st, your donation will be doubled as part of the &lt;a href=&quot;http://singinst.org/2011summerchallenge&quot;&gt;2011 Summer Singularity Challenge&lt;/a&gt;.&lt;/p&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/lw/78s/help_fund_lukeprog_at_siai/#comments"&gt;276 comments&lt;/a&gt;
</description>
</item>
<item>
<title>The $125,000 Summer Singularity Challenge</title>
<link>http://lesswrong.com/lw/6w3/the_125000_summer_singularity_challenge/</link>
<guid isPermaLink="true">http://lesswrong.com/lw/6w3/the_125000_summer_singularity_challenge/</guid>
<pubDate>Sat, 30 Jul 2011 07:02:54 +1000</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/Kaj_Sotala"&gt;Kaj_Sotala&lt;/a&gt;
&amp;bull;
20 votes
&amp;bull;
&lt;a href="http://lesswrong.com/lw/6w3/the_125000_summer_singularity_challenge/#comments"&gt;256 comments&lt;/a&gt;
&lt;div&gt;&lt;p&gt;&lt;em&gt;From the &lt;a href=&quot;http://singinst.org/blog/&quot;&gt;SingInst blog&lt;/a&gt;:&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;Thanks to the generosity of several major donors&lt;sup&gt;&amp;#x2020;&lt;/sup&gt;, every donation to the Singularity Institute made now &lt;strong&gt;until August 31, 2011&lt;/strong&gt; will be &lt;a href=&quot;http://singinst.org/donate&quot;&gt;matched dollar-for-dollar&lt;/a&gt;, up to a total of $125,000.&lt;/p&gt;
&lt;p style=&quot;padding-left: 60px;&quot;&gt;&lt;a href=&quot;http://singinst.org/donate&quot;&gt;Donate now!&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;(Visit the &lt;a href=&quot;http://singinst.org/2011summerchallenge&quot;&gt;challenge page&lt;/a&gt; to see a progress bar.)&lt;/p&gt;
&lt;p&gt;Now is your chance to &lt;strong&gt;double your impact&lt;/strong&gt; while supporting the Singularity Institute and helping us raise up to $250,000 to help fund &lt;a href=&quot;http://singinst.org/research/summary/&quot;&gt;our research program&lt;/a&gt; and stage the upcoming &lt;strong&gt;&lt;a href=&quot;http://www.singularitysummit.com/&quot;&gt;Singularity Summit&lt;/a&gt;&lt;/strong&gt;&amp;#x2026; which you can &lt;strong&gt;&lt;a href=&quot;http://www.singularitysummit.com/registration/&quot;&gt;register for now&lt;/a&gt;&lt;/strong&gt;!&lt;/p&gt;
&lt;p&gt;&lt;sup&gt;&amp;#x2020;&lt;/sup&gt; $125,000 in backing for this challenge is being generously provided by Rob Zahra, &lt;a href=&quot;http://www.quixey.com/&quot;&gt;Quixey&lt;/a&gt;, Clippy, Luke Nosek, Edwin Evans, Rick Schwall, Brian Cartmell, Mike Blume, Jeff Bone, Johan Edstr&amp;#xF6;m, Zvi Mowshowitz, John Salvatier, Louie Helm, Kevin Fischer, Emil Gilliam, Rob and Oksana Brazell, Guy Srinivasan, John Chisholm, and John Ku.&lt;/p&gt;
&lt;hr&gt;
&lt;p&gt;2011 has been a &lt;strong&gt;huge year for Artificial Intelligence&lt;/strong&gt;. With the IBM computer Watson defeating two top &lt;em&gt;Jeopardy!&lt;/em&gt; champions in February, it&amp;#x2019;s clear that the field is making steady progress. Journalists like Torie Bosch of &lt;em&gt;Slate&lt;/em&gt; have &lt;a href=&quot;http://www.slate.com/id/2298318/&quot;&gt;argued&lt;/a&gt; that &lt;em&gt;&amp;#x201C;We need to move from robot-apocalypse jokes to serious discussions about the emerging technology.&amp;#x201D;&lt;/em&gt; We couldn&amp;#x2019;t agree more &amp;#x2014; in fact, the Singularity Institute has been thinking about how to create safe and ethical artificial intelligence since long before the Singularity landed on the &lt;a href=&quot;http://www.time.com/time/health/article/0,8599,2048138,00.html&quot;&gt;front cover&lt;/a&gt; of &lt;em&gt;TIME&lt;/em&gt; magazine.&lt;/p&gt;
&lt;p&gt;The last 1.5 years were our biggest ever. Since the beginning of 2010, we have:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Held our annual &lt;a href=&quot;http://www.singularitysummit.com/&quot;&gt;Singularity Summit&lt;/a&gt;, in San Francisco. Speakers included Ray Kurzweil, James Randi, Irene Pepperberg, and many others.&lt;/li&gt;
&lt;li&gt;Held the first &lt;a href=&quot;http://singinst.org/blog/2010/08/22/singularity-summit-australia-sept-7-11-12-melbourne/&quot;&gt;Singularity Summit Australia&lt;/a&gt; and &lt;a href=&quot;http://singinst.org/blog/2011/06/01/singularity-summit-slc-this-weekend-in-salt-lake-city/&quot;&gt;Singularity Summit Salt Lake City&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;Held a wildly successful &lt;a href=&quot;http://singinst.org/blog/2011/06/21/rationality-minicamp-a-success/&quot;&gt;Rationality Minicamp&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;Published &lt;a href=&quot;http://singinst.org/blog/2011/01/05/2010-singularity-institute-publications/&quot;&gt;seven research papers&lt;/a&gt;, including Yudkowsky&amp;#x2019;s much-awaited &amp;#x2018;&lt;a href=&quot;http://singinst.org/upload/TDT-v01o.pdf&quot;&gt;Timeless Decision Theory&lt;/a&gt;&amp;#x2018;.&lt;/li&gt;
&lt;li&gt;Helped philosopher David Chalmers write his seminal paper &amp;#x2018;&lt;a href=&quot;http://singinst.org/blog/2010/04/08/david-chalmers-on-singularity-intelligence-explosion/&quot;&gt;The Singularity: A Philosophical Analysis&lt;/a&gt;&amp;#x2018;, which has sparked broad discussion in academia, including an &lt;a href=&quot;http://fragments.consc.net/djc/2010/12/singularity-symposium.html&quot;&gt;entire issue&lt;/a&gt; of &lt;em&gt;Journal of Consciousness Studies&lt;/em&gt; and a &lt;a href=&quot;http://singularityhypothesis.blogspot.com/&quot;&gt;book&lt;/a&gt; from Springer devoted to responses to Chalmers&amp;#x2019; paper.&lt;/li&gt;
&lt;li&gt;Launched the &lt;a href=&quot;http://singinst.org/blog/2011/07/22/announcing-the-research-associates-program/&quot;&gt;Research Associates&lt;/a&gt; program. &lt;/li&gt;
&lt;li&gt;Brought MIT cosmologist &lt;a href=&quot;http://singinst.org/blog/2010/03/03/mit-professor-and-cosmologist-max-tegmark-joins-siai-advisory-board/&quot;&gt;Max Tegmark&lt;/a&gt; onto our advisory board, published our &lt;a href=&quot;http://singinst.org/singularityfaq&quot;&gt;Singularity FAQ&lt;/a&gt;, and much more.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;In the coming year, we plan to do the following:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Hold our annual &lt;a href=&quot;http://www.singularitysummit.com/&quot;&gt;Singularity Summit&lt;/a&gt;, in New York City this year.&lt;/li&gt;
&lt;li&gt;Publish three chapters in the upcoming academic volume &lt;em&gt;&lt;a href=&quot;http://singularityhypothesis.blogspot.com/&quot;&gt;The Singularity Hypothesis&lt;/a&gt;&lt;/em&gt;, along with several other papers.&lt;/li&gt;
&lt;li&gt;Improve organizational transparency by creating a simpler, easier-to-use website that includes Singularity Institute planning and policy documents.&lt;/li&gt;
&lt;li&gt;Publish a document of open research problems related to Friendly AI, to clarify the research space and encourage other researchers to contribute to our mission.&lt;/li&gt;
&lt;li&gt;Add additional skilled researchers to our &lt;a href=&quot;http://singinst.org/aboutus/researchassociates&quot;&gt;Research Associates&lt;/a&gt; program.&lt;/li&gt;
&lt;li&gt;Publish well-researched documents making the case for existential risk reduction as optimal philanthropy.&lt;/li&gt;
&lt;li&gt;Diversify our funding sources by applying for targeted grants and advertising our &lt;a href=&quot;http://singinst.org/donate/#affinity&quot;&gt;affinity credit card program&lt;/a&gt;.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;We appreciate your support for our high-impact work. As PayPal co-founder and Singularity Institute donor Peter Thiel &lt;a href=&quot;http://singinst.org/media/interviews/peterthiel&quot;&gt;said&lt;/a&gt;:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&amp;#x201C;I&amp;#x2019;m interested in facilitating a forum in which there can be&amp;#x2026; substantive research on how to bring about a world in which AI will be friendly to humans rather than hostile&amp;#x2026; [The Singularity Institute represents] a combination of very talented people with the right problem space [they&amp;#x2019;re] going after&amp;#x2026; [They&amp;#x2019;ve] done a phenomenal job&amp;#x2026; on a shoestring budget. From my perspective, the key question is always: What&amp;#x2019;s the amount of leverage you get as an investor? Where can a small amount make a big difference? This is a very leveraged kind of philanthropy.&amp;#x201D;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;&lt;a href=&quot;http://singinst.org/donate&quot;&gt;Donate now&lt;/a&gt;, and seize a better than usual chance to move our work forward. Credit card transactions are securely processed through Causes.com, Google Checkout, or PayPal. If you have questions about donating, please call Amy Willey at (586) 381-1801.&lt;/p&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/lw/6w3/the_125000_summer_singularity_challenge/#comments"&gt;256 comments&lt;/a&gt;
</description>
</item>
<item>
<title>SIAI - An Examination</title>
<link>http://lesswrong.com/lw/5il/siai_an_examination/</link>
<guid isPermaLink="true">http://lesswrong.com/lw/5il/siai_an_examination/</guid>
<pubDate>Mon, 02 May 2011 17:08:30 +1000</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/BrandonReinhart"&gt;BrandonReinhart&lt;/a&gt;
&amp;bull;
141 votes
&amp;bull;
&lt;a href="http://lesswrong.com/lw/5il/siai_an_examination/#comments"&gt;203 comments&lt;/a&gt;
&lt;div&gt;&lt;p&gt;&lt;em&gt;12/13/2011 - A 2011 update with data from the 2010 fiscal year is in progress. Should be done by the end of the week or sooner.&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;p&gt;&lt;span style=&quot;font-size: 16px; font-weight: bold;&quot;&gt;Disclaimer&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;I am not affiliated with the &lt;a href=&quot;http://singinst.org/&quot;&gt;Singularity Institute for Artificial Intelligence&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;I have not donated to the SIAI prior to writing this.&lt;/li&gt;
&lt;li&gt;&lt;span class=&quot;c3&quot;&gt;I made &lt;/span&gt;&lt;span class=&quot;c5&quot;&gt;&lt;a href=&quot;/lw/5ec/minicamp_on_rationality_awesomeness_and/40jo&quot;&gt;this pledge&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt; prior to writing this document.&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;h2&gt;Notes&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Images are now hosted on LessWrong.com.&lt;/li&gt;
&lt;li&gt;The 2010 Form 990 data will be available later this month.&lt;/li&gt;
&lt;li&gt;It is not my intent to&amp;#xA0;propagate&amp;#xA0;misinformation. Errors will be corrected as soon as they are identified.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 class=&quot;c23 c1&quot;&gt;&lt;span class=&quot;c13 c11&quot;&gt;Introduction&lt;/span&gt;&lt;/h2&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;Acting on &lt;/span&gt;&lt;span class=&quot;c5&quot;&gt;&lt;a href=&quot;/user/gwern/&quot;&gt;gwern&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt;'s suggestion in his &lt;/span&gt;&lt;span class=&quot;c5&quot;&gt;&lt;a href=&quot;/r/discussion/lw/5el/are_girl_scout_cookies_deliciously_evil_a_case/&quot;&gt;Girl Scout Cookie&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt; analysis, I decided to look at SIAI funding. After reading about the Visiting Fellows Program and more recently the &lt;a href=&quot;/lw/4wm/rationality_boot_camp/&quot;&gt;Rationality Boot Camp&lt;/a&gt;, I decided that the SIAI might be something I would want to support. I am concerned with &lt;a href=&quot;http://wiki.lesswrong.com/wiki/Existential_risk&quot;&gt;existential risk&lt;/a&gt; and grapple with the utility implications. I feel that I should do more.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;I wrote on the mini-boot camp page a pledge that I would donate enough to send someone to rationality mini-boot camp. This seemed to me a small cost for the potential benefit. The SIAI might get better at building rationalists. It might build a rationalist who goes on to solve a problem. Should I donate more? I wasn&amp;#x2019;t sure. I read gwern&amp;#x2019;s article and realized that I could easily get more information to clarify my thinking.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;So I downloaded the SIAI&amp;#x2019;s Form 990 annual IRS filings and started to write down notes in a spreadsheet. As I gathered data and compared it to my expectations and my goals, my beliefs changed. I now believe that donating to the SIAI is valuable. I cannot hide this belief in my writing. I simply have it.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;My goal is not to convince you to donate to the SIAI. My goal is to provide you with information necessary for you to determine for yourself whether or not you should donate to the SIAI. Or, if not that, to provide you with some direction so that you can continue your investigation.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;&lt;a id=&quot;more&quot;&gt;&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;The SIAI's Form 990's are available at &lt;/span&gt;&lt;span class=&quot;c5&quot;&gt;&lt;a href=&quot;http://www2.guidestar.org/&quot;&gt;GuideStar&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt; and &lt;/span&gt;&lt;span class=&quot;c5&quot;&gt;&lt;a href=&quot;http://foundationcenter.org/&quot;&gt;Foundation Center&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt;. You must register in order to access the files at GuideStar.&lt;/span&gt;&lt;/p&gt;
&lt;ol class=&quot;c14&quot;&gt;
&lt;li class=&quot;c12 c1 c6&quot;&gt;&lt;span class=&quot;c5&quot;&gt;&lt;a href=&quot;http://bit.ly/dZZkc0&quot;&gt;2002&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt; (Form 990-EZ)&lt;/span&gt;&lt;/li&gt;
&lt;li class=&quot;c12 c1 c6&quot;&gt;&lt;span class=&quot;c5&quot;&gt;&lt;a href=&quot;http://bit.ly/eOgI0B&quot;&gt;2003&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt; (Form 990-EZ)&lt;/span&gt;&lt;/li&gt;
&lt;li class=&quot;c1 c6 c12&quot;&gt;&lt;span class=&quot;c5&quot;&gt;&lt;a href=&quot;http://bit.ly/efHrQC&quot;&gt;2004&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt; (Form 990-EZ)&lt;/span&gt;&lt;/li&gt;
&lt;li class=&quot;c12 c1 c6&quot;&gt;&lt;span class=&quot;c5&quot;&gt;&lt;a href=&quot;http://bit.ly/f3Ekvg&quot;&gt;2005&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt; (Form 990)&lt;/span&gt;&lt;/li&gt;
&lt;li class=&quot;c12 c1 c6&quot;&gt;&lt;span class=&quot;c5&quot;&gt;&lt;a href=&quot;http://bit.ly/gRC7SS&quot;&gt;2006&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt; (Form 990)&lt;/span&gt;&lt;/li&gt;
&lt;li class=&quot;c12 c1 c6&quot;&gt;&lt;span class=&quot;c5&quot;&gt;&lt;a href=&quot;http://www.guidestar.org/FinDocuments//2007/582/565/2007-582565917-0487362e-9.pdf&quot;&gt;2007&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt; (Form 990)&lt;/span&gt;&lt;/li&gt;
&lt;li class=&quot;c12 c1 c6&quot;&gt;&lt;span class=&quot;c5&quot;&gt;&lt;a href=&quot;http://www.guidestar.org/FinDocuments//2008/582/565/2008-582565917-0599169c-Z.pdf&quot;&gt;2008&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt; (Form 990-EZ)&lt;/span&gt;&lt;/li&gt;
&lt;li class=&quot;c12 c1 c6&quot;&gt;&lt;span class=&quot;c5&quot;&gt;&lt;a href=&quot;http://www.guidestar.org/FinDocuments//2009/582/565/2009-582565917-06b644eb-9.pdf&quot;&gt;2009&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt; (Form 990)&lt;/span&gt;&lt;/li&gt;
&lt;/ol&gt;
&lt;h2&gt;&lt;span class=&quot;c13 c11&quot;&gt;SIAI Financial Overview&lt;/span&gt;&lt;/h2&gt;
&lt;p&gt;&lt;span class=&quot;c13 c11&quot;&gt;The &lt;a href=&quot;http://singinst.org&quot;&gt;Singularity Institute for Artificial Intelligence&lt;/a&gt; (SIAI) is a public organization working to reduce &lt;a href=&quot;http://singinst.org/riskintro/index.html&quot;&gt;existential risk from future technologies&lt;/a&gt;, in particular artificial intelligence. &quot;The Singularity Institute brings rational analysis and rational strategy to the challenges facing humanity as we develop cognitive technologies that will exceed the current upper bounds on human intelligence.&quot; The SIAI are also the founders of Less Wrong.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;img src=&quot;http://images.lesswrong.com/t3_5il_0.png&quot; alt=&quot;&quot; height=&quot;211&quot; width=&quot;591&quot;&gt;&lt;/p&gt;
&lt;p&gt;&lt;img src=&quot;http://images.lesswrong.com/t3_5il_1.png?v=a0ef8195b3866f478702a67e2238c3bb&quot; alt=&quot;&quot; height=&quot;194&quot; width=&quot;296&quot;&gt;&lt;img src=&quot;http://images.lesswrong.com/t3_5il_2.png?v=27393a444aa0942ee9c2ebb93acc3a5e&quot; alt=&quot;&quot; height=&quot;194&quot; width=&quot;296&quot;&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;The graphs above offer an accurate summary of SIAI financial state since 2002. Sometimes the end of year balances listed in the Form 990 doesn&amp;#x2019;t match what you&amp;#x2019;d get if you did the math by hand. These are noted as discrepancies between the filed year end balance and the expected year end balance or between the filed year start balance and the expected year start balance.&lt;/span&gt;&lt;/p&gt;
&lt;ol class=&quot;c14&quot;&gt;
&lt;li class=&quot;c12 c1 c6&quot;&gt;&lt;span class=&quot;c3 c11&quot;&gt;&lt;strong&gt;Filing Error 1&lt;/strong&gt;&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt; - There appears to be a minor typo to the effect of $4.86 in the end of year balance for the 2004 document. It appears that Part I, Line 18 has been summed incorrectly. $32,445.76 is listed, but the expected result is $32,450.41. The Part II balance sheet calculations which agree with the error so the source of the error is unclear. The start of year balance in 2005 reflects the expected value so this was probably just a typo in 2004. The following year&amp;#x2019;s reported start of year balance does not contain the error.&lt;/span&gt;&lt;/li&gt;
&lt;li class=&quot;c12 c1 c6&quot;&gt;&lt;span class=&quot;c3 c11&quot;&gt;&lt;strong&gt;Filing Error 2&lt;/strong&gt;&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt; - The 2006 document reports a year start balance of $95,105.00 when the expected year start balance is $165,284.00, a discrepancy of $70,179.00. This amount is close to the estimated Program Service Accomplishments in 2005 Form 990 Part III Line F of $72,000.00. Looks like the service expenses were not included completely in Part II. The money is not missing: future forms show expected values moving forward.&lt;/span&gt;&lt;/li&gt;
&lt;li class=&quot;c12 c1 c6&quot;&gt;&lt;span class=&quot;c3 c11&quot;&gt;&lt;strong&gt;Theft&lt;/strong&gt; - &lt;/span&gt;&lt;span class=&quot;c3&quot;&gt;The organization reported $118,803.00 in theft in 2009 resulting in a year end asset balance lower than expected. &lt;/span&gt;&lt;span class=&quot;c3 c10&quot;&gt;The SIAI is currently pursuing legal restitution.&lt;/span&gt;&lt;/li&gt;
&lt;/ol&gt;
&lt;p class=&quot;c1 c6&quot;&gt;&lt;span class=&quot;c3&quot;&gt;The SIAI has generated a revenue surplus every year except 2008. The 2008 deficit appears to be a cashing out of excess surplus from 2007. Asset growth indicates that the SIAI is good at utilizing the funds it has available, without overspending. The organization is expanding it&amp;#x2019;s menu of services, but not so fast that it risks going broke.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1 c6&quot;&gt;Nonetheless, current asset balance is insufficient to sustain a year of operation at existing rate of expenditure. Significant loss of revenue from donations would result in a shrinkage of services. Such a loss of revenue may be unlikely, but a reasonable goal for the organization would be to build up a year's reserves.&lt;/p&gt;
&lt;p class=&quot;c1 c6&quot;&gt;&lt;span style=&quot;font-size: 15px; font-weight: bold;&quot;&gt;Revenue&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1 c6&quot;&gt;&lt;span style=&quot;font-size: 15px;&quot;&gt;&lt;strong&gt;&lt;/strong&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;Revenue is composed of public support, program service (events/conferences held, etc), and investment interest. The &quot;Other&quot; category tends to include Amazon.com affiliate income, etc.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1 c9&quot;&gt;&lt;img src=&quot;http://images.lesswrong.com/t3_5il_3.png?v=99acbb583f561c10b2f7f164f073d783&quot; alt=&quot;&quot; height=&quot;259&quot; width=&quot;608&quot;&gt;&lt;/p&gt;
&lt;p class=&quot;c1 c6&quot;&gt;&lt;img src=&quot;http://images.lesswrong.com/t3_5il_4.png?v=7f3586b662dd9967ceeeca02f0b05519&quot; alt=&quot;&quot; height=&quot;187&quot; align=&quot;left&quot; width=&quot;339&quot;&gt;&lt;/p&gt;
&lt;p class=&quot;c1 c6&quot;&gt;&lt;span class=&quot;c3&quot;&gt;Income from public support has grown steadily with a notable regular increase starting in 2006. This increase is a result of new contributions from big donors. As an example, public support in 2007 is largely composed of significant contributions from Peter Thiel ($125k), Brian Cartmell ($75k), and Robert F. Zahra Jr ($123k) for $323k total in large scale individual contributions (break down below).&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1 c6&quot;&gt;&lt;span class=&quot;c3&quot;&gt;In 2007 the SIAI started receiving income from program services. Currently all &quot;Program Service&quot; revenue is from operation of the Singularity Summit. In 2010 the summit generated surplus revenue for the SIAI. This is a significant achievement, as it means the organization has created a sustainable service that could fund further services moving forward.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1 c6&quot;&gt;&lt;span class=&quot;c3&quot;&gt;A specific analysis of the summit is below.&lt;/span&gt;&lt;/p&gt;
&lt;h2&gt;&lt;span style=&quot;font-size: 15px; font-weight: bold;&quot;&gt;Expenses&lt;/span&gt;&lt;/h2&gt;
&lt;p class=&quot;c1 c6&quot;&gt;&lt;span style=&quot;font-size: 15px;&quot;&gt;&lt;strong&gt;&lt;/strong&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;Expenses are composed of grants paid to winners, benefits paid to members, officer compensation, contracts, travel, program services, and an other category.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;The contracts column in the chart below includes legal and accounting fees. The other column includes administrative fees and other operational costs. I didn&amp;#x2019;t see reason to break the columns down further. In many cases the Form 990s provide more detailed itemization. If you care about how much officers spent on gas or when they bought new computers you might find the answers in the source.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;img src=&quot;http://images.lesswrong.com/t3_5il_5.png?v=d7dbe8ccc715a746bd237e4aca186d18&quot; alt=&quot;&quot; height=&quot;211&quot; width=&quot;602&quot;&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;I don&amp;#x2019;t have data for 2000 or 2001, but left the rows in the spreadsheet in case it can be filled in later.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;img src=&quot;http://images.lesswrong.com/t3_5il_6.png?v=40dc2b53beef6b4629617169a5c9fac1&quot; alt=&quot;&quot; height=&quot;202&quot; align=&quot;left&quot; width=&quot;350&quot;&gt;&lt;/p&gt;
&lt;p class=&quot;c1 c6&quot;&gt;&lt;span class=&quot;c3&quot;&gt;Program expenses have grown over the years, but not unreasonably. Indeed, officer compensation has declined steadily for several years. The grants in 2002, 2003, and 2004 were paid to Eliezer Yudkowsky for work relevant to Artificial Intelligence.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1 c6&quot;&gt;&lt;span class=&quot;c3&quot;&gt;The program expenses category includes operating the Singularity Summit, Visiting Fellows Program, etc. Some of the cost of these programs is also included in the other category. For example, the 2007 Singularity Summit is reported as costing $101,577.00 but this total amount is accounted for in multiple sections.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1 c6&quot;&gt;&lt;span class=&quot;c3&quot;&gt;It appears that 2009 was a more productive year than 2008 and also less expensive. 2009 saw a larger Singularity Summit than in 2008 and also the creation of the Visiting Fellows Program.&lt;/span&gt;&lt;/p&gt;
&lt;h2&gt;&lt;span style=&quot;font-size: 15px; font-weight: bold;&quot;&gt;Big Donors&lt;/span&gt;&lt;/h2&gt;
&lt;p class=&quot;c1 c6&quot;&gt;&lt;span style=&quot;font-size: 15px;&quot;&gt;&lt;strong&gt;&lt;/strong&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;img src=&quot;http://images.lesswrong.com/t3_5il_7.png?v=3b5b3af66c23a90359308a68b369f487&quot; alt=&quot;&quot; height=&quot;336&quot; width=&quot;592&quot;&gt;&lt;/p&gt;
&lt;p class=&quot;c1 c6&quot;&gt;&lt;span class=&quot;c3&quot;&gt;This is not an exhaustive list of contributions. The SIAI&amp;#x2019;s 2009 filing details major support donations for several previous years. Contributions in the 2010 column are derived from &lt;/span&gt;&lt;span class=&quot;c5&quot;&gt;&lt;a href=&quot;http://singinst.org/donors&quot;&gt;http://singinst.org/donors&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt;. &amp;#xA0;Known contributions of less than $5,000 are excluded for the sake of brevity. The 2006 donation from Peter Thiel is sourced from a discussion with the SIAI.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1 c6&quot;&gt;&lt;span class=&quot;c3&quot;&gt;Peter Thiel and several other big donors compose the bulk of the organization's revenue. It would be good to see a broader base of donations moving forward. Note, however, that the base of donations has been improving. I don't have the 2010 Form 990 yet, but it appears to be the best year yet in terms of both the quantity of donations and the number of individual donors (based on conversation with SIAI members).&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1 c6&quot;&gt;&lt;span style=&quot;font-size: 15px; font-weight: bold;&quot;&gt;Officer Compensation&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;img src=&quot;http://images.lesswrong.com/t3_5il_8.png?v=f9fe89c72d0fea7ec1a58f7035d93a9d&quot; alt=&quot;&quot; height=&quot;147&quot; width=&quot;594&quot;&gt;&lt;/p&gt;
&lt;p class=&quot;c1 c6&quot;&gt;&lt;img src=&quot;http://images.lesswrong.com/t3_5il_9.png?v=717b8a6e09d407d0a1e00a283657a54f&quot; alt=&quot;&quot; height=&quot;218&quot; align=&quot;left&quot; width=&quot;333&quot;&gt;&lt;/p&gt;
&lt;p class=&quot;c1 c6&quot;&gt;&lt;span class=&quot;c3&quot;&gt;In 2002 to 2005 Eliezer Yudkowsky received compensation in the form of grants from the SIAI for AI research. &lt;/span&gt;&lt;em&gt;&lt;span class=&quot;c3 c10&quot;&gt;It is noted in the Form 990s that &lt;/span&gt;&lt;span class=&quot;c3 c10 c11&quot;&gt;&lt;strong&gt;no public funds &lt;/strong&gt;&lt;/span&gt;&lt;/em&gt;&lt;span class=&quot;c3 c10&quot;&gt;&lt;em&gt;were used for Eliezer&amp;#x2019;s research grants as he is also an officer.&lt;/em&gt; &lt;/span&gt;&lt;span class=&quot;c3&quot;&gt;Starting in 2006 all compensation for key officers is reported as salaried instead of in the form of grants.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1 c6&quot;&gt;&lt;span class=&quot;c3&quot;&gt;Compensation spiked in 2006, the same year of greatly increased public support. Nonetheless, officer compensation has decreased steadily despite continued increases in public support. It appears that the SIAI has been managing it&amp;#x2019;s resources carefully in recent years, putting more money into programs than into officer compensation.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1 c6&quot;&gt;&lt;span class=&quot;c3&quot;&gt;Eliezer's base compensation as salary increased 20% in 2008. It seems reasonable to compare Eliezer's salary with that of professional software developers. Eliezer would be able to make a fair amount more working in private industry as a software developer.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1 c6&quot;&gt;&lt;span class=&quot;c3&quot;&gt;Mr. Yudkowsky clarifies: &quot;The reason my salary shows as $95K in 2009 is that Paychex screwed up and paid my first month of salary for 2010 in the 2009 tax year. My actual salary was, I believe, constant or roughly so through 2008-2010.&quot; In this case we would expect to see the 2010 Form 990 show a month reduced salary.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1 c6&quot;&gt;&lt;span class=&quot;c3&quot;&gt;Moving forward, the SIAI will have to grapple with the high cost of recruiting top tier programmers and academics to do real work. I believe this is an argument for the SIAI improving its asset sheet. More money in the bank means more of an ability to take advantage of recruitment opportunities if they present themselves.&lt;/span&gt;&lt;/p&gt;
&lt;h2&gt;&lt;span style=&quot;font-size: 15px; font-weight: bold;&quot;&gt;Singularity Summit&lt;/span&gt;&lt;/h2&gt;
&lt;p class=&quot;c1 c6&quot;&gt;&lt;span style=&quot;font-size: 15px;&quot;&gt;&lt;strong&gt;&lt;/strong&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;Founded in 2006 by the SIAI in cooperation with Ray Kurzweil and Peter Thiel, the Singularity Summit focuses on a broad number of topics related to the Singularity and emerging technologies. (1)&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1 c9&quot;&gt;&lt;img src=&quot;http://images.lesswrong.com/t3_5il_10.png?v=b0861f92f729c043979a7d8d7b9cc1f4&quot; alt=&quot;&quot; height=&quot;133&quot; width=&quot;577&quot;&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;The Singularity Summit was free until 2008 when the SIAI chose to begin charging registration fees and accepting sponsorships. (2)&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1 c9&quot;&gt;&lt;img src=&quot;http://images.lesswrong.com/t3_5il_11.png?v=b77e680872854187f7fb9985aec738f3&quot; alt=&quot;&quot; align=&quot;left&quot;&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;Attendee counts are estimates drawn from SIAI Form 990 filings. 2010 is purported to be the largest conference so far. Beyond the core conference attendees, hundreds of thousands of online viewers are reached through recordings of the Summit sessions. (A)&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;The cost of running the summit has increased annually, but revenue from sponsorships and registration have kept pace. The conference may have logistic and administrative costs, but it doesn't really impact the SIAI budget. This makes the conference a valuable blend of outreach and education. If the conference convinces someone to donate or in some way directly support work against existential risks, the benefits are effectively free (or at the very least come at no cost to other programs).&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span style=&quot;font-size: 14px; font-weight: bold;&quot;&gt;Is the Singularity Summit successful?&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;It&amp;#x2019;s difficult to evaluate the success of conferences. So many of the benefits are realized downstream of the actual event. Nonetheless, the attendee counts and widening exposure seem to bring immense value for the cost. Several factors contribute to a sense that the conference is a success:&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;In 2010 the Summit became a positive revenue generating exercise in its own right. With careful stewardship, the Singularity Summit could grow to generate a reliable annual revenue for the SIAI.&lt;/li&gt;
&lt;li&gt;The ability to run an efficient conference is itself valuable. Should it choose to, the SIAI could run other types of conferences or special interest events in the future with a good expectation of success.&lt;/li&gt;
&lt;li&gt;The high visibility of the Summit plants seeds for future fund raising. Conference attendees likely benefit as much or more from networking as they do from the content of the sessions. Networking builds relationships between people able to coordinate to solve problems or fund solutions to problems.&lt;/li&gt;
&lt;li&gt;&lt;span class=&quot;c3&quot;&gt;The Singularity Summit has generated ongoing public interest and media coverage. Notable articles can be found in&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt;&lt;a href=&quot;http://www.popsci.com/scitech/article/2009-10/singularity-summit-2009-singularity-near&quot;&gt; &lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c5&quot;&gt;&lt;a href=&quot;http://www.popsci.com/scitech/article/2009-10/singularity-summit-2009-singularity-near&quot;&gt;Popular Science&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt; (3),&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt;&lt;a href=&quot;http://www.popularmechanics.com/technology/engineering/robots/4332783&quot;&gt; &lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c5&quot;&gt;&lt;a href=&quot;http://www.popularmechanics.com/technology/engineering/robots/4332783&quot;&gt;Popular Mechanics&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt; (4), the&lt;/span&gt;&lt;span class=&quot;c5&quot;&gt;&lt;a href=&quot;http://www.guardian.co.uk/technology/2008/nov/06/artificialintelligenceai-engineering&quot;&gt; Guardian&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt; (5), and&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt;&lt;a href=&quot;http://www.time.com/time/health/article/0,8599,2048138-1,00.html&quot;&gt; &lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c5&quot;&gt;&lt;a href=&quot;http://www.time.com/time/health/article/0,8599,2048138-1,00.html&quot;&gt;TIME Magazine&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt; (6). Quality media coverage raises public awareness of Singularity related topics. There is a strong argument that a person with an interest in futurist or existential risk consciousness raising reaches a wide audience by supporting the Singularity Summit.&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;When discussing &amp;#x201C;future shock levels&amp;#x201D; -- gaps in exposure to and understanding of futurist concepts -- Eliezer Yudkowsky wrote, &amp;#x201C;In general, one shock level gets you enthusiasm, two gets you a strong reaction - wild enthusiasm or disbelief, three gets you frightened - not necessarily hostile, but frightened, and four can get you burned at the stake.&amp;#x201D; (7) Most futurists are familiar with this sentiment. Increased public exposure to unfamiliar concepts through the positive media coverage brought about by the Singularity Summit works to improve the legitimacy of those concepts and reduce future shock.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1 c21&quot;&gt;The result is that hard problems get easier to solve. Experts interested in helping, but afraid of social condemnation, will be more likely to do core research. The curious will be further motivated to break problems down. Vague far-mode thinking about future technologies will, for a few, shift into near-mode thinking about solutions. Public reaction to what would otherwise be shocking concepts will shift away from the extreme. The future becomes more conditioned to accept the real work and real costs of battling existential risk.&lt;/p&gt;
&lt;ul&gt;
&lt;/ul&gt;
&lt;h2&gt;&lt;span class=&quot;c13 c11&quot;&gt;SIAI Milestones&lt;/span&gt;&lt;/h2&gt;
&lt;p class=&quot;c1&quot;&gt;This is not a complete list of SIAI milestones, but covers quite a few of the materials and events that the SIAI has produced over the years.&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;&lt;strong&gt;2005&lt;/strong&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span class=&quot;c3&quot;&gt; &lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;span class=&quot;c3&quot;&gt;RF Eliezer Yudkowsky publishes &amp;#x201C;&lt;/span&gt;&lt;span class=&quot;c5&quot;&gt;&lt;a href=&quot;http://yudkowsky.net/rational/technical&quot;&gt;A Technical Explanation of Technical Explanation&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt;&amp;#x201D;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span class=&quot;c3&quot;&gt;RF Eliezer Yudkowsky writes chapters for &amp;#x201C;&lt;/span&gt;&lt;span class=&quot;c5&quot;&gt;&lt;a href=&quot;http://www.amazon.com/Global-Catastrophic-Risks-Nick-Bostrom/dp/0198570503/ref=sr_1_1?ie=UTF8&amp;amp;s=books&amp;amp;qid=1304291108&amp;amp;sr=8-1&quot;&gt;Global Catastrophic Risks&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt;&amp;#x201D;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;AI and existential risk presentations at Stanford, Immortality Institute&amp;#x2019;s Life Extension Conference, and the Terasem Foundation.&lt;/li&gt;
&lt;/ul&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;&lt;strong&gt;2006&lt;/strong&gt;&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Fundraising efforts expand significantly.&lt;/li&gt;
&lt;li&gt;SIAI hosts the first Singularity Summit at Stanford.&lt;/li&gt;
&lt;/ul&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;&lt;strong&gt;2007&lt;/strong&gt;&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;SIAI hosts the Singularity Summit in San Francisco.&lt;/li&gt;
&lt;li&gt;&lt;span class=&quot;c5&quot;&gt;&lt;a href=&quot;http://singinst.org/blog/&quot;&gt;SIAI outreach blog&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt; is started.&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span class=&quot;c5&quot;&gt;&lt;a href=&quot;http://singinst.org/media/interviews&quot;&gt;SIAI Interview Series&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt; is started.&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span class=&quot;c5&quot;&gt;&lt;a href=&quot;http://www.youtube.com/watch?v=0A9pGhwQbS0&quot;&gt;SIAI introductory video&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt; is developed and released.&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;&lt;strong&gt;2008&lt;/strong&gt;&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;SIAI hosts the Singularity Summit in San Jose.&lt;/li&gt;
&lt;li&gt;SIAI Interview Series is expanded.&lt;/li&gt;
&lt;li&gt;SIAI begins its summer intern program.&lt;/li&gt;
&lt;/ul&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;&lt;strong&gt;2009&lt;/strong&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;&lt;strong&gt;&lt;span style=&quot;font-weight: normal;&quot;&gt;&lt;span class=&quot;c3&quot;&gt;Significant detail on 2009 achievements is&amp;#xA0;&lt;/span&gt;&lt;span class=&quot;c5&quot;&gt;&lt;a href=&quot;http://singinst.org/achievements&quot;&gt;available here&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt;. More publications are &lt;a href=&quot;http://singinst.org/research/publications&quot;&gt;available here&lt;/a&gt;.&lt;/span&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;RF Eliezer Yudkowsky completes the rationality&amp;#xA0;&lt;a href=&quot;http://wiki.lesswrong.com/wiki/Sequences&quot;&gt;sequences&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;/&quot;&gt;Less Wrong&lt;/a&gt; is founded.&lt;/li&gt;
&lt;li&gt;SIAI hosts the Singularity Summit in New York.&lt;/li&gt;
&lt;li&gt;RF Anna Salamon speaks on technological forecasting at the Santa Fe institute.&lt;/li&gt;
&lt;li&gt;SIAI establishes the&lt;a href=&quot;http://singinst.org/aboutus/visitingfellows&quot;&gt; Visiting Fellows Program&lt;/a&gt;. Graduate and under-graduate students within AI related disciplines develop related talks and papers.&lt;/li&gt;
&lt;/ul&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;Papers and talks from SIAI fellows produced in 2009:&lt;/span&gt;&lt;/p&gt;
&lt;ol class=&quot;c14&quot;&gt;
&lt;li class=&quot;c16 c1&quot;&gt;&lt;span class=&quot;c8&quot;&gt;&lt;a href=&quot;http://singinst.org/theuncertainfuture.html&quot;&gt;&amp;#x201C;Changing the frame of AI futurism: From storytelling to heavy-tailed, high-dimensional probability distributions&amp;#x201D;&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c7&quot;&gt;, by Steve Rayhawk, Anna Salamon, Tom McCabe, Rolf Nelson, and Michael Anissimov. (Presented at the European Conference of Computing and Philosophy in July &amp;#x2018;09 (ECAP))&lt;/span&gt;&lt;/li&gt;
&lt;li class=&quot;c16 c1&quot;&gt;&lt;span class=&quot;c7&quot;&gt;&amp;#x201C;Arms Control and Intelligence Explosions&amp;#x201D;, by Carl Shulman (Also presented at ECAP)&lt;/span&gt;&lt;/li&gt;
&lt;li class=&quot;c1 c16&quot;&gt;&lt;span class=&quot;c7&quot;&gt;&amp;#x201C;&lt;/span&gt;&lt;span class=&quot;c8&quot;&gt;&lt;a href=&quot;http://bentham.k2.t.u-tokyo.ac.jp/ap-cap09/openconf/data/papers/28-2.pdf&quot;&gt;Machine Ethics and Superintelligence&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c7&quot;&gt;&amp;#x201D;, by Carl Shulman and Henrik Jonsson (Presented at the Asia-Pacific Conference of Computing and Philosophy in October &amp;#x2018;09 (APCAP))&lt;/span&gt;&lt;/li&gt;
&lt;li class=&quot;c16 c1&quot;&gt;&lt;span class=&quot;c7&quot;&gt;&amp;#x201C;&lt;/span&gt;&lt;span class=&quot;c8&quot;&gt;&lt;a href=&quot;http://bentham.k2.t.u-tokyo.ac.jp/ap-cap09/openconf/data/papers/33.pdf&quot;&gt;Which Consequentialism? Machine Ethics and Moral Divergence&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c7&quot;&gt;&amp;#x201D;, by Carl Shulman and Nick Tarleton (Also presented at APCAP);&lt;/span&gt;&lt;/li&gt;
&lt;li class=&quot;c16 c1&quot;&gt;&lt;span class=&quot;c7&quot;&gt;&amp;#x201C;Long-term AI forecasting: Building methodologies that work&amp;#x201D;, an invited presentation by Anna Salamon at the Santa Fe Institute conference on forecasting;&lt;/span&gt;&lt;/li&gt;
&lt;li class=&quot;c16 c1&quot;&gt;&lt;span class=&quot;c8&quot;&gt;&lt;a href=&quot;http://www.vimeo.com/7318055&quot;&gt;&amp;#x201C;Shaping the Intelligence Explosion&amp;#x201D;&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c7&quot;&gt; and &lt;/span&gt;&lt;span class=&quot;c5&quot;&gt;&lt;a href=&quot;http://www.vimeo.com/7397629&quot;&gt;&amp;#x201C;How much it matters to know what matters: A back of the envelope calculation&amp;#x201D;&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt;, presentations by Anna Salamon at the Singularity Summit 2009 in October&lt;/span&gt;&lt;/li&gt;
&lt;li class=&quot;c16 c1&quot;&gt;&lt;span class=&quot;c8&quot;&gt;&lt;a href=&quot;http://www.vimeo.com/7320152&quot;&gt;&amp;#x201C;Pathways to Beneficial Artificial General Intelligence: Virtual Pets, Robot Children, Artificial Bioscientists, and Beyond&amp;#x201D;&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c7&quot;&gt;, a presentation by SIAI Director of Research Ben Goertzel at Singularity Summit 2009;&lt;/span&gt;&lt;/li&gt;
&lt;li class=&quot;c16 c1&quot;&gt;&lt;span class=&quot;c8&quot;&gt;&lt;a href=&quot;http://www.vimeo.com/7426357&quot;&gt;&amp;#x201C;Cognitive Biases and Giant Risks&amp;#x201D;&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c7&quot;&gt;, &amp;#xA0;a presentation by SIAI Research Fellow Eliezer Yudkowsky at Singularity Summit 2009;&lt;/span&gt;&lt;/li&gt;
&lt;li class=&quot;c16 c1&quot;&gt;&lt;span class=&quot;c8&quot;&gt;&lt;a href=&quot;http://arxiv.org/abs/0907.5598&quot;&gt;&amp;#x201C;Convergence of Expected Utility for Universal Artificial Intelligence&amp;#x201D;&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c7&quot;&gt;, a paper by Peter de Blanc, an SIAI Visiting Fellow.&lt;/span&gt;&lt;/li&gt;
&lt;/ol&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;* Text for this list of papers reproduced from &lt;/span&gt;&lt;span class=&quot;c5&quot;&gt;&lt;a href=&quot;http://singinst.org/achievements&quot;&gt;here&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;A list of achievements, papers, and talks from 2010 is pending. See also the Singularity Summit content links above.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span style=&quot;font-size: 15px; font-weight: bold;&quot;&gt;Further Editorial Thoughts...&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;Prior to doing this investigation I had some expectation that the SIAI was a money losing operation. I didn&amp;#x2019;t expect the Singularity Summit to be making money. I had an expectation that Eliezer probably made around $70k (programmer money discounted for being paid by a non-profit). I figured the SIAI had a broad donor base of small donations. I was off base on all counts.&lt;/span&gt;&lt;/p&gt;
&lt;p style=&quot;padding-left: 30px;&quot; class=&quot;c1&quot;&gt;&lt;span class=&quot;c3 c10&quot;&gt;&lt;em&gt;I had some expectation that the SIAI was a money losing operation.&lt;/em&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;I had weak confidence in this belief, as I don&amp;#x2019;t know a lot about the finances of public organizations. The SIAI appears to be managing its cash reserves well. It would be good to see the SIAI build up some asset reserves so that it could operate comfortably in years where public support dips or so that it could take advantage of unexpected opportunities.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;Overall, the allocation of funds strikes me as highly efficient.&lt;/span&gt;&lt;/p&gt;
&lt;p style=&quot;padding-left: 30px;&quot; class=&quot;c1&quot;&gt;&lt;span class=&quot;c3 c10&quot;&gt;&lt;em&gt;I didn&amp;#x2019;t expect the Singularity Summit to be making money.&lt;/em&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;This was a surprising finding, although I incorrectly conditioned my expectation from experiences working with game industry conferences. I don't know exactly how much the SIAI is spending on food and fancy tablecloths at the Singularity Summit, but I don't think I care: it's growing and showing better results on the revenue chart each year. If you attend the conference and contribute to the event you add pure value. As discussed above, the benefits of the conference appear to be very far in the &amp;#x201C;reducing existential risk&amp;#x201D; category. Losing the Summit would be a blow to ensuring a safe future.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;I know that the Summit will not itself do the hard work of dissolving and solving problems, or of synthesizing new theories, or of testing those theories, or of implementing solutions. The value of the Summit lies in its ability to raise awareness of the work that needs to be done, to create networks of people to do that work, to lower public shock at the implications of that work, and generate funding for those doing that work.&lt;/span&gt;&lt;/p&gt;
&lt;p style=&quot;padding-left: 30px;&quot; class=&quot;c1&quot;&gt;&lt;span class=&quot;c3 c10&quot;&gt;&lt;em&gt;I had an expectation that Eliezer probably made around $70k.&lt;/em&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;Eliezer's compensation is slightly more than I thought. I'm not sure what upper bound I would have balked at or would balk at. I do have some concern about the cost of recruiting additional Research Fellows. The cost of additional RFs has to be weighed against new programs like Visiting Fellows.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;At the same time, the organization has been able to expand services without draining the coffers. A donor can hold a strong expectation that the bulk of their donation will go toward actual work in the form of salaries for working personnel or events like the Visiting Fellows Program.&lt;/span&gt;&lt;/p&gt;
&lt;p style=&quot;padding-left: 30px;&quot; class=&quot;c1&quot;&gt;&lt;span class=&quot;c3 c10&quot;&gt;&lt;em&gt;I figured the SIAI had a broad donor base of small donations.&lt;/em&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;I must have been out to lunch when making this prediction. I figured the SIAI was mostly supported by futurism enthusiasts and small scale rationalists.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;The organization has a heavy reliance on major donor support. I would expect the 2010 filing to reveal a broadening of revenue, but I do not expect the organization to have become independent of big donor support. Big donor support is a good thing to have, but more long term stability would be provided by a broader base of supporters.&lt;/span&gt;&lt;/p&gt;
&lt;h2&gt;&lt;span class=&quot;c3 c11&quot;&gt;My suggestions to the SIAI:&lt;/span&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Consider relocating to a cheaper part of the planet. Research Fellows will likely have to accept lower than market average compensation for their work or no compensation at all. Better to live in an area where compensation goes farther.&lt;/li&gt;
&lt;li&gt;Consider increasing savings to allow for a larger safety net and the ability to take advantage of opportunities.&lt;/li&gt;
&lt;li&gt;Consider itemizing program service expenses in more detail. It isn&amp;#x2019;t required, but the transparency makes for better decision making on the part of donors.&lt;/li&gt;
&lt;li&gt;Consider providing more information on what Eliezer and other Research Fellows are working on from time to time. You are building two communities. A community of polymaths who will solve hard problems and a community of supporters who believe in the efforts of the polymaths. The latter are more likely to continue their support if they have insight into the activities of the former.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2&gt;&lt;span class=&quot;c3 c11&quot;&gt;Moving forward:&lt;/span&gt;&lt;/h2&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;John Salvatier provided me with good insight into next steps for gaining further clarity into the SIAI&amp;#x2019;s operational goals, methodology, and financial standing.&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Contact GiveWell for expert advice on organizational analysis to help clarify good next steps.&lt;/li&gt;
&lt;li&gt;&lt;span class=&quot;c3&quot;&gt;Get more information on current and forthcoming SIAI research projects. Is there active work in the &lt;/span&gt;&lt;span class=&quot;c5&quot;&gt;&lt;a href=&quot;http://singinst.org/research/researchareas&quot;&gt;research areas&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt; the SIAI has identified? Is there a game plan for attacking particular problems in the research space?&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;Spend some time gathering information from SIAI members on how they would utilize new funds. Are there specific opportunities the SIAI has identified? Where is the organization &amp;#x201C;only limited by a lack of cash&amp;#x201D; -- if they had more funds, what would they immediately pursue?&lt;/li&gt;
&lt;li&gt;Formulate methods of validating the SIAI&amp;#x2019;s execution of goals. It appears that the Summit is an example of efficient execution of the reducing existential risk goal by legitimizing the existential risk and AGI problem space and by building networks among interested individuals. How will donors verify the value of SIAI core research work in coming years?&lt;/li&gt;
&lt;/ul&gt;
&lt;h2&gt;&lt;span class=&quot;c3 c11&quot;&gt;Conclusion&lt;/span&gt;&lt;/h2&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;At present, the financial position of the SIAI seems sound. The Singularity Summit stands as a particular success that should be acknowledged. The ability for the organization to reduce officer compensation at the same time it expands programs is also notable.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;Tax documents can only tell us so much. A deeper picture of the SIAI would work to reveal more of the moving parts within the organization. It would provide a better account of monthly activities and provide a means to measure future success or failure. The question for many supporters will not be &amp;#x201C;should I donate&amp;#x201D; but &amp;#x201C;should I continue to donate?&amp;#x201D; A question that can be answered by increased and ongoing transparency.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;It is important that those who are concerned with existential risks, AGI, and the safety of future technologies and who choose to donate to the SIAI take a role in shaping a positive future for the organization. Donating in support of AI research is valuable, but donating and also telling others about the donation is far more valuable.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;Consider the Sequence post &lt;/span&gt;&lt;span class=&quot;c5&quot;&gt;&lt;a href=&quot;/lw/3h/why_our_kind_cant_cooperate/&quot;&gt;&amp;#x2018;Why Our Kind Can&amp;#x2019;t Cooperate.&amp;#x2019;&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt; If the SIAI is an organization worth supporting, and given that they are working in a problem space that currently only has strong traction with &amp;#x201C;our kind,&amp;#x201D; then there is a risk of the SIAI failing to reach its maximum potential because donors do not coordinate successfully. If you are a donor, stand up and be counted. Post on Less Wrong and describe why you donated. Let the SIAI post your name. Help other donors see that they aren&amp;#x2019;t acting alone.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;Similarly, if you are critical of the SIAI think about why and write it up. Create a discussion and dig into the details. The path most likely to increase existential risk is the one where rational thinkers stay silent.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;The SIAI&amp;#x2019;s current operating budget and donor revenue is very small. It is well within our community&amp;#x2019;s ability to effect change.&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c0&quot;&gt;&amp;#xA0;&lt;/p&gt;
&lt;p class=&quot;c0&quot;&gt;My research has led me to the conclusion I should donate to the SIAI (above my previous pledge in support of rationality boot camp).&amp;#xA0;I already donate to Alcor and am an Alcor member. I have to determine an amount for the SIAI that won't cause wife aggro. Unilateral household financial decisions increase my personal existential risk. :P I will update this document or make a comment post when I know more.&lt;/p&gt;
&lt;p class=&quot;c0&quot;&gt;&amp;#xA0;&lt;/p&gt;
&lt;p class=&quot;c0&quot;&gt;&amp;#xA0;&lt;/p&gt;
&lt;p class=&quot;c0&quot;&gt;References:&lt;/p&gt;
&lt;p class=&quot;c0&quot;&gt;&lt;a href=&quot;https://spreadsheets.google.com/ccc?key=0Av5A0Qa--iQVdGFrd1JEWks2aWh2eXJfTTJpOFBiYWc&amp;amp;hl=en&quot;&gt;My working spreadsheet is here.&lt;/a&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;(1)&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt;&lt;a href=&quot;http://www.singularitysummit.com/&quot;&gt;&amp;#xA0;&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c5&quot;&gt;&lt;a href=&quot;http://www.singularitysummit.com/&quot;&gt;http://www.singularitysummit.com/&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;(2)&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt;&lt;a href=&quot;/lw/ts/singularity_summit_2008/&quot;&gt;&amp;#xA0;&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c5&quot;&gt;&lt;a href=&quot;/lw/ts/singularity_summit_2008/&quot;&gt;http://lesswrong.com/lw/ts/singularity_summit_2008/&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;(3)&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt;&lt;a href=&quot;http://www.popsci.com/scitech/article/2009-10/singularity-summit-2009-singularity-near&quot;&gt;&amp;#xA0;&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c5&quot;&gt;&lt;a href=&quot;http://www.popsci.com/scitech/article/2009-10/singularity-summit-2009-singularity-near&quot;&gt;http://www.popsci.com/scitech/article/2009-10/singularity-summit-2009-singularity-near&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;(4)&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt;&lt;a href=&quot;http://www.popularmechanics.com/technology/engineering/robots/4332783&quot;&gt;&amp;#xA0;&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c5&quot;&gt;&lt;a href=&quot;http://www.popularmechanics.com/technology/engineering/robots/4332783&quot;&gt;http://www.popularmechanics.com/technology/engineering/robots/4332783&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;(5)&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt;&lt;a href=&quot;http://www.guardian.co.uk/technology/2008/nov/06/artificialintelligenceai-engineering&quot;&gt;&amp;#xA0;&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c5&quot;&gt;&lt;a href=&quot;http://www.guardian.co.uk/technology/2008/nov/06/artificialintelligenceai-engineering&quot;&gt;http://www.guardian.co.uk/technology/2008/nov/06/artificialintelligenceai-engineering&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;(6)&lt;/span&gt;&lt;span class=&quot;c3&quot;&gt;&lt;a href=&quot;http://www.time.com/time/health/article/0,8599,2048138-1,00.html&quot;&gt;&amp;#xA0;&lt;/a&gt;&lt;/span&gt;&lt;span class=&quot;c5&quot;&gt;&lt;a href=&quot;http://www.time.com/time/health/article/0,8599,2048138-1,00.html&quot;&gt;http://www.time.com/time/health/article/0,8599,2048138-1,00.html&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;(7)&amp;#xA0;&lt;/span&gt;&lt;span class=&quot;c5&quot;&gt;&lt;a href=&quot;http://www.sl4.org/shocklevels.html&quot;&gt;http://www.sl4.org/shocklevels.html&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c0&quot;&gt;&lt;span class=&quot;c5&quot;&gt;&lt;a href=&quot;http://www.time.com/time/health/article/0,8599,2048138-1,00.html&quot;&gt;&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p class=&quot;c1&quot;&gt;&lt;span class=&quot;c3&quot;&gt;(A) Summit Content&lt;/span&gt;&lt;/p&gt;
&lt;ul style=&quot;margin-top: 10px; margin-right: 2em; margin-bottom: 10px; margin-left: 2em; list-style-type: disc; list-style-position: outside; list-style-image: initial;&quot;&gt;
&lt;li&gt;2006 - &lt;a href=&quot;http://singinst.org/media/singularitysummit2006&quot;&gt;http://singinst.org/media/singularitysummit2006&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;2007 - &lt;a href=&quot;http://singinst.org/media/singularitysummit2007&quot;&gt;http://singinst.org/media/singularitysummit2007&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;2008 - &lt;a href=&quot;http://singinst.org/media/singularitysummit2008&quot;&gt;http://singinst.org/media/singularitysummit2008&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;2009 - &lt;a href=&quot;http://www.vimeo.com/siai/videos/sort:oldest&quot;&gt;http://www.vimeo.com/siai/videos/sort:oldest&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;2010 - &lt;a href=&quot;http://www.vimeo.com/album/1519377&quot;&gt;http://www.vimeo.com/album/1519377&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/lw/5il/siai_an_examination/#comments"&gt;203 comments&lt;/a&gt;
</description>
</item>
<item>
<title>Rationality Boot Camp</title>
<link>http://lesswrong.com/lw/4wm/rationality_boot_camp/</link>
<guid isPermaLink="true">http://lesswrong.com/lw/4wm/rationality_boot_camp/</guid>
<pubDate>Tue, 22 Mar 2011 19:37:46 +1100</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/Jasen"&gt;Jasen&lt;/a&gt;
&amp;bull;
72 votes
&amp;bull;
&lt;a href="http://lesswrong.com/lw/4wm/rationality_boot_camp/#comments"&gt;197 comments&lt;/a&gt;
&lt;div&gt;&lt;p&gt;It&amp;#x2019;s been over a year since the Singularity Institute launched our ongoing Visiting Fellows Program and we&amp;#x2019;ve learned a lot in the process of running it.&amp;#xA0;&amp;#xA0;This&amp;#xA0;summer we&amp;#x2019;re going to try something different.&amp;#xA0; We&amp;#x2019;re going to run Rationality Boot Camp.&lt;/p&gt;
&lt;p&gt;We are going to try to take ten weeks and fill them with&amp;#xA0;activities&amp;#xA0;meant to teach mental skills - if there's reading to be done, we'll tell you to get it done in advance.&amp;#xA0; We aren't just aiming to teach skills like betting at the right odds or learning how to take into account others' information, we're going to practice techniques like mindfulness meditation and Rejection Therapy (making requests that you know will be rejected), in order to teach focus, non-attachment, social courage and all the other things that are&amp;#xA0;also&amp;#xA0;needed to produce formidable rationalists.&amp;#xA0; Participants will learn how to draw (so that they can learn how to pay attention to previously unnoticed details, and see that they can do things that previously seemed like mysterious superpowers).&amp;#xA0; We will play games, and switch games every few days, to get used to novelty and practice learning.&lt;/p&gt;
&lt;p&gt;We're going to run A/B tests on you, and track the results to find out which training activities work best, and begin the tradition of evidence-based rationality training.&lt;/p&gt;
&lt;p&gt;In short, we're going to start constructing the kind of program that universities would run if they&amp;#xA0;actually&amp;#xA0;wanted to teach you how to think. &lt;a id=&quot;more&quot;&gt;&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;And then at the end, some of us are going to go to Burning Man for training in desert survival and living in an emotionally positive community.&lt;/p&gt;
&lt;p&gt;When I call the program Rationality Boot Camp, I mean this quite literally.&amp;#xA0; Six days per week, participants will rise, meditate, prepare and eat food, attend lectures, participate in group and individual activities and exercise together. &amp;#xA0;&lt;/p&gt;
&lt;p&gt;Everyone who applies needs to have read at least&amp;#xA0;some&amp;#xA0;of the &lt;a href=&quot;http://wiki.lesswrong.com/wiki/Sequences&quot; target=&quot;_blank&quot;&gt;Sequences&lt;/a&gt;, and may be assigned particular posts as makeup material - in which case you will need to read them&amp;#xA0;before&amp;#xA0;you arrive and you may be turned away otherwise.&amp;#xA0; Apart from that, we'll look for a mix of people who've demonstrated high productivity and who already seem like good epistemic rationalists.&amp;#xA0; The program will begin in the first week of June and continue through mid-August.&amp;#xA0; We will cover room, board and airfare.&amp;#xA0;&amp;#xA0;We're going to try to take this up to the next level of awesome.&amp;#xA0; It's our first time trying something this ambitious and there&amp;#xA0;will&amp;#xA0;be speedbumps - and if that sounds very scary, consider applying next year when we'll know more for certain about how to teach people courage and the art of overcoming setbacks.&amp;#xA0; But if you're the sort of person who wants to be part of this program&amp;#xA0;today, instead of putting it off into the indefinite future of maybe-never - or if you know that's the sort of person you&amp;#xA0;want&amp;#xA0;to be, and you're willing to put in the effort to reach up to that - then&amp;#xA0;&lt;a href=&quot;https://singularityinstitute.wufoo.com/forms/rationality-boot-camp-application/&quot; target=&quot;_blank&quot;&gt;send in an application&lt;/a&gt;.&amp;#xA0;&lt;/p&gt;
&lt;p&gt;Edit:&lt;/p&gt;
&lt;p&gt;
&lt;/p&gt;&lt;p&gt;Attention: &amp;#xA0;Anyone still interested in attending the course must get their application in by midnight on Friday the 8th of April. &amp;#xA0;I would like to make the final decision about who to accept by mid April and need to finish interviewing applicants before then. &amp;#xA0; &amp;#xA0; &amp;#xA0; &amp;#xA0;&lt;/p&gt;
&lt;div&gt;I expect to accept between 10 and 15 people this summer.&lt;/div&gt;
&lt;p&gt;I expect to make all decisions about acceptance before the end of April and will try to do so sooner. &amp;#xA0;I will start scheduling skype interviews in a few days and will &lt;em&gt;not&lt;/em&gt;&amp;#xA0;wait until an arbitrary date before accepting people. &amp;#xA0;Apply as soon as possible to maximize your chance of being considered for this summer!&lt;br&gt;Don't worry if you're not chosen this time. &amp;#xA0;This program is an experiment, and if all goes well we will be holding holding several (even better!) programs like it each year. &amp;#xA0;If we never hold it again, you probably didn't miss much. &amp;#xA0;&lt;/p&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/lw/4wm/rationality_boot_camp/#comments"&gt;197 comments&lt;/a&gt;
</description>
</item>
<item>
<title>Tallinn-Evans $125,000 Singularity Challenge</title>
<link>http://lesswrong.com/lw/3gy/tallinnevans_125000_singularity_challenge/</link>
<guid isPermaLink="true">http://lesswrong.com/lw/3gy/tallinnevans_125000_singularity_challenge/</guid>
<pubDate>Sun, 26 Dec 2010 22:21:22 +1100</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/Kaj_Sotala"&gt;Kaj_Sotala&lt;/a&gt;
&amp;bull;
27 votes
&amp;bull;
&lt;a href="http://lesswrong.com/lw/3gy/tallinnevans_125000_singularity_challenge/#comments"&gt;369 comments&lt;/a&gt;
&lt;div&gt;&lt;p&gt;Michael Anissimov posted the following &lt;a href=&quot;http://singinst.org/blog/2010/12/21/announcing-the-tallinn-evans-125000-singularity-holiday-challenge/&quot;&gt;on the SIAI blog&lt;/a&gt;:&lt;/p&gt;
&lt;p&gt;Thanks to the generosity of two major donors; Jaan Tallinn, a founder of Skype and Ambient Sound Investments, and Edwin Evans, CEO of the mobile applications startup Quinly, &lt;strong&gt;every contribution to the Singularity Institute up until January 20, 2011 will be &lt;a href=&quot;http://singinst.org/donate&quot;&gt;matched dollar-for-dollar&lt;/a&gt;, up to a total of $125,000.&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Interested in optimal philanthropy &amp;#x2014; that is, maximizing the future expected benefit to humanity per charitable dollar spent? The technological creation of greater-than-human intelligence has the potential to unleash an &amp;#x201C;intelligence explosion&amp;#x201D; as intelligent systems design still more sophisticated successors. This dynamic could transform our world as greatly as the advent of human intelligence has already transformed the Earth, for better or for worse. Thinking rationally about these prospects and working to encourage a favorable outcome offers an extraordinary chance to make a difference. The Singularity Institute exists to do so through its research, the Singularity Summit, and public education.&lt;/p&gt;
&lt;p&gt;We support both direct engagements with the issues as well as the improvements in methodology and rationality needed to make better progress. Through our &lt;a href=&quot;http://singinst.org/aboutus/visitingfellows&quot;&gt;Visiting Fellows program&lt;/a&gt;, researchers from undergrads to Ph.Ds pursue questions on the foundations of Artificial Intelligence and related topics in two-to-three month stints. Our &lt;a href=&quot;http://singinst.org/research/residentfaculty&quot;&gt;Resident Faculty&lt;/a&gt;, up to four researchers from three last year, pursues long-term projects, including AI research, a literature review, and a book on rationality, the first draft of which was just completed. Singularity Institute researchers and representatives gave over a dozen presentations at half a dozen conferences in 2010. Our Singularity Summit conference in San Francisco was a great success, bringing together over 600 attendees and 22 top scientists and other speakers to explore cutting-edge issues in technology and science.&lt;/p&gt;
&lt;p&gt;We are pleased to receive donation matching support this year from Edwin Evans of the United States, a long-time Singularity Institute donor, and Jaan Tallinn of Estonia, a more recent donor and supporter. Jaan recently gave a &lt;a href=&quot;http://aaltoes.com/2010/10/jaan-tallinn-from-soviets-to-singularity/&quot;&gt;talk on the Singularity and his life&lt;/a&gt; at a entrepreneurial group in Finland. Here&amp;#x2019;s what Jaan has to say about us:&lt;/p&gt;
&lt;p&gt;&lt;em&gt;&amp;#x201C;We became the dominant species on this planet by being the most intelligent species around. This century we are going to cede that crown to machines. After we do that, it will be them steering history rather than us. Since we have only one shot at getting the transition right, the importance of SIAI&amp;#x2019;s work cannot be overestimated. Not finding any organisation to take up this challenge as seriously as SIAI on my side of the planet, I conclude that it&amp;#x2019;s worth following them across 10 time zones.&amp;#x201D;&lt;/em&gt;&lt;br&gt; &amp;#x2013; Jaan Tallinn, Singularity Institute donor&lt;/p&gt;
&lt;p&gt;Make a lasting impact on the long-term future of humanity today &amp;#x2014; make a &lt;a href=&quot;http://singinst.org/donate&quot;&gt;donation to the Singularity Institute&lt;/a&gt; and help us reach our $125,000 goal. For more detailed information on our projects and work, contact us at &lt;a href=&quot;mailto:institute@singinst.org&quot;&gt;institute@singinst.org&lt;/a&gt; or read our new &lt;a href=&quot;http://singinst.org/riskintro/index.html&quot;&gt;organizational overview.&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;-----&lt;/p&gt;
&lt;p&gt;Kaj's commentary: if you haven't done so recently, do check out the &lt;a href=&quot;http://singinst.org/research/publications&quot;&gt;SIAI publications page&lt;/a&gt;. There are several new papers and presentations, out of which I thought that Carl Shulman's &lt;a href=&quot;http://singinst.org/upload/WBE-superorganisms.pdf&quot;&gt;Whole Brain Emulations and the Evolution of Superorganisms&lt;/a&gt; made for particularly fascinating (and scary) reading. SIAI's finally starting to get its paper-writing machinery into gear, so let's give them money to make that possible. There's also a &lt;a href=&quot;http://singinst.org/tallinn-evans_challenge&quot;&gt;static page&lt;/a&gt; about this challenge; if you're on Facebook, please take the time to &quot;like&quot; it there.&lt;/p&gt;
&lt;p&gt;(Full disclosure: I was an SIAI Visiting Fellow in April-July 2010.)&lt;/p&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/lw/3gy/tallinnevans_125000_singularity_challenge/#comments"&gt;369 comments&lt;/a&gt;
</description>
</item>
<item>
<title>The danger of living a story - Singularity Tropes</title>
<link>http://lesswrong.com/lw/337/the_danger_of_living_a_story_singularity_tropes/</link>
<guid isPermaLink="true">http://lesswrong.com/lw/337/the_danger_of_living_a_story_singularity_tropes/</guid>
<pubDate>Mon, 15 Nov 2010 09:39:06 +1100</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/patrissimo"&gt;patrissimo&lt;/a&gt;
&amp;bull;
24 votes
&amp;bull;
&lt;a href="http://lesswrong.com/lw/337/the_danger_of_living_a_story_singularity_tropes/#comments"&gt;61 comments&lt;/a&gt;
&lt;div&gt;&lt;p&gt;The following should sound familiar:&lt;/p&gt;
&lt;p&gt;&lt;em&gt;A thoughtful and observant young &lt;a href=&quot;http://yudkowsky.net/&quot;&gt;protagonist&lt;/a&gt; dedicates their life to fighting &lt;a href=&quot;http://wiki.lesswrong.com/wiki/UnFriendly_artificial_intelligence&quot;&gt;a great world-threatening evil&lt;/a&gt; &lt;a href=&quot;/lw/2l8/existential_risk_and_public_relations/&quot;&gt;unrecognized&lt;/a&gt; by almost all of their short-sighted elders (except perhaps for one encouraging mentor), gathering &lt;a href=&quot;http://tvtropes.org/pmwiki/pmwiki.php/Main/RagtagBunchOfMisfits&quot;&gt;a rag-tag band&lt;/a&gt; of &lt;a href=&quot;http://www.acceleratingfuture.com/michael/blog/2006/08/visiting-siai/&quot;&gt;colorful misfits&lt;/a&gt; along the way and forging them into a &lt;a href=&quot;http://singinst.org/aboutus/team&quot;&gt;team&lt;/a&gt; by accepting their idiosyncrasies and making the most of their unique abilities, winning over previously neutral allies, ignoring &lt;a href=&quot;http://multiverseaccordingtoben.blogspot.com/2010/10/singularity-institutes-scary-idea-and.html&quot;&gt;those who just don't get it&lt;/a&gt;, obtaining or creating &lt;a href=&quot;http://en.wikipedia.org/wiki/Friendly_artificial_intelligence&quot;&gt;artifacts of great power&lt;/a&gt;, growing and &lt;a href=&quot;http://tvtropes.org/pmwiki/pmwiki.php/Main/ComingOfAgeStory&quot;&gt;changing along the way&lt;/a&gt; to &lt;a href=&quot;http://en.wikipedia.org/wiki/Intelligence_amplification&quot;&gt;become more powerful&lt;/a&gt;, fulfilling the potential seen by their &lt;a href=&quot;http://tvtropes.org/pmwiki/pmwiki.php/Main/Mentors&quot;&gt;mentors&lt;/a&gt;/&lt;a href=&quot;http://en.wikipedia.org/wiki/Peter_Thiel&quot;&gt;supporters&lt;/a&gt;/early adopters, while becoming more human (greater empathy, connection, humility) as they collect resources to prepare for their &lt;a href=&quot;http://tvtropes.org/pmwiki/pmwiki.php/Main/FinalBattle&quot;&gt;climactic battle&lt;/a&gt; &lt;a href=&quot;http://tvtropes.org/pmwiki/pmwiki.php/Main/RobotWar&quot;&gt;against the inhuman enemy&lt;/a&gt;.&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;Hmm, sounds a bit like &lt;a href=&quot;/singinst.org&quot;&gt;SIAI&lt;/a&gt;! &amp;#xA0;(And while I'm throwing stones, let me make it clear that I live in a glass house, since the same story could just as easily be adapted to &lt;a href=&quot;/seasteading.org&quot;&gt;TSI&lt;/a&gt;, my organization, as well as many others)&lt;/p&gt;
&lt;p&gt;This story is related to Robin's&amp;#xA0;&lt;a href=&quot;http://www.overcomingbias.com/2008/11/abstractdistant.html&quot;&gt;Abstract/Distant Future Bias&lt;/a&gt;:&amp;#xA0;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Regarding distant futures, however, we&amp;#x2019;ll be too confident, focus too much on unlikely global events, rely too much on trends, theories, and loose abstractions, while neglecting details and variation.&amp;#xA0; We&amp;#x2019;ll assume the main events take place far away (e.g., space), and uniformly across large regions.&amp;#xA0; We&amp;#x2019;ll focus on untrustworthy consistently-behaving globally-organized social-others.&amp;#xA0; And we&amp;#x2019;ll neglect feasibility, taking chances to achieve core grand symbolic values, rather than ordinary muddled values.&lt;/p&gt;
&lt;p&gt;More bluntly, we seem primed to confidently see history as an inevitable march toward a theory-predicted global conflict with an alien united them determined to oppose our core symbolic values, making infeasible overly-risky overconfident plans to oppose them.&amp;#xA0; We seem primed to neglect the value and prospect of trillions of quirky future creatures not fundamentally that different from us, focused on their simple day-to-day pleasures, mostly getting along peacefully in vastly-varied uncoordinated and hard-to-predict local cultures and life-styles.&amp;#xA0;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Living a story is potentially risky, for example&amp;#xA0;&lt;a href=&quot;http://patrissimo.livejournal.com/1371198.html&quot;&gt;Tyler Cowen warns us to be cautious of stories&lt;/a&gt;&amp;#xA0;as there are far fewer stories than there are real scenarios, and so stories must oversimplify. &amp;#xA0;Our view of the future may be colored by a&amp;#xA0;&lt;a href=&quot;http://www.overcomingbias.com/2010/10/the-future-seems-shiny.html#comment-457372&quot;&gt;&quot;fiction bias&quot;&lt;/a&gt;, which leads us to expect outcomes like those we see in movies (climactic battles, generally interesting events following a single plotline). &amp;#xA0;Thus stories threaten both epistemic rationality (we assume the real world is more like stories than it is) and instrumental rationality (we assume the best actions to effect real-world change are those which story heroes take).&lt;/p&gt;
&lt;p&gt;Yet we'll tend to live stories anyway because it is fun -&amp;#xA0;it inspires supporters, allies, and protagonists. &amp;#xA0;The marketing for &quot;we are an alliance to fight a great unrecognized evil&quot; can be quite emotionally evocative. &amp;#xA0;Including in our own self-narrative, which means we'll be tempted to buy into a story whether or not it is correct. &amp;#xA0;So while living a fun story is a utility benefit, it also means that story causes are likely to be over-represented among all causes, as they are memetically attractive. &amp;#xA0;This is especially true for the story&amp;#xA0;that there is risk of great, world-threatening evil, since those who believe it are inclined to shout it from the rooftops, while those who don't believe it get on with their lives. &amp;#xA0;(There are, of course,&amp;#xA0;&lt;a href=&quot;http://singinst.org/upload/cognitive-biases.pdf&quot;&gt;biases in the other direction as well&lt;/a&gt;).&lt;/p&gt;
&lt;p&gt;Which is not to say that all aspects of the story are wrong - advancing an original idea to greater prominence (scaling) will naturally lead to some of these tropes - most people disbelieving, a few allies, winning more people over time, eventual recognition as a visionary. &amp;#xA0;And&amp;#xA0;Michael Vassar &lt;a href=&quot;http://patrissimo.livejournal.com/1412351.html?thread=13046783#t13046783&quot;&gt;suggests&lt;/a&gt; that some of the tropes arise as a result of &quot;trying to rise in station beyond the level that their society channels them towards&quot;. &amp;#xA0;For these aspects, the tropes may contain evolved wisdom about how our ancestors negotiated similar situations.&lt;/p&gt;
&lt;p&gt;And whether or not a potential protagonist believes in this wisdom, the fact that others do will surely affect marketing decisions. &amp;#xA0;If Harry wishes to not be seen as Dark, he must care what others see as the signs of a Dark Wizard, whether or not he agrees with them. &amp;#xA0;If potential collaborators have internalized these stories, skillful protagonists will invoke them in recruiting, converting, and team-building. &amp;#xA0;Yet the space of story actions is constrained, and the best strategy may sometimes lie far outside them.&lt;/p&gt;
&lt;p&gt;Since this is not a story, we are left with no simple answer. &amp;#xA0;Many aspects of stories are false but resonate with us, and we must guard against them lest they contaminate our rationality. &amp;#xA0;Others contain wisdom about how those like us have navigated similar situations in the past - we must decide whether the similarities are true or superficial. &amp;#xA0;The most universal stories are likely to be the most effective in manipulating others, which any protagonist must due to amplify their own efforts in fighting for their cause. &amp;#xA0;Some of these universal stories are true and generally applicable, like scaling techniques, yet the set of common tropes seems far too detailed to&amp;#xA0;reflect universal truths rather than arbitrary biases of humanity and our evolutionary history.&lt;/p&gt;
&lt;p&gt;May you live happily ever after (vanquishing your inhuman enemy with your team of true friends, bonded through a cause despite superficial dissimilarities).&lt;/p&gt;
&lt;p&gt;The End.&lt;/p&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/lw/337/the_danger_of_living_a_story_singularity_tropes/#comments"&gt;61 comments&lt;/a&gt;
</description>
</item>
<item>
<title>What I would like the SIAI to publish</title>
<link>http://lesswrong.com/lw/304/what_i_would_like_the_siai_to_publish/</link>
<guid isPermaLink="true">http://lesswrong.com/lw/304/what_i_would_like_the_siai_to_publish/</guid>
<pubDate>Tue, 02 Nov 2010 01:07:42 +1100</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/XiXiDu"&gt;XiXiDu&lt;/a&gt;
&amp;bull;
27 votes
&amp;bull;
&lt;a href="http://lesswrong.com/lw/304/what_i_would_like_the_siai_to_publish/#comments"&gt;218 comments&lt;/a&gt;
&lt;div&gt;&lt;p&gt;&lt;span style=&quot;color: #ff0000;&quot;&gt;&lt;strong&gt;Major update&lt;/strong&gt;&lt;/span&gt; &lt;a href=&quot;http://kruel.co/2012/11/03/what-i-would-like-the-singularity-institute-to-publish/&quot;&gt;here&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Related to:&lt;/strong&gt; &lt;a href=&quot;/lw/2l0/should_i_believe_what_the_siai_claims&quot;&gt;Should I believe what the SIAI claims?&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Reply to:&lt;/strong&gt; &lt;a href=&quot;/lw/2zg/ben_goertzel_the_singularity_institutes_scary&quot; class=&quot;title&quot;&gt;Ben Goertzel: The Singularity Institute's Scary Idea (and Why I Don't Buy It)&lt;/a&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;... pointing out that something scary is possible, is a very different thing from having an argument that it&amp;#x2019;s likely. &amp;#x2014; Ben Goertzel&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h2&gt;What I ask for:&lt;br&gt;&lt;/h2&gt;
&lt;p&gt;I want the &lt;a href=&quot;http://singinst.org/&quot;&gt;SIAI&lt;/a&gt; or someone who is convinced of the &lt;em&gt;&lt;a href=&quot;/lw/2zg/ben_goertzel_the_singularity_institutes_scary/&quot;&gt;Scary&lt;/a&gt; &lt;a href=&quot;/lw/wp/what_i_think_if_not_why/&quot;&gt;Idea&lt;/a&gt;&lt;/em&gt;&lt;sup&gt;1&lt;/sup&gt; to state concisely and mathematically (and with possible extensive references if necessary) the decision procedure that led they to make the development of friendly artificial intelligence their top priority. I want them to state the numbers of their subjective probability distributions&lt;sup&gt;2&lt;/sup&gt; and exemplify their chain of reasoning, how they came up with those numbers and not others by way of sober calculations.&lt;/p&gt;
&lt;p&gt;The paper should also account for the following uncertainties:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Comparison with other existential risks and how &lt;a href=&quot;http://singinst.org/riskintro/index.html&quot; rel=&quot;nofollow&quot;&gt;catastrophic risks from artificial intelligence&lt;/a&gt; outweigh them. &lt;/li&gt;
&lt;li&gt;Potential negative consequences&lt;sup&gt;3&lt;/sup&gt; of slowing down research on artificial intelligence (a risks and benefits analysis).&lt;/li&gt;
&lt;li&gt;The likelihood of a gradual and controllable development versus the likelihood of an &lt;a href=&quot;http://wiki.lesswrong.com/wiki/Intelligence_explosion&quot;&gt;intelligence explosion&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;The likelihood of unfriendly AI&lt;sup&gt;4&lt;/sup&gt; versus friendly and respectively &lt;a href=&quot;http://medical-dictionary.thefreedictionary.com/abulic&quot;&gt;abulic&lt;/a&gt;&lt;sup&gt;5&lt;/sup&gt; AI.&lt;/li&gt;
&lt;li&gt;The ability of superhuman intelligence and cognitive flexibility as characteristics alone to constitute a serious risk given the absence of enabling technologies like advanced nanotechnology.&lt;/li&gt;
&lt;li&gt;The feasibility of &amp;#x201C;provably non-dangerous AGI&amp;#x201D;.&lt;/li&gt;
&lt;li&gt;The disagreement of the overwhelming majority of scientists working on artificial intelligence.&lt;/li&gt;
&lt;li&gt;That some people who are aware of the SIAI&amp;#x2019;s perspective do not accept it (e.g. Robin Hanson, Ben Goertzel, Nick Bostrom, Ray Kurzweil and Greg Egan).&lt;/li&gt;
&lt;li&gt;Possible conclusions that can be drawn from the &lt;a href=&quot;http://ieet.org/index.php/IEET/more/treder20100302/&quot;&gt;Fermi paradox&lt;/a&gt;&lt;sup&gt;6&lt;/sup&gt; regarding risks associated with superhuman AI versus other potential risks ahead.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Further I would like the paper to include and lay out a formal and systematic summary of what the SIAI expects researchers who work on artificial general intelligence to do and why they should do so. I would like to see a clear logical argument for why people working on artificial general intelligence should listen to what the SIAI has to say.&lt;/p&gt;
&lt;h2&gt;Examples:&lt;br&gt;&lt;/h2&gt;
&lt;p&gt;Here are are two examples of what I'm looking for:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&quot;http://www.overcomingbias.com/2010/08/am-i-a-sim.html&quot;&gt;Am I A Sim?&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;http://www.marginalrevolution.com/marginalrevolution/2009/12/asteroid-deflection-as-a-public-good.html&quot;&gt;Asteroid Deflection as a Public Good&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The first example is Robin Hanson demonstrating his estimation of the simulation argument. The second example is Tyler Cowen and Alex Tabarrok presenting the reasons for their evaluation of the importance of asteroid deflection.&lt;/p&gt;
&lt;h2&gt;Reasons:&lt;/h2&gt;
&lt;p&gt;I'm wary of using inferences derived from reasonable but unproven hypothesis as foundations for further speculative thinking and calls for action. Although the SIAI does a good job on stating reasons to justify its existence and monetary support, it does neither substantiate its initial premises to an extent that an outsider could draw the conclusions about the probability of associated risks nor does it clarify its position regarding contemporary research in a concise and systematic way. Nevertheless such estimations are given, such as that there is a high likelihood of humanity's demise given that we develop superhuman artificial general intelligence without first defining mathematically how to prove the benevolence of the former. But those estimations are not outlined, no decision procedure is provided on how to arrive at the given numbers. One cannot reassess the estimations without the necessary variables and formulas. This I believe is unsatisfactory, it lacks transparency and a foundational and reproducible corroboration of one's first principles. This is not to say that it is wrong to state probability estimations and update them given new evidence, but that although those ideas can very well serve as an urge to caution they are not compelling without further substantiation.&lt;/p&gt;
&lt;hr&gt;
&lt;p&gt;1. If anyone is actively trying to build advanced AGI succeeds, we&amp;#x2019;re highly likely to cause an involuntary end to the human race.&lt;/p&gt;
&lt;p&gt;2. Stop taking the numbers so damn seriously, and think in terms of subjective probability distributions [...], Michael Anissimov (&lt;a href=&quot;http://xixidu.tumblr.com/post/797197173/stop-taking-the-numbers-so-damn-seriously-and&quot;&gt;existential.ieet.org mailing list, 2010-07-11&lt;/a&gt;)&lt;/p&gt;
&lt;p&gt;3. Could being overcautious be itself an existential risk that might significantly outweigh the risk(s) posed by the subject of caution? Suppose that most civilizations err on the side of caution. This might cause them to either evolve much slower so that the chance of a fatal natural disaster to occur before sufficient technology is developed to survive it, rises to 100%, or stops them from evolving at all for being unable to prove something being 100% safe before trying it and thus never taking the necessary steps to become less vulnerable to naturally existing existential risks.&lt;em&gt; &lt;/em&gt;Further reading: &lt;a href=&quot;/lw/10n/why_safety_is_not_safe/&quot;&gt;Why safety is not safe&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;4. If one pulled a random mind from the space of all possible minds, the odds of it being friendly to humans (as opposed to, e.g., utterly ignoring us, and being willing to repurpose our molecules for its own ends) are very low.&lt;/p&gt;
&lt;p&gt;5. Loss or impairment of the ability to make decisions or act independently.&lt;/p&gt;
&lt;p&gt;6. The Fermi paradox does allow for and provide the only conclusions and data we can analyze that amount to &lt;em&gt;empirical criticism&lt;/em&gt; of concepts like that of a &lt;a href=&quot;http://wiki.lesswrong.com/wiki/Paperclip_maximizer&quot;&gt;Paperclip maximizer&lt;/a&gt; and general risks from superhuman AI's with non-human values without working directly on AGI to test those hypothesis ourselves. If you accept the premise that life is &lt;em&gt;not&lt;/em&gt; unique and special then one other technological civilisation in the observable universe should be sufficient to leave potentially observable traces of technological tinkering. Due to the absence of any signs of intelligence out there, especially paper-clippers burning the cosmic commons, we might conclude that unfriendly AI could not be the most dangerous existential risk that we should worry about.&lt;/p&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/lw/304/what_i_would_like_the_siai_to_publish/#comments"&gt;218 comments&lt;/a&gt;
</description>
</item>
</channel>
</rss>