<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:media="http://search.yahoo.com/mrss/">
<channel>
<title>
Top scoring articles - Less Wrong
</title> <link>http://lesswrong.com/</link>
<description></description>
<item>
<title>Privileging the Question</title>
<link>http://lesswrong.com/lw/hba/privileging_the_question/</link>
<guid isPermaLink="true">http://lesswrong.com/lw/hba/privileging_the_question/</guid>
<pubDate>Tue, 30 Apr 2013 04:30:35 +1000</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/Qiaochu_Yuan"&gt;Qiaochu_Yuan&lt;/a&gt;
&amp;bull;
90 votes
&amp;bull;
&lt;a href="http://lesswrong.com/lw/hba/privileging_the_question/#comments"&gt;281 comments&lt;/a&gt;
&lt;div&gt;&lt;p&gt;&lt;strong&gt;Related to:&lt;/strong&gt; &lt;a href=&quot;/lw/19m/privileging_the_hypothesis/&quot;&gt;Privileging the Hypothesis&lt;/a&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Remember the exercises in critical reading you did in school, where you had to look at a piece of writing and step back and ask whether the author was telling the whole truth? If you really want to be a critical reader, it turns out you have to step back one step further, and ask not just whether the author is telling the truth, but &lt;em&gt;why he's writing about this subject at all.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;-- &lt;a href=&quot;http://www.paulgraham.com/submarine.html&quot;&gt;Paul Graham&lt;/a&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;There's an old saying in the public opinion business: we can't tell people what to think, but we can tell them what to think about.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;-- Doug Henwood&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Many philosophers&amp;#x2014;particularly amateur philosophers, and ancient philosophers&amp;#x2014;share a dangerous instinct: If you give them a question, they try to answer it.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;-- &lt;a href=&quot;/lw/of/dissolving_the_question/&quot;&gt;Eliezer Yudkowsky&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Here are some political questions that seem to commonly get discussed in US media: should gay marriage be legal? Should Congress pass stricter gun control laws? Should immigration policy be tightened or relaxed?&amp;#xA0;&lt;/p&gt;
&lt;p&gt;These are all examples of what I'll call &lt;strong&gt;privileged questions&lt;/strong&gt;&amp;#xA0;(if there's an existing term for this, let me know):&lt;strong&gt;&amp;#xA0;&lt;/strong&gt;questions that someone has unjustifiably brought to your attention in the same way that a privileged hypothesis unjustifiably gets brought to your attention. The questions above are probably not the most important questions we could be answering right now, even in politics (I'd guess that the economy is more important). Outside of politics, many LWers probably think &quot;what can we do about &lt;a href=&quot;http://wiki.lesswrong.com/wiki/Existential_risk&quot;&gt;existential risks&lt;/a&gt;?&quot; is one of the most important questions to answer, or possibly &quot;how do we &lt;a href=&quot;/lw/3gj/efficient_charity_do_unto_others/&quot;&gt;optimize charity&lt;/a&gt;?&quot;&amp;#xA0;&lt;/p&gt;
&lt;p&gt;Why has the media privileged these questions? I'd guess that the media is incentivized to ask whatever questions will get them the most views. That's a very different goal from asking the most important questions, and is one reason to stop paying attention to the media.&amp;#xA0;&lt;/p&gt;
&lt;p&gt;The problem with privileged questions is that you only have so much attention to spare. Attention paid to a question that has been privileged &lt;a href=&quot;http://en.wikipedia.org/wiki/Fungibility&quot;&gt;funges against&lt;/a&gt; attention you could be paying to better questions. Even worse, it may not feel from the inside like anything is wrong: you can apply all of the epistemic rationality in the world to answering a question like &quot;should Congress pass stricter gun control laws?&quot; and never once ask yourself where that question came from and whether there are better questions you could be answering instead.&lt;/p&gt;
&lt;p&gt;I suspect this is a problem in academia too. &lt;a href=&quot;http://www.cs.virginia.edu/~robins/YouAndYourResearch.html&quot;&gt;Richard Hamming&lt;/a&gt; once gave a talk in which he related the following story:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Over on the other side of the dining hall was a chemistry table. I had worked with one of the fellows, Dave McCall; furthermore he was courting our secretary at the time. I went over and said, &quot;Do you mind if I join you?&quot; They can't say no, so I started eating with them for a while. And I started asking, &quot;What are the important problems of your field?&quot; And after a week or so, &quot;What important problems are you working on?&quot; And after some more time I came in one day and said, &quot;If what you are doing is not important, and if you don't think it is going to lead to something important, why are you at Bell Labs working on it?&quot; I wasn't welcomed after that; I had to find somebody else to eat with!&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Academics answer questions that have been privileged in various ways: perhaps the questions their advisor was interested in, or the questions they'll most easily be able to publish papers on. Neither of these are necessarily well-correlated with the most important questions.&amp;#xA0;&lt;/p&gt;
&lt;p&gt;So far I've found one tool that helps combat the worst privileged questions, which is to ask the following counter-question:&lt;/p&gt;
&lt;p&gt;&lt;em&gt;What do I plan on doing with an answer to this question?&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;With the worst privileged questions I frequently find that the answer is &quot;nothing,&quot; sometimes with the follow-up answer &quot;signaling?&quot; That's a bad sign. (&lt;strong&gt;Edit:&lt;/strong&gt;&amp;#xA0;but &quot;nothing&quot; is different from &quot;I'm just curious,&quot; say in the context of an interesting mathematical or scientific question that isn't motivated by a practical concern. Intellectual curiosity can be a useful heuristic.)&lt;/p&gt;
&lt;p&gt;(I've also found the above counter-question generally useful for dealing with questions. For example, it's one way to notice when a question should be &lt;a href=&quot;/lw/of/dissolving_the_question/&quot;&gt;dissolved&lt;/a&gt;, and asked of someone else it's one way to help both of you clarify what they actually want to know.)&lt;/p&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/lw/hba/privileging_the_question/#comments"&gt;281 comments&lt;/a&gt;
</description>
</item>
<item>
<title>Maximizing Your Donations via a Job</title>
<link>http://lesswrong.com/lw/hd1/maximizing_your_donations_via_a_job/</link>
<guid isPermaLink="true">http://lesswrong.com/lw/hd1/maximizing_your_donations_via_a_job/</guid>
<pubDate>Sun, 05 May 2013 23:19:05 +0000</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/Alexei"&gt;Alexei&lt;/a&gt;
&amp;bull;
85 votes
&amp;bull;
&lt;a href="http://lesswrong.com/lw/hd1/maximizing_your_donations_via_a_job/#comments"&gt;43 comments&lt;/a&gt;
&lt;div&gt;&lt;p&gt;In November of 2012 I set a goal for myself: find the most x-risk reducing role I can fill. At first I thought it would be by working directly with &lt;a href=&quot;http://intelligence.org/&quot;&gt;MIRI&lt;/a&gt;, but after a while it became clear that I could contribute more by simply donating. So my goal became: find the highest paying job, so I can donate lots of money to &lt;a href=&quot;http://appliedrationality.org/&quot;&gt;CFAR&lt;/a&gt; and &lt;a href=&quot;http://intelligence.org/&quot;&gt;MIRI&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;A little bit of background on me. Started programming in 2000. Graduated in 2009 with Bachelor's in computer science. Worked for about a year and a half at a game company. Then did my own game startup for about a year. Then moved to the bay area and joined a game startup here, which was acquired 10 months later. Worked a bit at the new company and then left. So, just under four years of professional programming experience, but primarily in the game industry. Almost no leadership / managerial experience, aside from the startup I did where I hired freelancers.&lt;/p&gt;
&lt;p&gt;Below is my experience of finding a software engineering job in the Silicon Valley. If you are not an engineer or not in the Silicon Valley, I think you'll still find a lot of useful information here.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h2&gt;&lt;strong&gt;Pre-game&lt;/strong&gt;&lt;/h2&gt;
&lt;p&gt;Before sending out my resume, I spent about a month preparing. I read &lt;a href=&quot;http://www.amazon.com/Introduction-Algorithms-Thomas-H-Cormen/dp/0262033844/ref=sr_1_1?ie=UTF8&amp;amp;qid=1367534120&amp;amp;sr=8-1&amp;amp;keywords=intro+to+algorithms&quot; target=&quot;_blank&quot;&gt;Intro to Algorithms&lt;/a&gt;, which was very good overall, but not a huge help in preparing for interviews.&lt;a href=&quot;/lw/hd1/maximizing_your_donations_via_a_job/#1&quot;&gt;[1]&lt;/a&gt; I read &lt;a href=&quot;http://www.amazon.com/Cracking-Coding-Interview-Programming-Questions/dp/098478280X/ref=sr_1_1?s=books&amp;amp;ie=UTF8&amp;amp;qid=1367534155&amp;amp;sr=1-1&amp;amp;keywords=cracking+the+coding+interview&quot; target=&quot;_blank&quot;&gt;Cracking the Coding Interview&lt;/a&gt;, which was extremely helpful. (If you read only one book to prepare, make it this one.) The book has a lot of questions that are similar to the ones you'll actually see during interviews. I also did TopCoder problems, which were pretty helpful as well.&lt;a href=&quot;/lw/hd1/maximizing_your_donations_via_a_job/#2&quot;&gt;[2]&lt;/a&gt; Looking back, I wish I spent more time finding actual interview questions online and doing more of those (that's why CCI book was so helpful).&lt;/p&gt;
&lt;p&gt;After several weeks of&amp;#xA0;preparation, I compiled a long list of companies I was going to apply to. I checked on &lt;a href=&quot;http://www.glassdoor.com/index.htm&quot;&gt;GlassDoor&lt;/a&gt; to see what kind of salary I could expect at each one. I then rated all the companies. Companies with low salaries and poor personal fit received the lowest rating.&lt;/p&gt;
&lt;p&gt;I started by applying to companies with the lowest ratings. This way I could use them as practice for the companies I thought would actually make a competitive offer. This was the right move and worked very well. (Another friend of mine did the same approach with good results as well.) Remember, you are not just doing those interviews to practice the coding problems, you are practicing pitching yourself as well.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h2&gt;&lt;strong&gt;Interviewing with a company&lt;/strong&gt;&lt;/h2&gt;
&lt;p&gt;Standard procedure for applying to a tech company:&lt;/p&gt;
&lt;p&gt;1. Send them your resume.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Proofread your resume. Let your friends proofread it.&lt;/li&gt;
&lt;li&gt;Make sure there are only relevant things on it. When I applied to tech companies, I removed a lot of game-specific things from my resume. When I applied to companies that did 3D graphics, I made sure I had all my 3D graphics experience listed. I ended up with two version of my resume.&lt;/li&gt;
&lt;li&gt;Have your resume in DOC, PDF, and TXT formats. This way you'll always have the right one when you upload / paste it.&lt;/li&gt;
&lt;li&gt;For a few companies, I had a friend or friend of a friend who referred me. This REALLY HELPS in two ways: 1) your resume will be processed a lot faster, 2) if your friend is a great engineer/employee, you'll be taken a lot more seriously, and the company will fight for you a lot harder.&lt;/li&gt;
&lt;/ul&gt;
&lt;div&gt;&lt;br&gt;&lt;/div&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;2. You'll get an email from the recruiter and setup a time to speak, where you'll talk about yourself, what you've done, why you are interested in their company, and so on. You can and should ask them questions as well.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;When you start getting multiple calls each day, make sure you know who is calling. There is nothing worse than talking about the challenges of streaming music to a car sharing startup. (True story.)&lt;/li&gt;
&lt;li&gt;Read about the company on Wikipedia before the call. Know the basic stuff. Look at their website and read the About page.&lt;/li&gt;
&lt;li&gt;Find the thing that makes the company special and successful. Find the thing that you actually think is cool about the company. Those are your answers for why you want to work there.&lt;/li&gt;
&lt;li&gt;Ask non-technical questions: How is the company structured? How many teams are there? How many employees? Engineers? Think of other intelligent questions to ask.&lt;/li&gt;
&lt;li&gt;In my experience, it's not very beneficial to tell them you are interviewing with a dozen other companies. When they ask who else you are interviewing with, just name a few companies, especially the competitors / similar companies.&lt;/li&gt;
&lt;li&gt;Be SUPER NICE to your recruiter. They are your main point of contact with the company. They'll be the one fighting to get you the best offer.&lt;/li&gt;
&lt;/ul&gt;
&lt;div&gt;&lt;br&gt;&lt;/div&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;3. You'll have a technical phone interview with a software engineer where you'll solve a problem or two on &lt;a href=&quot;http://collabedit.com/&quot; target=&quot;_blank&quot;&gt;collabedit&lt;/a&gt; or some similar website. At the end, you'll get a few minutes to ask them questions too.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;All the usual interviewing tips apply here. E.g. talk out loud, your interviewer doesn't know what you are thinking.&lt;/li&gt;
&lt;li&gt;Most companies don't care what language you use, as long as it's mainstream. (I used C# for almost all my coding questions.)&lt;/li&gt;
&lt;li&gt;DO NOT start answering the question by writing code. If the questions seems vague, ask about the context. Who'll be using this solution? Definitely ask about the kind of data you are working with. If it's integers, are they random? Over some small range or over all possible integers?&lt;/li&gt;
&lt;li&gt;List out metrics for various approaches: brute-force solution, optimized for speed solution, optimized for memory solution. Here is a question I saw a few times: Write a data structure which can accept and store integers, and can check if there exist two integers that sum up to a given number. There are multiple solutions, and the best one depends on the ratio of addInteger to checkForSum calls.&lt;/li&gt;
&lt;li&gt;The previous steps should only take you a minute or two. Once you've decided what the best approach is, then you can write the solution. When you are done, check for errors, then run through several examples. Do a simple example and a slightly complicated example. When you find a bug, don't be hasty in fixing it. Understand why it happened and make sure you won't introduce new bugs by fixing it.&lt;/li&gt;
&lt;li&gt;If everything works, make sure you handle errors correctly. Can you handle invalid input? Input that violates your assumptions? (As a reminder, I leave &amp;#x201C;\\Check for errors&amp;#x201D; comments in appropriate spots as I code the solution.)&lt;/li&gt;
&lt;li&gt;When you are done, ask the interviewer questions. Ask them to tell you about what they do, if they haven't already. What have they been working on recently? What technologies/languages do they use at the company? Do they use Scrum/Agile? Pair-programming? Come up with other intelligent questions to ask.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;p&gt;4. You'll be invited for an on-site interview which will be 3-6 hours long, at least half of which will be coding on a white-board. (Although, a friend told me he brought his laptop with him, and most people were fine with him coding on it.)&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;All the previous tips apply.&lt;/li&gt;
&lt;li&gt;Be on time. Take bathroom breaks when you need them. I found that drinking water during the interview keeps me refreshed. Remember your posture, body-language, and eye-contact skills.&lt;/li&gt;
&lt;li&gt;Learn how to talk out loud as you are writing out your solution. If you are stuck, explain what you are thinking, and what your intuition is telling you.&lt;/li&gt;
&lt;li&gt;Learn how to read your interviewers. If you say, &quot;Here we should check for the null case or for empty array,&quot; and they go &quot;Yeah, yeah, okay,&quot; they are not the type of interviewer that really cares about error conditions, so you can be somewhat more lax there. By the time I was finishing my on-site interviews, I could tell if my solution was right just by the interviewer's body language.&lt;/li&gt;
&lt;li&gt;When you are done, ask them questions. What are they working on? What's the thing they like most about the company? What's their least favorite thing about the company? (Another way to phrase that: What's one thing you would change if you could change anything about the company?) Do they have to work overtime? How are the people here? Can you switch between projects? Are there company wide events? In all my interviews I've never met an interviewer that didn't try to sell their company really hard. People will always tell you their company is the best place to work.&lt;/li&gt;
&lt;li&gt;If the person is a manager or a director, ask them higher level questions. What kind of culture are they trying to create? What are the current big challenges? Where do they want the company to be in the next 5 years? How does one advance in the company? (Usually there is a managerial and a technical track.) How often are reviews done? How are they structured?&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;p&gt;5. You'll get a call from the recruiter congratulating you on an offer. They'll go over the offer details with you.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Before they make you an offer, they'll check if you are actually seriously considering their company. If you told a startup you are also interviewing with Google, they might suspect that you are not seriously considering them. Unless you dissuade those fears, they might actually not even make you an offer. (Happened to me with Rdio.)&lt;/li&gt;
&lt;li&gt;If you didn't get an offer, try to get as much info as you can. What happened? What can you improve on? Below are the reasons why I didn't get an offer after an on-site interview: &lt;ul&gt;
&lt;li&gt;Not doing well on a technical question. (Happened twice; one time because of a very obnoxious interviewer.)&lt;/li&gt;
&lt;li&gt;Not interviewing for quite the right position (that on-site interview ended early).&lt;/li&gt;
&lt;li&gt;Not having the necessary experience (a lot more important to startups than bigger companies).&lt;/li&gt;
&lt;li&gt;Not being passionate enough about the company.&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;If this is not a good timing for the offer, e.g. it's one of your first interviews, then tell them so. They will probably wait to give you the offer details until you are ready to consider it.&lt;/li&gt;
&lt;li&gt;The recruiter will likely ask what's important to you in an offer. How are you going to make your decision? What I've said is that compensation will be an important factor in my decision, but that the team/project/etc. are important considerations as well.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;p&gt;6. You have a few days (usually around 5 business days) until the offer expires to decide if you want to accept it.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Sometimes the offer will expire before you've received offers from other companies. This is why it's important to interview in rough order of ranking, so that you can just let those offers go, knowing you'll have much better ones soon. If you want to hold on to the offer, just ask your recruiter for an extension. It'll be much easier to get an extension at big companies, especially if you are interviewing for a generic position.&lt;/li&gt;
&lt;li&gt;If you decline the offer, let them know.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;p&gt;Always be very nice, friendly, and polite. Walk the fine line between telling the truth and saying the right thing. Ideally, make sure those are the same. Even if you are interviewing with a company you have no intention of working at, make sure to find something you really like about them, something that makes them stand out to you. Always have a good answer to: &quot;Why do you want to work here?&quot;&lt;/p&gt;
&lt;p&gt;Before each on-site interview make sure you research the company thoroughly. Use their product. Think of ways to improve it. It's very helpful if you can meet with someone that works there and talk to them. See if they can give you any tips on the interview process. Some companies (e.g. AirBnB) want people that are &lt;em&gt;extremely&lt;/em&gt; passionate about their product. Some companies focus more than usual on architectural questions. Many companies expect the engineers to have some familiarity with UI/UX and the ability to think about a feature from all angles.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h2&gt;&lt;strong&gt;Managing your time&lt;/strong&gt;&lt;/h2&gt;
&lt;p&gt;I sent my resume to 78 companies, had at least a phone conversation with a recruiter with 27 of them, had an on-site interview with 16 companies, and received 12 offers. Out of those, I've only seriously considered 3. (Companies with lower ratings had an atrocious response rate.)&lt;/p&gt;
&lt;p&gt;My time-line ended up looking something like this:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Week 1: Started applying to low-rated companies. About 2 phone interviews.&lt;/li&gt;
&lt;li&gt;Week 2: About 7 phone interviews. One on-site interview. Sending out more resumes.&lt;/li&gt;
&lt;li&gt;Week 3: About 3 phone interviews.&lt;/li&gt;
&lt;li&gt;Week 4: About 15 phone interviews. A few meetings with friends of friends, who ended up referring me. 1 on-site interview. Sent my resume to all the high-rated companies. (During this week interviewing became a full-time job.)&lt;/li&gt;
&lt;li&gt;Week 5: About 10 phone interviews. 4 on-site interviews.&lt;/li&gt;
&lt;li&gt;Week 6: 8 phone interviews. 4 on-site interviews.&lt;/li&gt;
&lt;li&gt;Week 7: 4 phone calls. 5 on-site interviews.&lt;/li&gt;
&lt;li&gt;Week 8: 12 phone calls. 2 on-site interviews.&lt;/li&gt;
&lt;li&gt;Week 9: About 8 calls a day for a few days, while I negotiated with my top companies.&lt;/li&gt;
&lt;li&gt;(These are strictly lower bounds for phone calls. On-site data is pretty accurate.)&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Some companies move fast, some companies move slow. Google took 2 weeks from the on-site interview to the offer call. This is very common for them, but most other companies move faster. With Amazon, I actually interviewed with two different branches. With one branch things were going well, until they dropped the ball and never got back to me, even after I pestered them. This is unusual; although, Twitter did something similar, but then ended up responding with an on-site invitation. With the other Amazon branch, when I got home from the on-site interview, I already had an email saying they were going to make an offer. This is extremely fast. (I had a very good reference for that position.) Most companies take about a week between on-site and offer. The whole process, from first call to offer, takes about three weeks.&lt;/p&gt;
&lt;p&gt;If your recruiter doesn't respond to you during 4 days or longer, shoot them an email. They might have forgotten to respond, or thought they did, or may be things are moving slowly, or may be they decided not to pursue. You want to be clear on where you stand with all the companies you are applying to.&lt;/p&gt;
&lt;p&gt;The timing is pretty important here. You want your top-rated companies to give you an offer within a span of a week. This way you'll be able to leverage all those offers against each other.&lt;/p&gt;
&lt;p&gt;If your current job position is already almost optimal for your goals, then it's possible you can do a few interviews, get a few offers and pick the best one, which will give you some marginal improvement. Or use those offers to leverage a raise at your existing company. But if you are pretty sure your current job has not been optimized for your goal, then I'd say, contrary to popular wisdom, just leave and spend a full month interviewing. (Or, even better, if you can, take a long &quot;vacation&quot;.) You just can't do this kind of intense interviewing while holding another job. The one exception to this rule I can think of is if one of your highest-rated companies is a competitor with your current employer. Then you can leverage that!&lt;/p&gt;
&lt;p&gt;Value of information is extremely high during this process. Talk to all the companies you can, talk to all the people you can. Once you have the final list of companies you are considering, reduce your uncertainty on everything. Validate all your assumptions. (Example: I was &lt;em&gt;sure&lt;/em&gt; Google matched donations up to $12k, but turns out it's only up to $6k.)&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h2&gt;&lt;strong&gt;How to evaluate your offer&lt;/strong&gt;&lt;/h2&gt;
&lt;p&gt;There are 4 basic components in an offer: sign-on bonus, base salary, equity, and bonus.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sign-on bonus.&lt;/strong&gt; Most companies will be okay offering something like $12k sign-on bonus. Some will offer more. Most startups probably won't offer any.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Base salary.&lt;/strong&gt; This is pretty consistent across most companies. Based on your experience, you'll be given a title (e.g. Senior Software Engineer or SE 2), and that title will determine the range of the salary you can expect. If you are good, you can demand a salary at the top of that range, but it's extremely hard to go higher.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Equity.&lt;/strong&gt; This is the most interesting part. A good amount of value will come from this portion. With a startup, it'll be most of it. Here are two things to pay attention to:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Is the company public or private? If it's public, you are most likely going to be given RSUs (restricted stock units), which will basically convert to normal company shares when they vest. For private companies, see the section below.&lt;/li&gt;
&lt;li&gt;What's the vesting schedule? For almost all companies you'll get 25% of your shares right after your first year. (This is called a 'cliff'.) After that you'll be given the appropriate fraction either monthly (e.g. at Google) or quarterly (e.g. at Facebook). Amazon is an example of a company where the vesting schedule is somewhat different: 5% after year 1, 15% after year 2, and then 20% each semester for the next two years.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Bonus.&lt;/strong&gt; This is the bonus system the company has setup. You can't negotiate it, but it's important to take it into account.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;There will usually be a cash bonus that's based on your salary. It'll have a target percent (e.g. 15%). If you can find out how many people hit their target, that will be very helpful. However, most companies don't share or simply don't have that information.&lt;/li&gt;
&lt;li&gt;Some companies also have equity bonuses. Try to get as much info on those as you can. Don't assume that you'll get the maximum bonus even if you work hard. If you have friends working at that company, ask them what kind of bonuses they've been getting.&lt;/li&gt;
&lt;li&gt;Lots of startups don't have bonus systems in place.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Other factors.&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Donation matching: Google matches up to $6k (you donate $6k to any charity, they'll donate another $6k). Craigslist matches 3:1 up to 10% of your salary. Most companies don't have anything like that, and you can't negotiate it.&lt;/li&gt;
&lt;li&gt;Paid Time Off: Google offers 2 weeks, all other companies I was considering offer 3 weeks, and some even have unlimited PTO. This is not negotiable in most companies.&lt;/li&gt;
&lt;li&gt;Commute: how far will you have to travel to work? Are you okay moving closer to work? (Google and Facebook have shuttles that can pick you up almost anywhere, so you could work while you commute.)&lt;/li&gt;
&lt;li&gt;People/culture/community/team/project are all important factors as well, depending on what you want. If you are going to spend the next several years working on something, you should be building up skills that will still be valuable in the future.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h2&gt;&lt;strong&gt;Thinking about private companies&lt;/strong&gt;&lt;/h2&gt;
&lt;p&gt;If the company is private, you might be given RSUs or you might be given stock options. With stock options, you'll have to pay the strike price to exercise your options. So the total value your options have is: (price of a share - strike price) * number of shares.&lt;/p&gt;
&lt;p&gt;You can't do anything with your shares until the company gets acquired or goes public. Some companies have liquidation events, but those are pretty rare. Most companies don't have them, and the ones that do only extend the opportunity to people that have been with the company for a while. There are also second-hand markets, but I don't know much about those.&lt;/p&gt;
&lt;p&gt;If you are completely risk-intolerant, then just go with a public company, and don't consider private companies. (This is actually not exactly true. Just because a company is public, doesn't mean its risk-free, and just because a company is private doesn't mean there is a lot of risk. There are other important factors like the size of the company, their market diversity, and how long they've been around.) If you are okay with some risk, then you want a company that's close to an IPO or is likely to get acquired soon. If you want to have a chance to make more than a few million dollars, either start your own company or join a very early stage startup (my top pick would be &lt;a href=&quot;https://ripple.com/&quot;&gt;Ripple&lt;/a&gt;). Before doing so, check out the stats on startups to make sure you understand how likely any given startup is to fail and make sure you understand the concepts of inside/outside view.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h2&gt;&lt;strong&gt;Taxes&lt;/strong&gt;&lt;/h2&gt;
&lt;p&gt;It's crucial to understand all the tax implications of your salary, equity, and donations. I'm not going to go into all the details, there are a lot of resources out there for this, but you should definitely read them until it's crystal clear how you will be taxed. I'll highlight a few points:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Understand the &lt;a href=&quot;http://taxes.about.com/od/Federal-Income-Taxes/qt/Tax-Rates-For-The-2013-Tax-Year.htm&quot;&gt;tax rate schedule&lt;/a&gt; and notice the new 39.6% tax bracket. If your income is $100k, that doesn't mean you get taxed 28% on all of it. 28% applies only to the income portion above $87,850. Also note that this is only the federal tax. Your state will have additional taxes as well. Aside from those percentages, there are a few other flat taxes, but they are considerably smaller in magnitude.&lt;/li&gt;
&lt;li&gt;The money you donate to a&amp;#xA0;nonprofit&amp;#xA0;(aka.&amp;#xA0;501(c)(3)) organization can be subtracted from your taxable income. This means that you will most likely get a refund when you file your taxes. Why? Because when you fill out your W4 form, you'll basically tell your employer how much money to withhold from your paycheck for tax purposes. If you don't account for your future donations, more money will be withheld than is appropriate and the discrepancy will be paid back to you after you file your taxes. Ideally, you want to take your donations into account and fill out the W4 form such that there are no discrepancies. That means you'll get your money now rather than later. (I haven't gone through this process myself, so there is some uncertainty here.)&lt;/li&gt;
&lt;li&gt;You can claim tax deduction for up to 50% of your wages. That means if you make a lot of money in one year, even if you donate most of it, you'll be able to reduce your taxable income by a maximum of 50%. The rest goes over to the next year.&lt;/li&gt;
&lt;li&gt;When RSUs vest, their value is treated as ordinary income for tax purposes. When you sell them, the difference is taxed as a capital gain (or loss).&lt;/li&gt;
&lt;li&gt;Stock options have a more complicated set of tax rules, and you should understand them if you are considering a company that offers them.&lt;/li&gt;
&lt;li&gt;You can't have your employer donate money or stock for you to bypass the taxes. I've asked.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h2&gt;&lt;strong&gt;Calculating donations&lt;/strong&gt;&lt;/h2&gt;
&lt;p&gt;To calculate exactly how much I could donate if I worked at a given company, I've created &lt;a href=&quot;https://dl.dropboxusercontent.com/u/30954211/SalariesExample.xls&quot;&gt;this spreadsheet&lt;/a&gt;. (This is an example with completely fictitious company offers with very low numbers, but the calculations should be correct.) Let me walk you through the spreadsheet.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Time discounting (Cell B1)&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Money now is more valuable than money later. By how much? That's a very complicated question. If you invest your money now, you might be able to make something like 10%&amp;#xA0;annually with some risk.&lt;a href=&quot;/lw/hd1/maximizing_your_donations_via_a_job/#1&quot;&gt;[3]&lt;/a&gt; If you are donating to a charity, and they are growing very rapidly, then they can do a lot with your money right now, and you should account for that as well. If you expect the charity to double in size/effectiveness/output in the next year, then you might use a discount rate as high as 50%. I chose to use 20% annual discount rate based on my own estimates. Since I'm doing monthly compounding, the spreadsheet value is slightly higher (~22%). You can look at the column K to see how the future value of a dollar is being discounted. Note, for example, that a dollar in 12 months is worth 80&amp;#xA2;&amp;#xA0;to me now. This discounting rate is especially important to keep in mind when examining startups, because almost all their compensation lies in the future. The further away it is, the more heavily you have to discount it.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cost of living (Cell B2)&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This is how much pre-tax money a year I'm not going to donate. See column L for the monthly expenses. We time-discount those dollars as well.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Offers (Cells A4-I15)&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This is where you plug-in the offers you get. &lt;em&gt;Bonus&lt;/em&gt; row is for cash bonus. &lt;em&gt;Equity&lt;/em&gt; row is for the total equity the company offers you. I use the dollar amount, but you'll notice that for some of them I'm computing the dollar amount as: RSUs the company is giving me * current share price. For private companies, this is value I expect my equity to have when the company goes public. For Square it looks like: (percent of the company I'll own) * (my guess at valuation of the company at IPO) - (cost to exercise my options). For Twitter it looks like: (growth factor up to IPO) * (current price per share) * (RSUs I am granted). (Again, the numbers are completely made up.) In my calculations I'm not expecting public companies' share price to rise or fall. If you disagree, you should adjust for that as well.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Monthly projections (Cells A18-I66)&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;We are going to look at how much money we'll be making per month for the next four years. (Four years because our equity should be fully vested by that time.) If you are certain that you will stay at the company for less time than that, then you should consider a shorter timeline. This might affect companies differently. For example, most of the equity you get at Amazon comes during the last two years. If you are not going to be there, you are missing out on a big part of your offer.&lt;/p&gt;
&lt;p&gt;For companies that I was seriously considering, I created two columns: one for cash wages and one for equity wages. This way I can do taxes on them more precisely.&lt;/p&gt;
&lt;p&gt;Let's go through the Google's offer:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;For the first year we'll be only making our standard salary.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;After the first year, we get our cash bonus (green font). Here we are assuming it'll be 15% of our salary. We also get 25% of our RSUs vested (salmon background).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;For the remainder of the second year, we are making our normal salary. Each month we also get 1/48&lt;sup&gt;th&lt;/sup&gt; of our original equity offer.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Google also has an equity bonus system, where each year you can get a bonus of up to 50% of your original equity offer. This bonus will be paid in RSUs, and it vests over 4 years, but with no cliff. So we count that as well, but I'm assuming I'm only going to get 15%, not the full 50%.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;In year 3 everything is basically the same, except now we got our second equity bonus, so we have two of them running simultaneously.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;In year 4, we have three of them running simultaneously.&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;For pre-IPO companies, I've estimated when they'll go IPO. Most have clauses in place that don't allow you to sell your shares until after half a year or so after the IPO. I'm assuming I will sell/donate all my shares then, and then continue selling/donating them as they continue vesting.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sum (Cells A68-I71)&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In row 68 we have the total sum. This is the amount of pre-tax dollars we expect to earn in the next four years (remember that this amount has been adjusted for time-discounting, so it'll seem much lower than you'd normally expect). L68 is how much money we are spending on ourselves during those four years.&lt;/p&gt;
&lt;p&gt;In row 69 we subtract our living expenses to get the amount of money we'll be able to donate. Note that I'm subtracting it from the cash column, leaving the equity column alone (for the companies where I split the two).&lt;/p&gt;
&lt;p&gt;In row 70 we account for taxes. Note that our living expenses already accounted for the taxes we pay up to $65k, so the rest of it will be taxed at around 28% or higher. You could sell your shares, or you could just donate your shares directly to your charity. (That's what we are doing with our Google offer.)&lt;/p&gt;
&lt;p&gt;In row 71 we simply sum up the donations from cash and equity.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;p&gt;Disclaimer 1: while I tried as hard as I could to double check this spreadsheet, there might still be mistakes there, so use it with caution and triple check everything. The tax calculations as they are right now are wrong, and you'll have to redo them (basically the whole Row 70) based on your own numbers.&lt;/p&gt;
&lt;p&gt;Disclaimer 2: this spreadsheet is not great for evaluating an offer from a startup, since it doesn't capture the associated uncertainty and risk. Furthermore, if you expect the startup to succeed after more than 4 years, to correctly compare it to other companies you'll have to compute more than 48 months and potentially start accounting for things like promotions and raises.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h2&gt;&lt;strong&gt;Picking the one&lt;/strong&gt;&lt;/h2&gt;
&lt;p&gt;All right, so how do you actually pick the best company? It's not as simple as picking the one with the highest EV, since you have to account for risk involved with startups and even pre-IPO companies. In fact, you should be surprised if your offers from public companies have a higher EV than offers from startups. If that's the case, I'd double check your calculations.&lt;/p&gt;
&lt;p&gt;This is where it becomes extremely crucial to narrow down your uncertainty. When is the company going to IPO? What is the likely valuation? Does the company have a lot of competitors? Does the company have the necessary talent to execute on their plan? What's the company's history? What is the employee churn rate (especially for executives)? How well is the company doing financially? Who are the investors? Etc, etc, etc... There is a ton of questions you should be asking, and you should be asking them to everyone whose opinion on this issue you can respect. Honest opinion from an informed and knowledgeable neutral party is worth a LOT here!&lt;/p&gt;
&lt;p&gt;You should also talk to the people at the company. Your recruiter will connect you to the right people if you ask. Keep in mind that nobody there will tell you that the company is going to go bankrupt or fail. But you can still get some valuable estimates, and then potentially discount them down a bit. You can even ask for their opinion on other companies you are interviewing with. Expect them to completely throw the other company under the bus though, but even so, you could get a lot of valuable criticism and bring it up when you talk to that other company. Overall, expect a lot of conflicting messages.&lt;/p&gt;
&lt;p&gt;Keep in mind the charities you'll be donating to. What kind of donors do they have already? Are most people donating a bit from their salary? In that case, a more risky venture might be reasonable. Can they really use some money right now, or would they be a lot more effective later on with a large capital? What's their time discount rate? If you care about your charity, you can help them diversify their donor pool.&lt;/p&gt;
&lt;p&gt;For me, it was a hard choice between big public companies (primary candidate: Google) and close to IPO companies (primary candidates: Twitter and Square).&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h2&gt;&lt;strong&gt;Negotiating&lt;/strong&gt;&lt;/h2&gt;
&lt;p&gt;You have to negotiate your offer. You have to &lt;em&gt;have to&lt;/em&gt; &lt;strong&gt;have to&lt;/strong&gt; HAVE TO. For any given company, you'll be able to get them to up their offer at least once and potentially thrice. Example: Google upped my offer three times.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Some companies will tell you their offer is not negotiable. That's not true.&lt;/li&gt;
&lt;li&gt;It's much easier to leverage similar companies against each other. Leverage big public companies against each other; leverage pre-IPO companies against each other; etc... Leveraging between those categories is a bit more difficult, because startups know they can't compete with the raw cash value you are offered at bigger companies. The only thing they can do is up their equity offer and hope that they are a much better personal fit for you than the large companies.&lt;/li&gt;
&lt;li&gt;Recruiters will ask you very directly what the other companies are offering you. You can choose to disclose or not to disclose. If you don't disclose, the company will come back to you with their standard offer. That offer might be higher or lower than you expected. (Example: The first offer I got from Google was significantly worse than initial offers I got from Facebook and Amazon.) If you tell them what offers you have (and you should only disclose details of your very best offers), then they'll very likely match or come in a bit stronger. Usually you don't have much to gain by disclosing your other offers upfront. You can always do so later. However, you should let your recruiters know that other companies &lt;em&gt;did&lt;/em&gt; make an offer, or you are expecting them to. That gives you more leveraging power.&lt;/li&gt;
&lt;li&gt;Sign-on bonus is very easy to negotiate. You can easily convince a company to match a sign-on bonus their competitor has offered.&lt;/li&gt;
&lt;li&gt;Negotiating salary is much harder, but, again, usually you can convince a company to match a salary their competitor has offered or at least come closer to it. If you are interviewing with startups, their salary offer will usually be lower than at bigger companies and even harder to negotiate. (&quot;Cash is king&quot; is the common phrase used there.)&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;First&amp;#xA0;&lt;/strong&gt;&lt;strong&gt;negotiating&amp;#xA0;phase:&lt;/strong&gt; simply email / call back your recruiter (who is now your best friend, right?) and tell them that the offer is somewhat lower than you expected, you have other better offers from other companies, and you are wondering if they can increase their offer. If the company made you a clearly worse offer than another similar company, you should be very open about it.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Second negotiating phase:&lt;/strong&gt; matching other companies. This is when it makes the most sense to disclose your other offers. For example, I used my Amazon and Facebook offers to convince Google to up their offer significantly. For some reason their original offer was very low, but seeing their competitors with much better offers convinced them to update pretty quickly. You can also bring up the perks one company has that the other doesn't (e.g. donation matching or unlimited PTO). The company can make up for that with salary/equity. There is some difficulty in using offers from private companies as leverage, because there is not much information you can disclose about them. You can talk about the number of shares you'll have, but it might not mean anything to the other recruiters if they are not familiar with the startup.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Third&amp;#xA0;negotiating&amp;#xA0;phase:&lt;/strong&gt; once you picked the company you'll work for, go back to them and say something along the lines of &quot;I really like the offer and the company, but there are a few things that don't make it ideal for me. One of your competitors did this, and another company has that. Right now I'm inclined to go with your competitor, but it's a tough decision, and I would rather go with you. I think if you can make me an offer with the following parameters, it'll make my decision extremely easy, and I'll sign on the spot.&quot; Include offer letters from other companies, especially the ones that have them beat or beat on some parameters. Notice the key promise at the end: &lt;em&gt;you will sign with them&lt;/em&gt;. Your recruiter will have a lot more leverage in fighting for you if you make that promise. You are not legally obligated to follow through with your promise, but I wouldn't advise breaking it or using it just to extract more value to use as leverage against other companies. Use this tactic at the very end to extract that last bit of value from the company that's already the best. This is what I did with Google. I asked for about 3% higher salary and 12% more equity than what they were offering, and they came back with the exact numbers I requested, which means I should have asked for more. My advice would be to ask for about twice or may be even three times as much (6% and 30% respectively). Even if they come back with a compromise, it'll very likely be more than 3% and 12% increase. If not, you can try to barter one more time.&lt;/p&gt;
&lt;p&gt;I'm sure some people will cringe at this kind of haggling, but, in all honesty, this is what recruiters expect, and they are very much used to it. Nobody even blinked an eye when I started negotiating, even on second and third rounds. However, some recruiters might try to make you feel guilty. They'll say that if you really want to work at their startup, then you shouldn't really care about your compensation. Most points they'll make will even be valid, but if you are trying to optimize for donations, then you have to make the compensation the most important factor in your decision. I've actually told most of my recruiters that I plan to donate most of my salary to charities. I don't think that got me higher offers, but it made me come off less like a greedy jerk.&lt;/p&gt;
&lt;p&gt;At the end of the day, the company wants you, but they want to pay you as little as possible. &lt;em&gt;But&lt;/em&gt;, given the choice of having you and paying you the most you&amp;#xA0;deserve&amp;#xA0;VS. not having you, all companies will pick the first option. ALL OF THEM. This is one of the best perks of being a talented software engineer in the bay area.&lt;/p&gt;
&lt;p&gt;Once you accept the offer, don't forget to email everyone else and let them know. Thank everyone that helped you. Some recruiters will be surprised by your decision, and some will even fight really hard to get you to reconsider.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;hr&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;p&gt;&lt;a name=&quot;1&quot;&gt;&lt;/a&gt;[1] None of the interviews required a data structure more complicated than a heap. All the answers had a very easy to compute complexity, either polynomial, polynomial * logarithmic, or factorial. The most weird one was probably O(&amp;#x221A;n) for computing prime numbers.&lt;/p&gt;
&lt;p&gt;&lt;a name=&quot;2&quot;&gt;&lt;/a&gt;[2] Some problems I did during actual single-round match-up (SRM) competitions, which is good for training yourself how to code and think faster than you are used to. I also did a lot of old SRM problems, which have solutions and explanations posted in case I couldn't get them. I could easily do problem 1 &amp;amp; 2 in the easy division, and could do problem 3 most of the time. I didn't really bother with the hard division, and none of the interview questions were ever as hard as problem 3 in the easy division.&lt;/p&gt;
&lt;p&gt;&lt;a name=&quot;3&quot;&gt;&lt;/a&gt;[3] According to the comments, this number is too high. Pick your own best estimate.&lt;/p&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/lw/hd1/maximizing_your_donations_via_a_job/#comments"&gt;43 comments&lt;/a&gt;
</description>
</item>
<item>
<title>MetaMed: Evidence-Based Healthcare</title>
<link>http://lesswrong.com/lw/gvi/metamed_evidencebased_healthcare/</link>
<guid isPermaLink="true">http://lesswrong.com/lw/gvi/metamed_evidencebased_healthcare/</guid>
<pubDate>Wed, 06 Mar 2013 00:16:45 +1100</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/Eliezer_Yudkowsky"&gt;Eliezer_Yudkowsky&lt;/a&gt;
&amp;bull;
76 votes
&amp;bull;
&lt;a href="http://lesswrong.com/lw/gvi/metamed_evidencebased_healthcare/#comments"&gt;165 comments&lt;/a&gt;
&lt;div&gt;&lt;p&gt;In a world where 85% of doctors can't solve &lt;a href=&quot;http://library.mpib-berlin.mpg.de/ft/ps/PS_Teaching_2001.pdf&quot;&gt;simple Bayesian word problems&lt;/a&gt;...&lt;/p&gt;
&lt;p&gt;In a world where only 20.9% of reported results that a pharmaceutical company tries to investigate for development purposes, &lt;a href=&quot;http://online.wsj.com/article/SB10001424052970203764804577059841672541590.html&quot;&gt;fully replicate&lt;/a&gt;...&lt;/p&gt;
&lt;p&gt;In a world where &quot;&lt;a href=&quot;/lw/1gc/frequentist_statistics_are_frequently_subjective/&quot;&gt;p-values&lt;/a&gt;&quot; are &lt;a href=&quot;http://biomet.oxfordjournals.org/content/77/3/467.abstract&quot;&gt;anything the author wants them to be&lt;/a&gt;...&lt;/p&gt;
&lt;p&gt;...and where there are &lt;a href=&quot;http://www.cnn.com/2010/HEALTH/09/09/pinky.regeneration.surgery/index.html&quot;&gt;all sorts of amazing technologies and techniques&lt;/a&gt; which nobody at your hospital has ever heard of...&lt;/p&gt;
&lt;p&gt;...there's also &lt;a href=&quot;http://metamed.com/&quot;&gt;&lt;strong&gt;MetaMed&lt;/strong&gt;&lt;/a&gt;.&amp;#xA0;&amp;#xA0;Instead of just having &amp;#x201C;evidence-based medicine&amp;#x201D; in journals that doctors don't actually read, MetaMed will provide you with actual evidence-based healthcare. &amp;#xA0;Their Chairman and CTO is Jaan Tallinn (cofounder of Skype, major funder of xrisk-related endeavors), one of their major VCs is Peter Thiel (major funder of MIRI), their management includes some names LWers will find familiar, and their researchers know math and stats and in many cases have also read LessWrong. &amp;#xA0;If you have a sufficiently serious problem and can afford their service, MetaMed will (a) put someone on reading the relevant research literature who understands real statistics and can tell whether the paper is trustworthy; and (b) refer you to a cooperative doctor in their network who can carry out the therapies they find.&lt;/p&gt;
&lt;p&gt;MetaMed was partially inspired by the case of a woman who had her fingertip chopped off, was told by the hospital that she was screwed, and then read through an awful lot of literature on her own until she found someone working on an advanced regenerative therapy that let her actually &lt;a href=&quot;http://www.cnn.com/2010/HEALTH/09/09/pinky.regeneration.surgery/index.html&quot;&gt;grow the fingertip back&lt;/a&gt;. &amp;#xA0;The idea behind MetaMed isn't just that they will scour the literature to find how the best experimentally supported treatment differs from the average wisdom - people who regularly read LW will be aware that this is often a pretty large divergence - but that they will also look for this sort of very recent technology that most hospitals won't have heard about.&lt;/p&gt;
&lt;p&gt;This is a new service and it has to interact with the existing medical system, so they are currently expensive, starting at $5,000 for a research report. &amp;#xA0;(Keeping in mind that a basic report involves a lot of work by people who must be good at math.) &amp;#xA0;If you have a sick friend who can afford it - especially if the regular system is failing them, and they want (or you want) their next step to be&amp;#xA0;&lt;em&gt;more&lt;/em&gt;&amp;#xA0;science instead of &quot;alternative medicine&quot; or whatever - please do refer them to MetaMed&amp;#xA0;immediately. &amp;#xA0;We can&amp;#x2019;t all have nice things like this someday unless somebody pays for it while it&amp;#x2019;s still new and expensive. &amp;#xA0;And the regular healthcare system really is bad enough at science (especially in the US, but science is difficult everywhere) that there's no point in condemning anyone to it when they can afford better.&lt;/p&gt;
&lt;hr&gt;
&lt;p&gt;I also got my hands on a copy of MetaMed's standard list of citations that they use to support points to reporters. &amp;#xA0;What follows isn't nearly everything on MetaMed's list, just the items I found most interesting.&lt;/p&gt;
&lt;p&gt;&lt;a id=&quot;more&quot;&gt;&lt;/a&gt;&lt;/p&gt;
&lt;hr&gt;
&lt;p&gt;90% of preclinical cancer studies could not be replicated:&lt;br&gt;&lt;a href=&quot;http://www.nature.com/nature/journal/v483/n7391/full/483531a.html&quot;&gt;http://www.nature.com/nature/journal/v483/n7391/full/483531a.html&lt;/a&gt;&lt;/p&gt;
&lt;div&gt;&quot;It is frequently stated that it takes an average of 17 years for research evidence to reach clinical practice. Balas and Bohen, Grant, and Wratschko all estimated a time lag of 17 years measuring different points of the process.&quot; - &lt;a href=&quot;http://www.jrsm.rsmjournals.com/content/104/12/510.full&quot;&gt;http://www.jrsm.rsmjournals.com/content/104/12/510.full&lt;/a&gt;&lt;/div&gt;
&lt;div&gt;&lt;br&gt;&lt;/div&gt;
&lt;div&gt;&quot;The authors estimated the volume of medical literature potentially relevant to primary care published in a month and the time required for physicians trained in medical epidemiology to evaluate it for updating a clinical knowledgebase.... Average time per article was 2.89 minutes, if this outlier was excluded. Extrapolating this estimate to 7,287 articles per month, this effort would require 627.5 hours per month, or about 29 hours per weekday.&quot;&amp;#xA0;&lt;/div&gt;
&lt;div&gt;&lt;br&gt;&lt;/div&gt;
&lt;div&gt;One-third of hospital patients are harmed by their stay in the hospital, and 7% of patients are either permanently harmed or die: &lt;a href=&quot;http://www.ama-assn.org/amednews/2011/04/18/prl20418.htm&quot;&gt;http://www.ama-assn.org/amednews/2011/04/18/prl20418.htm&lt;/a&gt;&lt;/div&gt;
&lt;div&gt;&lt;br&gt;&lt;/div&gt;
&lt;div&gt;&lt;em&gt;(I emailed MetaMed to ask for the actual bibliography for the following citations, since that wasn't included in the copy of the list I saw. &amp;#xA0;I already recognize some of the citations having to do with Bayesian reasoning, which makes me fairly confident of the others.)&lt;/em&gt;&lt;/div&gt;
&lt;div&gt;&lt;br&gt;&lt;/div&gt;
&lt;div&gt;&lt;strong&gt;Statistical Illiteracy&lt;/strong&gt;&lt;/div&gt;
&lt;div&gt;&lt;br&gt;&lt;/div&gt;
&lt;div&gt;Doctors often confuse sensitivity and specificity (Gigerenzer 2002); most physicians do not understand how to compute the positive predictive value of a test (Hoffrage and Gigerenzer 1998); a third overestimate benefits if they are expressed as positive risk reductions (Gigerenzer et al 2007).&lt;/div&gt;
&lt;div&gt;Physicians think a procedure is more effective if the benefits are described as a relative risk reduction rather than as an absolute risk reduction (Naylor et al 1992).&lt;/div&gt;
&lt;div&gt;Only 3 out of 140 reviewers of four breast cancer screening proposals noticed that all four were identical proposals with the risks represented differently (Fahey et al 1995).&lt;/div&gt;
&lt;div&gt;60% of gynecologists do not understand what the sensitivity and specificity of a test are (Gigerenzer at al 2007).&lt;/div&gt;
&lt;div&gt;95% of physicians overestimated the probability of breast cancer given a positive mammogram by an order of magnitude (Eddy 1982).&lt;/div&gt;
&lt;div&gt;When physicians receive prostate cancer screening information in terms of five-year survival rates, 78% think screening is effective; when the same information is given in terms of mortality rates, 5% believe it is effective (Wegwarth et al, submitted).&lt;/div&gt;
&lt;div&gt;Only one out of 21 obstetricians could estimate the probability that an unborn child had Down syndrome given a positive test (Bramwell, West, and Salmon 2006).&lt;/div&gt;
&lt;div&gt;Sixteen out of twenty HIV counselors said that there was no such thing as a false positive HIV test (Gigerenzer et all 1998).&lt;/div&gt;
&lt;div&gt;Only 3% of questions in the certification exam for the American Board of Internal Medicine cover clinical epidemiology or medical statistics, and risk communication is not addressed (Gigerenzer et al 2007).&lt;/div&gt;
&lt;div&gt;British GPs rarely change their prescribing patterns and when they do it&amp;#x2019;s rarely in response to evidence (Armstrong et al 1996).&lt;/div&gt;
&lt;div&gt;&lt;br&gt;&lt;/div&gt;
&lt;div&gt;&lt;strong&gt;Drug Advertising&lt;/strong&gt;&lt;/div&gt;
&lt;div&gt;&lt;br&gt;&lt;/div&gt;
&lt;div&gt;Direct-to-customer advertising by pharmaceutical companies, which is intended to sell drugs rather than to educate, often does not contain information about a drug's success rate (only 9% did), alternative methods of treatment (29%), behavioral changes (24%), or the treatment duration (9%) (Bell et al 2000).&lt;/div&gt;
&lt;div&gt;Patients are more likely to request advertised drugs and doctors to prescribe them, regardless of their misgivings (Gilbody et al 2005).&lt;/div&gt;
&lt;div&gt;&lt;br&gt;&lt;/div&gt;
&lt;div&gt;&lt;strong&gt;Medical Errors&lt;/strong&gt;&lt;/div&gt;
&lt;div&gt;&lt;br&gt;&lt;/div&gt;
&lt;div&gt;44,000 to 98,000 patients are killed in US hospitals each year by documented, preventable medical errors (Kohn et al 2000).&lt;/div&gt;
&lt;div&gt;Despite proven effectiveness of simple checklists in reducing infections in hospitals (Provonost et al 2006), most ICU physicians do not use them.&lt;/div&gt;
&lt;div&gt;Simple diagnostic tools which may even ignore some data give measurably better outcomes in areas such as deciding whether to put a new admission in a coronary care bed (Green and Mehr 1997).&lt;/div&gt;
&lt;div&gt;Tort law often actively penalizes physicians who practice evidence-based medicine instead of the medicine that is customary in their area (Monahan 2007).&lt;/div&gt;
&lt;div&gt;Out of 175 law schools, only one requires a basic course in statistics or research methods (Faigman 1999), so many judges, jurors, and lawyers are misled by nontransparent statistics.&lt;/div&gt;
&lt;div&gt;93% of surgeons, obstreticians, and other health care professionals at high risk for malpractice suits report practicing defensive medicine (Studdert et al 2005).&lt;/div&gt;
&lt;div&gt;&lt;br&gt;&lt;/div&gt;
&lt;div&gt;&lt;strong&gt;Regional Variations in Health Care&lt;/strong&gt;&lt;/div&gt;
&lt;div&gt;&lt;br&gt;&lt;/div&gt;
&lt;div&gt;Tonsillectomies vary twelvefold between the counties in Vermont with the highest and lowest rates of the procedure (Wennberg and Gittelsohn 1973).&lt;/div&gt;
&lt;div&gt;Fivefold variations in one-year survival from cancer across different regions have been observed (Quam and Smith 2005).&lt;/div&gt;
&lt;div&gt;Fiftyfold variations in people receiving drug treatment for dementia has been reported (Prescribing Observatory for Mental Health 2007).&lt;/div&gt;
&lt;div&gt;Rates of certain surgical procedures vary tenfold to fifteenfold between regions (McPherson et al 1982).&lt;/div&gt;
&lt;div&gt;Clinicians are more likely to consult their colleagues than medical journals or the library, partially explaining regional differences (Shaughnessy et al 1994).&lt;/div&gt;
&lt;div&gt;&lt;br&gt;&lt;/div&gt;
&lt;div&gt;&lt;strong&gt;Research&lt;/strong&gt;&lt;/div&gt;
&lt;div&gt;&lt;br&gt;&lt;/div&gt;
&lt;div&gt;Researchers may report only favorable trials, only report favorable data (Angell 2004), or cherry-pick data to only report favorable variables or subgroups (Rennie 1997).&lt;/div&gt;
&lt;div&gt;Of 50 systematic reviews and meta-analyses on asthma treatment 40 had serious or extensive flaws, including all 6 associated with industry (Jadad et al 2000).&lt;/div&gt;
&lt;div&gt;Less high-tech knowledge and applications tend to be considered less innovative and ignored (Shi and Singh 2008).&lt;/div&gt;
&lt;div&gt;&lt;br&gt;&lt;/div&gt;
&lt;div&gt;&lt;strong&gt;Poor Use of Statistics In Research&lt;/strong&gt;&lt;/div&gt;
&lt;div&gt;&lt;br&gt;&lt;/div&gt;
&lt;div&gt;Only about 7% of major-journal trials report results using transparent statistics (Nuovo, Melnivov and Chang 2002).&lt;/div&gt;
&lt;div&gt;Data are often reported in biased ways: for instance, benefits are often reported as relative risks (&amp;#x201C;reduces the risk by half&amp;#x201D;) and harms as absolute risks (&amp;#x201C;an increase of 5 in 1000&amp;#x201D;); absolute risks seem smaller even when the risk is the same (Gigerenzer et al 2007).&lt;/div&gt;
&lt;div&gt;Half of trials inappropriately use significance tests for baseline comparison; 2/3 present subgroup findings, a sign of possible data fishing, often without appropriate tests for interaction (Assman et al 2000).&lt;/div&gt;
&lt;div&gt;One third of studies use mismatched framing, where benefits are reported one way (usually relative risk reduction, which makes them look bigger) and harms another (usually absolute risk reduction, which makes them look smaller) (Sedrakyan and Shih 2007).&lt;/div&gt;
&lt;div&gt;&lt;br&gt;&lt;/div&gt;
&lt;div&gt;&lt;strong&gt;Positive Publication Bias&lt;/strong&gt;&lt;/div&gt;
&lt;div&gt;&lt;br&gt;&lt;/div&gt;
&lt;div&gt;Positive publication bias overstates the effects of treatment by up to one-third (Schultz et al 1995).&lt;/div&gt;
&lt;div&gt;More than 50% of research is unpublished or unreported (Mathieu et al 2009).&lt;/div&gt;
&lt;div&gt;In ten high-impact medical journals, only 45.5% of trials were adequately registered before testing began; of these 31% show discrepancies between outcomes measured and published (Mathieu et al 2009).&lt;/div&gt;
&lt;div&gt;&lt;br&gt;&lt;/div&gt;
&lt;div&gt;&lt;strong&gt;Pharmaceutical Company Induced Bias&lt;/strong&gt;&lt;/div&gt;
&lt;div&gt;&lt;br&gt;&lt;/div&gt;
&lt;div&gt;Studies funded by the pharmaceutical industry are more likely to report results favorable to the sponsoring company (Lexchin et al 2003).&lt;/div&gt;
&lt;div&gt;There is a significant association between industry sponsorship and both pro-industry outcomes and poor methodology (Bekelman and Kronmal 2008).&lt;/div&gt;
&lt;div&gt;In manufacturer-supported trials of non-steroidal anti-inflammatory drugs, half the time the data presented did not match claims made within the article (Rochon et al 1994).&lt;/div&gt;
&lt;div&gt;68% of US health research is funded by industry (Research!America 2008), which means that research that leads to profits to the health care industry tends to be prioritized.&lt;/div&gt;
&lt;div&gt;71 out of 78 drugs approved by the FDA in 2002 are &amp;#x201C;me too&amp;#x201D; drugs that are more profitable because of the patent but not substantially different from existing medication (Angell 2004).&lt;/div&gt;
&lt;div&gt;&amp;#x201C;Seeding trials&amp;#x201D; by pharmaceutical companies promote treatments instead of testing hypotheses (Hill et al 2008).&lt;/div&gt;
&lt;div&gt;Even accurate research may be misreported by pharmaceutical company advertising, including ads in medical journals (Villanueva et al 2003).&lt;/div&gt;
&lt;div&gt;In 92% of cases, pharmaceutical leaflets distributed to doctors have data summaries that either cannot be verified or inaccurately summarize available data (Kaiser et al 2004).&lt;/div&gt;
&lt;div&gt;&lt;br&gt;&lt;/div&gt;
&lt;div&gt;
&lt;hr&gt;
&lt;/div&gt;
&lt;div&gt;&lt;br&gt;&lt;/div&gt;
&lt;div&gt;I don't plan on becoming seriously sick, but if I do, I think I'll check in with MetaMed&amp;#xA0;just to make sure nobody is ignoring the research results showing that you shouldn't feed the patient rat poison.&lt;/div&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/lw/gvi/metamed_evidencebased_healthcare/#comments"&gt;165 comments&lt;/a&gt;
</description>
</item>
<item>
<title>Recent updates to gwern.net (2012-2013) </title>
<link>http://lesswrong.com/lw/h0b/recent_updates_to_gwernnet_20122013/</link>
<guid isPermaLink="true">http://lesswrong.com/lw/h0b/recent_updates_to_gwernnet_20122013/</guid>
<pubDate>Tue, 19 Mar 2013 06:54:30 +1100</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/gwern"&gt;gwern&lt;/a&gt;
&amp;bull;
61 votes
&amp;bull;
&lt;a href="http://lesswrong.com/lw/h0b/recent_updates_to_gwernnet_20122013/#comments"&gt;32 comments&lt;/a&gt;
&lt;div&gt;&lt;blockquote&gt;
&lt;p&gt;Previous: &lt;a href=&quot;/lw/8kv/recent_updates_to_gwernnet/&quot;&gt;Recent updates to gwern.net (2011)&lt;/a&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;&amp;#x201C;But where shall wisdom be found? / And where is the place of understanding? / Man knoweth not the price thereof; neither is it found in the land of the living&amp;#x2026;for the price of wisdom is above rubies.&amp;#x201D;&lt;/p&gt;
&lt;p&gt;As before, here is material I&amp;#x2019;ve worked on in the 477 days since my last update which LWers may find interesting. In roughly chronological &amp;amp; topical order, here are the major additions to &lt;code&gt;gwern.net&lt;/code&gt;:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;I interviewed translator &lt;a href=&quot;http://www.gwern.net/docs/2011-house&quot;&gt;Michael House&lt;/a&gt; about his work in Japan as a translator&lt;/li&gt;
&lt;li&gt;finished data collection for my &lt;a href=&quot;http://www.gwern.net/hafu&quot;&gt;hafu anime statistics page&lt;/a&gt; and begun analysis. (I&amp;#x2019;ve achieved good coverage of characters, found an astonishingly consistent absence of Korean characters, and confirmed the blond-haired/blue-eyed stereotype; but my original thesis doesn&amp;#x2019;t seem to work and the data is too unevenly distributed to identify time trends.)&lt;/li&gt;
&lt;li&gt;judged the &lt;a href=&quot;http://www.gwern.net/Haskell%20Summer%20of%20Code#section-5&quot;&gt;2011&lt;/a&gt; &amp;amp; &lt;a href=&quot;http://www.gwern.net/Haskell%20Summer%20of%20Code#section-6&quot;&gt;2012&lt;/a&gt; results for the Haskell Summer of Codes and the accuracy of my predictions&lt;/li&gt;
&lt;li&gt;did &lt;a href=&quot;http://www.gwern.net/DNB%20FAQ#meta-analysis&quot;&gt;a meta-analysis&lt;/a&gt; on whether dual n-back increases IQ, and examining possible biases and various claims about what makes the training work or not work&lt;/li&gt;
&lt;li&gt;did &lt;a href=&quot;http://www.gwern.net/Iodine#meta-analysis&quot;&gt;another meta-analysis&lt;/a&gt; on whether iodine increases IQ, etc&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;modafinil:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;checked for &lt;a href=&quot;http://www.gwern.net/Nootropics#modalert-blind-day-trial&quot;&gt;subjective effects of blinded modafinil&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;updated my &lt;a href=&quot;http://www.gwern.net/Modafinil#suppliers-prices&quot;&gt;modafinil price-chart&lt;/a&gt; twice, and expanded with brand data and a new armodafinil table&lt;/li&gt;
&lt;li&gt;researched modafinil-related &lt;a href=&quot;http://www.gwern.net/Modafinil#legal-risk&quot;&gt;prosecutions &amp;amp; convictions&lt;/a&gt; in the USA&lt;/li&gt;
&lt;li&gt;and any connection with &lt;a href=&quot;http://www.gwern.net/Modafinil#schizophrenia&quot;&gt;schizophrenia&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;tried &lt;a href=&quot;http://www.gwern.net/Nootropics#kratom&quot;&gt;kratom&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;did a &lt;a href=&quot;http://www.gwern.net/Nootropics#nicotine-experiment&quot;&gt;nicotine gum/n-back experiment&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;did &lt;a href=&quot;http://www.gwern.net/Zeo#potassium&quot;&gt;2 potassium experiments&lt;/a&gt;; neither improved my mood/productivity, and one damaged my sleep&lt;/li&gt;
&lt;li&gt;my Silk Road page has been expanded with a &lt;a href=&quot;http://www.gwern.net/Silk%20Road#bbc-questions&quot;&gt;BBC interview&lt;/a&gt;, putting SR in a &lt;a href=&quot;http://www.gwern.net/Silk%20Road#silk-road-as-cyphernomicons-black-markets&quot;&gt;historical cypherpunk context&lt;/a&gt;, an updated account of &lt;a href=&quot;http://www.gwern.net/Silk%20Road#safe&quot;&gt;all arrests &amp;amp; law enforcement actions&lt;/a&gt;, and &lt;a href=&quot;http://www.gwern.net/Silk%20Road#lsd-case-study&quot;&gt;application of basic statistics to ordering&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;ran 2 sleep experiments on the timing of taking a vitamin D supplement: I found that &lt;a href=&quot;http://www.gwern.net/Zeo#vitamin-d&quot;&gt;taking vitamin D before bed&lt;/a&gt; substantially damaged my sleep, while &lt;a href=&quot;http://www.gwern.net/Zeo#vitamin-d-at-morn-helps&quot;&gt;taking vitamin D after waking up&lt;/a&gt; did not hurt &amp;amp; somewhat helped&lt;/li&gt;
&lt;li&gt;checked whether a walking desk (treadmill) &lt;a href=&quot;http://www.gwern.net/Zeo#fn53&quot;&gt;damaged typing speed or accuracy&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;I have run 3 Wikipedia experiments establishing that: &lt;a href=&quot;http://www.gwern.net/In%20Defense%20Of%20Inclusionism#sins-of-omission-experiment-1&quot;&gt;Talk page edits are ignored&lt;/a&gt; by editors; &lt;a href=&quot;http://www.gwern.net/In%20Defense%20Of%20Inclusionism#sins-of-omission-experiment-2&quot;&gt;random link deletions (and their restoration) are also ignored&lt;/a&gt; by editors; and &lt;a href=&quot;http://www.gwern.net/In%20Defense%20Of%20Inclusionism#ignoti-sed-non-occulti&quot;&gt;external link suggestions on Talk pages&lt;/a&gt; are also ignored by readers. (I take the former 2 as indicative of the decline in edit activity and rise of deletionist beliefs on Wikipedia.)&lt;/li&gt;
&lt;li&gt;tried some economic/historical analysis: &lt;a href=&quot;http://www.gwern.net/Notes#reasons-of-state-why-didnt-denmark-sell-greenland&quot;&gt;&amp;#x201C;Reasons of State: Why Didn&amp;#x2019;t Denmark Sell Greenland to the USA?&amp;#x201D;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;http://www.gwern.net/Sunk%20cost&quot;&gt;Defending sunk costs&lt;/a&gt; essay (&lt;a href=&quot;/lw/9si/is_sunk_cost_fallacy_a_fallacy/&quot;&gt;LW discussion&lt;/a&gt;)&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;http://www.gwern.net/Slowing%20Moore%27s%20Law&quot;&gt;&amp;#x201C;Slowing Moore&amp;#x2019;s Law: Why You Might Want To and How You Would Do It&amp;#x201D;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;http://www.gwern.net/Hyperbolic%20Time%20Chamber&quot;&gt;&amp;#x201C;The Hyperbolic Time Chamber as Brain Emulation Analogy&amp;#x201D;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;tried estimating the bandwidth of a &lt;a href=&quot;http://www.gwern.net/Death%20Note%20Anonymity#communicating-with-a-death-note&quot;&gt;Death Note&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;http://www.gwern.net/hpmor&quot;&gt;compiled predictions&lt;/a&gt; for &lt;em&gt;Harry Potter and the Methods of Rationality&lt;/em&gt;&lt;/li&gt;
&lt;li&gt;looked into &lt;a href=&quot;http://www.gwern.net/Conscientiousness%20and%20online%20education&quot;&gt;Conscientiousness and online education&lt;/a&gt;; studies so far are useless from a meta-analytic standpoint&lt;/li&gt;
&lt;li&gt;tripled length of &lt;a href=&quot;http://www.gwern.net/DNB%20FAQ#flaws-in-mainstream-science-and-psychology&quot;&gt;appendix&lt;/a&gt; dealing with the reliability of mainstream science (methodological flaws, replication rates, etc)&lt;/li&gt;
&lt;li&gt;finished meta-ethics essay, &lt;a href=&quot;http://www.gwern.net/The%20Narrowing%20Circle&quot;&gt;&amp;#x201C;The Narrowing Circle&amp;#x201D;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;explained the philosophy saying &lt;a href=&quot;http://www.gwern.net/Prediction%20markets#modus-tollens-vs-modus-ponens&quot;&gt;&amp;#x201C;one man&amp;#x2019;s modus ponens is another man&amp;#x2019;s modus tollens&amp;#x201D;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;speculation about a &lt;a href=&quot;http://www.gwern.net/Notes#alternate-futures-the-second-english-restoration&quot;&gt;restoration of the British monarchy&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;clean up &amp;amp; exploratory data analysis of &lt;a href=&quot;http://www.gwern.net/Zeo#sdr-lucid-dreaming-exploratory-data-analysis&quot;&gt;SDr&amp;#x2019;s lucid dreaming data&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;http://www.gwern.net/Death%20Note%20script&quot;&gt;Who wrote the &lt;em&gt;Death Note&lt;/em&gt; script?&lt;/a&gt; (&lt;a href=&quot;/lw/f63/case_study_the_death_note_script_and_bayes/&quot;&gt;LW discussion&lt;/a&gt;)&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;http://www.gwern.net/2012%20election%20predictions&quot;&gt;2012 US election predictions: statistical comparison&lt;/a&gt; &lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;a href=&quot;http://www.gwern.net/Anchoring&quot;&gt;Comment anchoring experiment&lt;/a&gt; (&lt;a href=&quot;/lw/gft/lw_anchoring_experiment_maybe/&quot;&gt;LW discussion&lt;/a&gt;)&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;http://www.gwern.net/Notes#surprising-turing-complete-languages&quot;&gt;Turing-completeness in surprising places&lt;/a&gt; (inventory of particularly &amp;#x201C;weird machines&amp;#x201D;; relevant to computer and AI security)&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Transcribed or translated:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&quot;http://www.gwern.net/docs/1955-nash&quot;&gt;Nash&amp;#x2019;s letters on cryptography&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Douglas Hofstadter&amp;#x2019;s &lt;a href=&quot;http://www.gwern.net/docs/1985-hofstadter&quot;&gt;superrationality&lt;/a&gt; columns (from &lt;em&gt;Metamagical Themas&lt;/em&gt;, 1985)&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;http://www.gwern.net/docs/1987-rossi&quot;&gt;&amp;#x201C;The Iron Law Of Evaluation And Other Metallic Rules&amp;#x201D;&lt;/a&gt;, Rossi 1987 (lessons from the large RCTs evaluating social &amp;amp; welfare interventions)&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;http://www.gwern.net/docs/1994-falk&quot;&gt;&amp;#x201C;The Ups and Downs of the Hope Function In a Fruitless Search&amp;#x201D;&lt;/a&gt;, Falk et al 1994&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;http://www.gwern.net/docs/2007-wolfe&quot;&gt;Gene Wolfe on writing&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;http://www.gwern.net/docs/2002-gibson&quot;&gt;&amp;#x201C;Shiny balls of Mud: William Gibson Looks at Japanese Pursuits of Perfection&amp;#x201D;&lt;/a&gt; (2002)&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;http://www.gwern.net/docs/2004-okada&quot;&gt;&amp;#x201C;Otaku Talk&amp;#x201D;, Okada et al 2004&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;http://www.gwern.net/docs/2005-murakami&quot;&gt;&amp;#x201C;Earth in My Window&amp;#x201D;, Murakami 2005&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;http://www.gwern.net/docs/2005-sawaragi&quot;&gt;&amp;#x201C;On The Battlefield of &amp;#x2018;Superflat&amp;#x2019;&amp;#x201D;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;http://www.gwern.net/docs/2010-sarrazin&quot;&gt;&amp;#x201C;Ero-Anime: Manga Comes Alive&amp;#x201D;, Sarrazin 2010&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;http://www.gwern.net/docs/eva/1996-newtype-anno-interview&quot;&gt;1996 &lt;em&gt;NewType&lt;/em&gt; interview with Hideaki Anno&lt;/a&gt; (translated by me, with the help of an EGFer)&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;http://www.gwern.net/docs/eva/1997-animeland-may-hideakianno-interview-english&quot;&gt;1997 &lt;em&gt;Animeland&lt;/em&gt; interview with Hideaki Anno&lt;/a&gt; (bought, transcribed, and translated by me with the help of other LWers)&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;http://www.gwern.net/docs/1997-utena&quot;&gt;1997 &lt;em&gt;Utena&lt;/em&gt; interviews&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;More technical:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;added &lt;a href=&quot;http://www.gwern.net/docs/gwern.net-gitstats/index.html&quot;&gt;edit history statistics/visualization&lt;/a&gt; for &lt;code&gt;gwern.net&lt;/code&gt; using &lt;a href=&quot;http://gitstats.sourceforge.net/&quot;&gt;GitStats&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;site traffic updates: &lt;a href=&quot;http://www.gwern.net/About#july-2011---december-2011&quot;&gt;July-December 2011&lt;/a&gt;, &lt;a href=&quot;http://www.gwern.net/About#january-2012---july-2012&quot;&gt;January 2012-July 2012&lt;/a&gt;, &lt;a href=&quot;http://www.gwern.net/About#july-2012---january-2013&quot;&gt;July 2012-Jan 2013&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;There&amp;#x2019;s also been a lot of &lt;a href=&quot;http://www.gwern.net/About#colophon&quot;&gt;backend&lt;/a&gt; changes: switching to Amazon S3+Cloudflare, adding error pages, metadata like tags, A/B testing, but no need to go into detail.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Personal:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;dumped my notes on my &lt;a href=&quot;http://www.gwern.net/2011%20San%20Francisco&quot;&gt;2011 visit&lt;/a&gt; to San Francisco&lt;/li&gt;
&lt;li&gt;posted summaries of &lt;a href=&quot;http://www.gwern.net/Links#profile&quot;&gt;my personality &amp;amp; attitudes&lt;/a&gt; &amp;amp; my &lt;a href=&quot;http://www.gwern.net/docs/gwern-google-reader-subscriptions.xml&quot;&gt;RSS feed collection&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;enjoyed some &lt;a href=&quot;http://www.gwern.net/Mead&quot;&gt;mead&lt;/a&gt;; I still like tea better, though&lt;/li&gt;
&lt;li&gt;dumped notes on the &lt;a href=&quot;http://www.gwern.net/ICON%202012&quot;&gt;2012 SF convention ICON&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/lw/h0b/recent_updates_to_gwernnet_20122013/#comments"&gt;32 comments&lt;/a&gt;
</description>
</item>
<item>
<title>Reflection in Probabilistic Logic</title>
<link>http://lesswrong.com/lw/h1k/reflection_in_probabilistic_logic/</link>
<guid isPermaLink="true">http://lesswrong.com/lw/h1k/reflection_in_probabilistic_logic/</guid>
<pubDate>Mon, 25 Mar 2013 03:37:36 +1100</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/Eliezer_Yudkowsky"&gt;Eliezer_Yudkowsky&lt;/a&gt;
&amp;bull;
60 votes
&amp;bull;
&lt;a href="http://lesswrong.com/lw/h1k/reflection_in_probabilistic_logic/#comments"&gt;161 comments&lt;/a&gt;
&lt;div&gt;&lt;p&gt;Paul Christiano has devised&amp;#xA0;&lt;a href=&quot;http://intelligence.org/wp-content/uploads/2013/03/Christiano-et-al-Naturalistic-reflection-early-draft.pdf&quot;&gt;&lt;strong&gt;a new fundamental approach&lt;/strong&gt;&lt;/a&gt;&amp;#xA0;to the &quot;&lt;a href=&quot;https://www.youtube.com/watch?v=MwriJqBZyoM&quot;&gt;L&amp;#xF6;b Problem&lt;/a&gt;&quot; wherein &lt;a href=&quot;/lw/t6/the_cartoon_guide_to_l%C3%B6bs_theorem/&quot;&gt;L&amp;#xF6;b's Theorem&lt;/a&gt; seems to pose an obstacle to AIs building successor AIs, or adopting successor versions of their own code, that trust the same amount of mathematics as the original. &amp;#xA0;(I am currently writing up a more thorough description of the &lt;em&gt;question &lt;/em&gt;this preliminary technical report is working on answering. &amp;#xA0;For now the main online description is in a&amp;#xA0;&lt;a href=&quot;https://www.youtube.com/watch?v=MwriJqBZyoM&quot;&gt;quick Summit talk&lt;/a&gt;&amp;#xA0;I gave. &amp;#xA0;See also Benja Fallenstein's description of the problem in the course of presenting a&amp;#xA0;&lt;a href=&quot;/lw/e4e/an_angle_of_attack_on_open_problem_1/&quot;&gt;different angle of attack&lt;/a&gt;. &amp;#xA0;Roughly the problem is that mathematical systems can only prove the soundness of, aka 'trust', weaker mathematical systems. &amp;#xA0;If you try to write out an exact description of how AIs would build their successors or successor versions of their code in the most obvious way, it looks like the mathematical strength of the proof system would tend to be stepped down each time, which is undesirable.)&lt;/p&gt;
&lt;p&gt;Paul Christiano's approach is inspired by the idea that whereof one cannot prove or disprove, thereof one must assign probabilities: and that although no mathematical system can contain its own&amp;#xA0;&lt;em&gt;truth&lt;/em&gt;&amp;#xA0;predicate, a mathematical system might be able to contain a reflectively consistent&amp;#xA0;&lt;em&gt;probability&lt;/em&gt;&amp;#xA0;predicate. &amp;#xA0;In particular, it looks like we can have:&lt;/p&gt;
&lt;p&gt;&amp;#x2200;a, b:&lt;span style=&quot;white-space: pre;&quot;&gt; &lt;/span&gt;(a &amp;lt; P(&amp;#x3C6;) &amp;lt; b) &amp;#xA0; &amp;#xA0; &amp;#xA0; &amp;#xA0; &amp;#xA0;&amp;#x21D2; &amp;#xA0;P(a &amp;lt; P('&amp;#x3C6;') &amp;lt; b) = 1&lt;br&gt;&amp;#x2200;a, b:&lt;span style=&quot;white-space: pre;&quot;&gt; &lt;/span&gt;P(a &amp;#x2264; P('&amp;#x3C6;')&amp;#xA0;&amp;#x2264;&amp;#xA0;b) &amp;gt; 0 &amp;#xA0;&amp;#x21D2; &amp;#xA0;a&amp;#xA0;&amp;#x2264;&amp;#xA0;P(&amp;#x3C6;)&amp;#xA0;&amp;#x2264;&amp;#xA0;b&lt;/p&gt;
&lt;p&gt;Suppose I present you with the human and probabilistic version of a G&amp;#xF6;del sentence, the&amp;#xA0;&lt;a href=&quot;http://books.google.com/books?id=cmX8yyBfP74C&amp;amp;pg=PA317&amp;amp;lpg=PA317&amp;amp;dq=whitely+lucas+cannot+consistently&amp;amp;source=bl&amp;amp;ots=68tuximFfI&amp;amp;sig=GdZro1wy6g_KzO-PXInGTKFrU7Q&amp;amp;hl=en&amp;amp;sa=X&amp;amp;ei=7-FMUb61LojRiAK9hIGQDw&amp;amp;ved=0CGoQ6AEwBg#v=onepage&amp;amp;q=whitely%20lucas%20cannot%20consistently&amp;amp;f=false&quot;&gt;Whitely sentence&lt;/a&gt;&amp;#xA0;&quot;You assign this statement a probability less than 30%.&quot; &amp;#xA0;If you disbelieve this statement, it is true. &amp;#xA0;If you believe it, it is false. &amp;#xA0;If you assign 30% probability to it, it is false. &amp;#xA0;If you assign 29% probability to it, it is true.&lt;/p&gt;
&lt;p&gt;Paul's approach resolves this problem by restricting your belief about your own probability assignment to within epsilon of 30% for any epsilon. &amp;#xA0;So Paul's approach replies, &quot;Well, I assign&amp;#xA0;&lt;em&gt;almost&lt;/em&gt;&amp;#xA0;exactly 30% probability to that statement - maybe a little more, maybe a little less - in fact I think there's about a 30% chance that I'm a tiny bit under 0.3 probability and a 70% chance that I'm a tiny bit over 0.3 probability.&quot; &amp;#xA0;A standard fixed-point theorem then implies that a consistent assignment like this should exist. &amp;#xA0;If asked if the probability is over 0.2999 or under 0.30001 you will reply with a definite yes.&lt;a id=&quot;more&quot;&gt;&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;We haven't yet worked out a walkthrough showing if/how this solves the L&amp;#xF6;b obstacle to self-modification, and the probabilistic theory itself is nonconstructive (we've shown that something like this should exist, but not how to compute it). &amp;#xA0;Even so, a possible fundamental triumph over Tarski's theorem on the undefinability of truth and a number of standard G&amp;#xF6;delian limitations is important news as math&amp;#xA0;&lt;em&gt;qua&lt;/em&gt;&amp;#xA0;math, though work here is still in very preliminary stages. &amp;#xA0;There are even whispers of unrestricted comprehension in a probabilistic version of set theory with&amp;#xA0;&amp;#x2200;&amp;#x3C6;: &amp;#x2203;S: P(x &amp;#x2208; S) = P(&amp;#x3C6;(x)), though this part is not in the preliminary report and is at even earlier stages and could easily not work out at all.&lt;/p&gt;
&lt;p&gt;It seems important to remark on how this result was developed: &amp;#xA0;Paul Christiano showed up with the idea (of consistent probabilistic reflection via a fixed-point theorem) to a week-long &quot;math squad&quot; (aka MIRI Workshop) with Marcello Herreshoff, Mihaly Barasz, and myself; then we all spent the next week proving that version after version of Paul's idea couldn't work or wouldn't yield self-modifying AI; until finally, a day after the workshop was supposed to end, it produced something that looked like it might work. &amp;#xA0;If we hadn't been trying to &lt;em&gt;solve &lt;/em&gt;this problem (with hope stemming from how it seemed like the sort of thing a reflective rational agent ought&amp;#xA0;to be able to do somehow), this would be just another batch of impossibility results in the math literature. &amp;#xA0;I remark on this because it may help demonstrate that Friendly AI is a productive approach to math&amp;#xA0;&lt;em&gt;qua&amp;#xA0;&lt;/em&gt;math, which may aid some mathematician in becoming interested.&lt;/p&gt;
&lt;p&gt;I further note that this does not mean the L&amp;#xF6;bian obstacle is resolved and no further work is required. &amp;#xA0;Before we can conclude that we need a computably specified version of the theory plus a walkthrough for a self-modifying agent using it.&lt;/p&gt;
&lt;p&gt;See also the&amp;#xA0;&lt;a href=&quot;http://intelligence.org/2013/03/22/early-draft-of-naturalistic-reflection-paper/&quot;&gt;blog post&lt;/a&gt;&amp;#xA0;on the MIRI site (and subscribe to MIRI's newsletter&amp;#xA0;&lt;a href=&quot;http://intelligence.org/&quot;&gt;here&lt;/a&gt;&amp;#xA0;to keep abreast of research updates).&lt;/p&gt;
&lt;p&gt;This LW post is the preferred place for feedback on the &lt;a href=&quot;http://intelligence.org/wp-content/uploads/2013/03/Christiano-et-al-Naturalistic-reflection-early-draft.pdf&quot;&gt;paper&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;EDIT: &amp;#xA0;But see discussion on a Google+ post by John Baez &lt;a href=&quot;https://plus.google.com/117663015413546257905/posts/jJModdTJ2R3?hl=en&quot;&gt;here&lt;/a&gt;. &amp;#xA0;Also see&amp;#xA0;&lt;a href=&quot;http://wiki.lesswrong.com/wiki/Comment_formatting#Using_LaTeX_to_render_mathematics&quot;&gt;here&lt;/a&gt;&amp;#xA0;for how to display math LaTeX in comments.&lt;/p&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/lw/h1k/reflection_in_probabilistic_logic/#comments"&gt;161 comments&lt;/a&gt;
</description>
</item>
<item>
<title>Co-Working Collaboration to Combat Akrasia</title>
<link>http://lesswrong.com/lw/gwo/coworking_collaboration_to_combat_akrasia/</link>
<guid isPermaLink="true">http://lesswrong.com/lw/gwo/coworking_collaboration_to_combat_akrasia/</guid>
<pubDate>Sun, 10 Mar 2013 05:17:04 +1100</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/ShannonFriedman"&gt;ShannonFriedman&lt;/a&gt;
&amp;bull;
54 votes
&amp;bull;
&lt;a href="http://lesswrong.com/lw/gwo/coworking_collaboration_to_combat_akrasia/#comments"&gt;95 comments&lt;/a&gt;
&lt;div&gt;&lt;p&gt;&lt;strong style=&quot;font-family: Times; font-size: medium; font-weight: normal;&quot; id=&quot;internal-source-marker_0.417484016623348&quot;&gt;&lt;/strong&gt;&lt;strong style=&quot;font-weight: normal;&quot; id=&quot;internal-source-marker_0.417484016623348&quot;&gt;&lt;span style=&quot;font-size: 15px; font-family: Arial; background-color: transparent; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;Before I was very involved in the Less Wrong community, I heard that Eliezer was looking for &lt;/span&gt;&lt;span style=&quot;font-size: 15px; font-family: Arial; color: #1155cc; background-color: transparent; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;&lt;a href=&quot;/lw/1sm/akrasia_tactics_review/1nl4&quot;&gt;people to sit with him&lt;/a&gt; &lt;/span&gt;&lt;span style=&quot;font-size: 15px; font-family: Arial; background-color: transparent; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;while he worked, to increase writing productivity. &lt;/span&gt;&lt;/strong&gt;&lt;strong style=&quot;font-family: Times; font-size: medium; font-weight: normal;&quot; id=&quot;internal-source-marker_0.417484016623348&quot;&gt;&lt;span style=&quot;font-size: 15px; font-family: Arial; background-color: transparent; vertical-align: baseline; white-space: pre-wrap;&quot;&gt; I knew that he was doing important work in the world, and figured that this was the sort of contribution to improving humanity that I would like to make, which was within the set of things that would be easy and enjoyable for me. &lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style=&quot;font-family: Times; font-size: medium; font-weight: normal;&quot; id=&quot;internal-source-marker_0.417484016623348&quot;&gt;&lt;span style=&quot;font-size: 15px; font-family: Arial; background-color: transparent; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;So I got a hold of him and offered to come and sit with him, and did that once/week for about a year. As anticipated, it worked marvelously. I found it easy to sit and not talk, just getting my own work done. &amp;#xA0;Eventually I became a beta reader for his &quot;Bayes for Everyone Else&quot;&lt;/span&gt;&lt;span style=&quot;font-size: 15px; font-family: Arial; background-color: transparent; vertical-align: baseline; white-space: pre-wrap;&quot;&gt; which is really great and helped me in my ability to estimate probabilities a ton. (Eliezer is still perfecting this work and has not yet released it, but you can find the older version &lt;/span&gt;&lt;a href=&quot;http://yudkowsky.net/rational/bayes&quot;&gt;&lt;span style=&quot;font-size: 15px; font-family: Arial; color: #1155cc; background-color: transparent; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;here&lt;/span&gt;&lt;/a&gt;&lt;span style=&quot;font-size: 15px; font-family: Arial; background-color: transparent; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;.) &lt;/span&gt;&lt;br&gt;&lt;br&gt;&lt;span style=&quot;font-size: 15px; font-family: Arial; background-color: transparent; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;In addition to learning the basics of Bayes from doing this, I also learned how powerful it is to have someone just to sit quietly with you to co-work on a regular schedule. &lt;/span&gt;&lt;br&gt;&lt;br&gt;&lt;span style=&quot;font-size: 15px; font-family: Arial; background-color: transparent; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;I&amp;#x2019;ve experimented with similar things since then, such as making skype dates with a friend to watch informational videos together. This worked for awhile until my friend got busy. I have two other recurring chat dates with friends to do &lt;/span&gt;&lt;a href=&quot;/lw/1sm/akrasia_tactics_review/1nl4&quot;&gt;&lt;span style=&quot;font-size: 15px; font-family: Arial; color: #1155cc; background-color: transparent; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;dual n-back&lt;/span&gt;&lt;/a&gt;&lt;span style=&quot;font-size: 15px; font-family: Arial; background-color: transparent; vertical-align: baseline; white-space: pre-wrap;&quot;&gt; together, and those have worked quite well and are still going. &lt;/span&gt;&lt;br&gt;&lt;br&gt;&lt;span style=&quot;font-size: 15px; font-family: Arial; background-color: transparent; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;A client of mine, &lt;a href=&quot;/user/Mqrius/&quot;&gt;Mqrius&lt;/a&gt;, is working on his Master&amp;#x2019;s thesis and has found that the only way he has been able to overcome his akrasia so far is by co-working with a friend. Unfortunately, his friend does not have as much time to co-work as he&amp;#x2019;d like, so we decided to spend Mqrius&amp;#x2019;s counseling session today writing this Less Wrong post to see if we can help him and other people in the community who want to co-work over skype connect, since this will probably be much higher value to him as well as others with similar difficulties than the next best thing we could do with the time. &lt;/span&gt;&lt;br&gt;&lt;br&gt;&lt;span style=&quot;font-size: 15px; font-family: Arial; background-color: transparent; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;I encourage anyone who is interested in co-working, watching informational videos together, or any other social productivity experiments that can be done over skype or chat, to coordinate in the comments. For this to work best, I recommend being as specific as possible about the ideal co-working partner for you, in addition to noting if you are open to general co-working. &lt;/span&gt;&lt;br&gt;&lt;br&gt;&lt;span style=&quot;font-size: 15px; font-family: Arial; background-color: transparent; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;If you are &lt;/span&gt;&lt;a href=&quot;/lw/bc3/sotw_be_specific/&quot;&gt;&lt;span style=&quot;font-size: 15px; font-family: Arial; color: #1155cc; background-color: transparent; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;specific&lt;/span&gt;&lt;/a&gt;&lt;span style=&quot;font-size: 15px; font-family: Arial; background-color: transparent; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;, you are much more likely to succeed in finding a good co-working partner for you. While its possible you might screen someone out, its more likely that you will get the attention of your ideal co-working partner who otherwise would have glossed over your comment. &lt;/span&gt;&lt;br&gt;&lt;br&gt;&lt;span style=&quot;font-size: 15px; font-family: Arial; background-color: transparent; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;Here is my specific pitch for &lt;a href=&quot;/user/Mqrius/&quot;&gt;Mqrius&lt;/a&gt;:&lt;/span&gt;&lt;br&gt;&lt;br&gt;&lt;/strong&gt;&lt;/p&gt;
&lt;p style=&quot;margin-left: 36pt; margin-top: 0pt; margin-bottom: 0pt;&quot; dir=&quot;ltr&quot;&gt;&lt;strong style=&quot;font-family: Times; font-size: medium; font-weight: normal;&quot; id=&quot;internal-source-marker_0.417484016623348&quot;&gt;&lt;span style=&quot;font-size: 15px; font-family: Arial; background-color: transparent; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;If you are working on a thesis, especially if it&amp;#x2019;s related to nanotechnology like his thesis, and think that you are likely to be similarly motivated by co-working, please comment or contact him about setting up an initial skype trial run. His ideal scenario is to find 2-3 people to co-work with him for about 20 hours co-working/week time for him in total. He would like to find people who are dependable about showing up for appointments they have made and will create a recurring schedule with him at least until he gets his thesis done. He&amp;#x2019;d like to try an initial 4 hour co-working block as an experiment with interested parties. &amp;#xA0;&amp;#xA0;Please comment below if you are interested. &lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style=&quot;font-family: Times; font-size: medium; font-weight: normal;&quot; id=&quot;internal-source-marker_0.417484016623348&quot;&gt; &lt;br&gt;&lt;span style=&quot;font-size: 15px; font-family: Arial; background-color: transparent; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;[Mqrius and I have predictions going about whether or not he will actually get a co-working partner who is working on a nanotech paper out of this, if others want to post predictions in the comments, this is encouraged. &amp;#xA0;Its a good practice for reducing &lt;/span&gt;&lt;a style=&quot;font-family: Times; font-size: medium;&quot; href=&quot;/lw/il/hindsight_bias/&quot;&gt;&lt;span style=&quot;font-size: 15px; font-family: Arial; color: #1155cc; background-color: transparent; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;hindsight bias&lt;/span&gt;&lt;/a&gt;&lt;span style=&quot;font-size: 15px; font-family: Arial; background-color: transparent; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;.&lt;/span&gt;&lt;span style=&quot;font-size: 15px; font-family: Arial; background-color: transparent; vertical-align: baseline; white-space: pre-wrap;&quot;&gt;&lt;span style=&quot;font-family: Times; font-size: small;&quot;&gt;]&lt;/span&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;[edit]&lt;/p&gt;
&lt;p&gt;An virtual co-working space has been created and is currently live, discussion and link to the room &lt;a href=&quot;/lw/gwo/coworking_collaboration_to_combat_akrasia/8lak&quot;&gt;here&lt;/a&gt;.&lt;/p&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/lw/gwo/coworking_collaboration_to_combat_akrasia/#comments"&gt;95 comments&lt;/a&gt;
</description>
</item>
<item>
<title>Fermi Estimates</title>
<link>http://lesswrong.com/lw/h5e/fermi_estimates/</link>
<guid isPermaLink="true">http://lesswrong.com/lw/h5e/fermi_estimates/</guid>
<pubDate>Fri, 12 Apr 2013 03:52:28 +1000</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/lukeprog"&gt;lukeprog&lt;/a&gt;
&amp;bull;
45 votes
&amp;bull;
&lt;a href="http://lesswrong.com/lw/h5e/fermi_estimates/#comments"&gt;104 comments&lt;/a&gt;
&lt;div&gt;&lt;p&gt;Just before the &lt;a href=&quot;http://is.gd/NeBfVu&quot;&gt;Trinity test&lt;/a&gt;, Enrico Fermi decided he wanted a rough estimate of the blast's power before the diagnostic data came in. So he dropped some pieces of paper from his hand as the blast wave passed him, and used this to estimate that the blast was equivalent to 10 kilotons of TNT. His guess was remarkably accurate for having so little data: the true answer turned out to be 20 kilotons of TNT.&lt;/p&gt;
&lt;p&gt;Fermi had a knack for making roughly-accurate estimates with very little data, and therefore such an estimate is known today as a &lt;a href=&quot;http://en.wikipedia.org/wiki/Fermi_problem&quot;&gt;Fermi estimate&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;Why bother with Fermi estimates, if your estimates are likely to be off by a factor of 2 or even 10? Often, getting an estimate within a factor of 10 or 20 is enough to make a decision. So Fermi estimates can save you a lot of time, especially as you gain more practice at making them.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h3&gt;Estimation tips&lt;/h3&gt;
&lt;p&gt;&lt;small&gt;These first two sections are adapted from &lt;em&gt;&lt;a href=&quot;http://www.amazon.com/Guesstimation-2-0-Solving-Todays-Problems/dp/069115080X/&quot;&gt;Guestimation 2.0&lt;/a&gt;&lt;/em&gt;.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Dare to be imprecise.&lt;/strong&gt; Round things off enough to do the calculations in your head. I call this the &lt;a href=&quot;http://en.wikipedia.org/wiki/Spherical_cow&quot;&gt;spherical cow principle&lt;/a&gt;, after a joke about how physicists oversimplify things to make calculations feasible:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Milk production at a dairy farm was low, so the farmer asked a local university for help. A multidisciplinary team of professors was assembled, headed by a theoretical physicist. After two weeks of observation and analysis, the physicist told the farmer, &quot;I have the solution, but it only works in the case of spherical cows in a vacuum.&quot;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;By the spherical cow principle, there are 300 days in a year, people are six feet (or 2 meters) tall, the circumference of the Earth is 20,000 mi (or 40,000 km), and cows are spheres of meat and bone 4 feet (or 1 meter) in diameter.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Decompose the problem.&lt;/strong&gt; Sometimes you can give an estimate in one step, within a factor of 10. (How much does a new compact car cost? $20,000.) But in most cases, you'll need to break the problem into several pieces, estimate each of them, and then recombine them. I'll give several examples below.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Estimate by bounding.&lt;/strong&gt; Sometimes it is easier to give lower and upper bounds than to give a point estimate. How much time per day does the average 15-year-old watch TV? I don't spend any time with 15-year-olds, so I haven't a clue. It could be 30 minutes, or 3 hours, or 5 hours, but I'm pretty confident it's more than 2 minutes and less than 7 hours (400 minutes, by the spherical cow principle).&lt;/p&gt;
&lt;p&gt;Can we convert those bounds into an estimate? You bet. But we don't do it by taking the &lt;em&gt;average&lt;/em&gt;. That would give us (2 mins + 400 mins)/2 = 201 mins, which is within a factor of 2 from our upper bound, but a factor &lt;em&gt;100&lt;/em&gt; greater than our lower bound. Since our goal is to estimate the answer within a factor of 10, we'll probably be way off.&lt;/p&gt;
&lt;p&gt;Instead, we take the &lt;em&gt;geometric mean&lt;/em&gt; &amp;#x2014; the square root of the product of our upper and lower bounds. But square roots often require a calculator, so instead we'll take the &lt;em&gt;approximate&lt;/em&gt; geometric mean (AGM). To do that, we average the coefficients and exponents of our upper and lower bounds.&lt;/p&gt;
&lt;p&gt;So what is the AGM of 2 and 400? Well, 2 is 2&amp;#xD7;10&lt;sup&gt;0&lt;/sup&gt;, and 400 is 4&amp;#xD7;10&lt;sup&gt;2&lt;/sup&gt;. The average of the coefficients (2 and 4) is 3; the average of the exponents (0 and 2) is 1. So, the AGM of 2 and 400 is 3&amp;#xD7;10&lt;sup&gt;1&lt;/sup&gt;, or 30. The precise geometric mean of 2 and 400 turns out to be 28.28. Not bad.&lt;/p&gt;
&lt;p&gt;What if the sum of the exponents is an odd number? Then we round the resulting exponent down, and multiply the final answer by three. So suppose my lower and upper bounds for how much TV the average 15-year-old watches had been 20 mins and 400 mins. Now we calculate the AGM like this: 20 is 2&amp;#xD7;10&lt;sup&gt;1&lt;/sup&gt;, and 400 is still 4&amp;#xD7;10&lt;sup&gt;2&lt;/sup&gt;. The average of the coefficients (2 and 4) is 3; the average of the exponents (1 and 2) is 1.5. So we round the exponent down to 1, and we multiple the final result by three: 3(3&amp;#xD7;10&lt;sup&gt;1&lt;/sup&gt;) = 90 mins. The precise geometric mean of 20 and 400 is 89.44. Again, not bad.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sanity-check your answer&lt;/strong&gt;. You should always sanity-check your final estimate by comparing it to some reasonable analogue. You'll see examples of this below.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Use Google as needed&lt;/strong&gt;. You can often quickly find the exact quantity you're trying to estimate on Google, or at least some &lt;em&gt;piece&lt;/em&gt; of the problem. In those cases, it's probably not worth trying to estimate it &lt;em&gt;without&lt;/em&gt; Google.&lt;/p&gt;
&lt;p&gt;&lt;a id=&quot;more&quot;&gt;&lt;/a&gt;&lt;/p&gt;
&lt;h3&gt;&lt;br&gt;&lt;/h3&gt;
&lt;h3&gt;Fermi estimation failure modes&lt;/h3&gt;
&lt;p&gt;Fermi estimates go wrong in one of three ways.&lt;/p&gt;
&lt;p&gt;First, we might badly overestimate or underestimate a quantity. Decomposing the problem, estimating from bounds, and looking up particular pieces on Google should protect against this. Overestimates and underestimates for the different pieces of a problem &lt;a href=&quot;http://en.wikipedia.org/wiki/Fermi_problem#Explanation&quot;&gt;should roughly cancel out&lt;/a&gt;, especially when there are many pieces.&lt;/p&gt;
&lt;p&gt;Second, we might model the problem incorrectly. If you estimate teenage deaths per year on the assumption that most teenage deaths are from suicide, your estimate will probably be way off, because most teenage deaths are caused by accidents. To avoid this, try to decompose each Fermi problem by using a model you're fairly confident of, even if it means you need to use more pieces or give wider bounds when estimating each quantity.&lt;/p&gt;
&lt;p&gt;Finally, we might choose a nonlinear problem. Normally, we assume that if one object can get some result, then two objects will get twice the result. Unfortunately, this doesn't hold true for nonlinear problems. If one motorcycle on a highway can transport a person at 60 miles per hour, then 30 motorcycles can transport 30 people at 60 miles per hour. However, 10&lt;sup&gt;4&lt;/sup&gt; motorcycles cannot transport 10&lt;sup&gt;4&lt;/sup&gt; people at 60 miles per hour, because there will be a huge traffic jam on the highway. This problem is difficult to avoid, but with practice you will get better at recognizing when you're facing a nonlinear problem.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h3&gt;Fermi practice&lt;/h3&gt;
&lt;p&gt;When getting started with Fermi practice, I recommend estimating quantities that you can easily look up later, so that you can see how accurate your Fermi estimates tend to be. Don't look up the answer before constructing your estimates, though! Alternatively, you might allow yourself to look up particular pieces of the problem &amp;#x2014; e.g. the &lt;a href=&quot;http://en.wikipedia.org/wiki/List_of_religious_populations#Adherent_estimates&quot;&gt;number of Sikhs&lt;/a&gt; in the world, the formula for &lt;a href=&quot;http://en.wikipedia.org/wiki/Escape_velocity&quot;&gt;escape velocity&lt;/a&gt;, or the &lt;a href=&quot;http://en.wikipedia.org/wiki/Gross_world_product&quot;&gt;gross world product&lt;/a&gt;&amp;#xA0;&amp;#x2014; but not the final quantity you're trying to estimate.&lt;/p&gt;
&lt;p&gt;Most books about Fermi estimates are filled with examples done by Fermi estimate experts, and in many cases the estimates were probably adjusted after the author looked up the true answers. This post is different. My examples below are estimates I made &lt;em&gt;before&lt;/em&gt; looking up the answer online, so you can get a realistic picture of how this works from someone who isn't &quot;cheating.&quot; Also, there will be no selection effect: I'm going to do four Fermi estimates for this post, and I'm not going to throw out my estimates if they are way off. Finally, I'm not all that practiced doing &quot;Fermis&quot; myself, so you'll get to see what it's like for a relative newbie to go through the process. In short, I hope to give you a realistic picture of what it's like to do Fermi practice when you're just getting started.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h3&gt;Example 1: How many new passenger cars are sold each year in the USA?&lt;/h3&gt;
&lt;p&gt;&lt;img src=&quot;http://i.imgur.com/aMD9a4s.jpg&quot; style=&quot;float: right; padding: 10px;&quot; alt=&quot;&quot;&gt;The classic Fermi problem is &quot;How many piano tuners are there in Chicago?&quot; This kind of estimate is useful if you want to know the approximate size of the customer base for a new product you might develop, for example. But I'm not sure anyone knows how many piano tuners there &lt;em&gt;really&lt;/em&gt; are in Chicago, so let's try a different one we probably &lt;em&gt;can&lt;/em&gt; look up later: &quot;How many new passenger cars are sold each year in the USA?&quot;&lt;/p&gt;
&lt;p&gt;As with all Fermi problems, there are many different models we could build. For example, we could estimate how many new cars a dealership sells per month, and then we could estimate how many dealerships there are in the USA. Or we could try to estimate the annual demand for new cars from the country's population. Or, if we happened to have read how many Toyota Corollas were sold last year, we could try to build our estimate from there.&lt;/p&gt;
&lt;p&gt;The second model looks more robust to me than the first, since I know roughly how many Americans there are, but I have no idea how many new-car dealerships there are. Still, let's try it both ways. (I &lt;em&gt;don't&lt;/em&gt; happen to know how many new Corollas were sold last year.)&lt;/p&gt;
&lt;h4&gt;Approach #1: Car dealerships&lt;/h4&gt;
&lt;p&gt;How many new cars does a dealership sell per month, on average? Oofta, I dunno. To support the dealership's existence, I assume it has to be at least 5. But it's probably not more than 50, since most dealerships are in small towns that don't get much action. To get my point estimate, I'll take the AGM of 5 and 50. 5 is 5&amp;#xD7;10&lt;sup&gt;0&lt;/sup&gt;, and 50 is 5&amp;#xD7;10&lt;sup&gt;1&lt;/sup&gt;. Our exponents sum to an odd number, so I'll round the exponent down to 0 and multiple the final answer by 3. So, my estimate of how many new cars a new-car dealership sells per month is 3(5&amp;#xD7;10&lt;sup&gt;0&lt;/sup&gt;) = 15.&lt;/p&gt;
&lt;p&gt;Now, how many new-car dealerships are there in the USA? This could be tough. I know several towns of only 10,000 people that have 3 or more new-car dealerships. I don't recall towns much smaller than that having new-car dealerships, so let's exclude them. How many cities of 10,000 people or more are there in the USA? I have no idea. So let's decompose this problem a bit more.&lt;/p&gt;
&lt;p&gt;How many &lt;em&gt;counties&lt;/em&gt; are there in the USA? I remember seeing a map of counties colored by which national ancestry was dominant in that county. (Germany was the most common.) Thinking of that map, there were definitely more than 300 counties on it, and definitely less than 20,000. What's the AGM of 300 and 20,000? Well, 300 is 3&amp;#xD7;10&lt;sup&gt;2&lt;/sup&gt;, and 20,000 is 2&amp;#xD7;10&lt;sup&gt;4&lt;/sup&gt;. The average of coefficients 3 and 2 is 2.5, and the average of exponents 2 and 4 is 3. So the AGM of 300 and 20,000 is 2.5&amp;#xD7;10&lt;sup&gt;3&lt;/sup&gt; = 2500.&lt;/p&gt;
&lt;p&gt;Now, how many towns of 10,000 people or more are there per county? I'm pretty sure the average must be larger than 10 and smaller than 5000. The AGM of 10 and 5000 is 300. (I won't include this calculation in the text anymore; you know how to do it.)&lt;/p&gt;
&lt;p&gt;Finally, how many car dealerships are there in cities of 10,000 or more people, on average? Most such towns are pretty small, and probably have 2-6 car dealerships. The largest cities will have many more: maybe 100-ish. So I'm pretty sure the average number of car dealerships in cities of 10,000 or more people must be between 2 and 30. The AGM of 2 and 30 is 7.5.&lt;/p&gt;
&lt;p&gt;Now I just multiply my estimates:&lt;/p&gt;
&lt;p&gt;[15 new cars sold per month per dealership] &amp;#xD7; [12 months per year] &amp;#xD7; [7.5 new-car dealerships per city of 10,000 or more people] &amp;#xD7; [300 cities of 10,000 or more people per county] &amp;#xD7; [2500 counties in the USA] = 1,012,500,000.&lt;/p&gt;
&lt;p&gt;A sanity check immediately invalidates this answer. There's no way that 300 million American citizens buy a &lt;em&gt;billion&lt;/em&gt; new cars per year. I suppose they &lt;em&gt;might&lt;/em&gt; buy 100 million new cars per year, which would be within a factor of 10 of my estimate, but I doubt it.&lt;/p&gt;
&lt;p&gt;As I suspected, my first approach was problematic. Let's try the second approach, starting from the population of the USA.&lt;/p&gt;
&lt;h4&gt;Approach #2: Population of the USA&lt;/h4&gt;
&lt;p&gt;There are about 300 million Americans. How many of them own a car? Maybe 1/3 of them, since children don't own cars, many people in cities don't own cars, and many households share a car or two between the adults in the household.&lt;/p&gt;
&lt;p&gt;Of the 100 million people who own a car, how many of them bought a &lt;em&gt;new&lt;/em&gt; car in the past 5 years? Probably less than half; most people buy used cars, right? So maybe 1/4 of car owners bought a new car in the past 5 years, which means 1 in 20 car owners bought a new car in the past &lt;em&gt;year&lt;/em&gt;.&lt;/p&gt;
&lt;p&gt;100 million / 20 = 5 million new cars sold each year in the USA. That doesn't seem crazy, though perhaps a bit low. I'll take this as my estimate.&lt;/p&gt;
&lt;p&gt;Now is your last chance to try this one on your own; in the next paragraph I'll reveal the true answer.&lt;/p&gt;
&lt;p&gt;&amp;#x2026;&lt;/p&gt;
&lt;p&gt;&amp;#x2026;&lt;/p&gt;
&lt;p&gt;...&lt;/p&gt;
&lt;p&gt;Now, I Google &lt;a href=&quot;https://www.google.com/#hl=en&amp;amp;output=search&amp;amp;sclient=psy-ab&amp;amp;q=new+cars+sold+per+year+in+the+USA&amp;amp;oq=new+cars+sold+per+year+in+the+USA&quot;&gt;new cars sold per year in the USA&lt;/a&gt;. Wikipedia is the first result, and it &lt;a href=&quot;http://en.wikipedia.org/wiki/Passenger_vehicles_in_the_United_States#Sales&quot;&gt;says&lt;/a&gt; &quot;In the year 2009, about 5.5 million new passenger cars were sold in the United States according to the U.S. Department of Transportation.&quot;&lt;/p&gt;
&lt;p&gt;Boo-yah!&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h3&gt;Example 2: How many fatalities from passenger-jet crashes have there been in the past 20 years?&lt;/h3&gt;
&lt;p&gt;&lt;img src=&quot;http://i.imgur.com/TyoL5r1.jpg&quot; style=&quot;float: right; padding: 10px;&quot; alt=&quot;&quot;&gt;Again, there are multiple models I could build. I could try to estimate how many passenger-jet flights there are per year, and then try to estimate the frequency of crashes and the average number of fatalities per crash. Or I could just try to guess the total number of passenger-jet crashes around the world per year and go from there.&lt;/p&gt;
&lt;p&gt;As far as I can tell, passenger-jet crashes (with fatalities) almost always make it on the TV news and (more relevant to me) the front page of Google News. Exciting footage and multiple deaths will do that. So working just from memory, it feels to me like there are about 5 passenger-jet crashes (with fatalities) per year, so maybe there were about 100 passenger jet crashes with fatalities in the past 20 years.&lt;/p&gt;
&lt;p&gt;Now, how many fatalities per crash? From memory, it seems like there are usually two kinds of crashes: ones where &lt;em&gt;everybody&lt;/em&gt; dies (meaning: about 200 people?), and ones where only about 10 people die. I think the &quot;everybody dead&quot; crashes are less common, maybe 1/4 as common. So the average crash with fatalities should cause (200&amp;#xD7;1/4)+(10&amp;#xD7;3/4) = 50+7.5 = 60, by the spherical cow principle.&lt;/p&gt;
&lt;p&gt;60 fatalities per crash &amp;#xD7; 100 crashes with fatalities over the past 20 years = 6000 passenger fatalities from passenger-jet crashes in the past 20 years.&lt;/p&gt;
&lt;p&gt;Last chance to try this one on your own...&lt;/p&gt;
&lt;p&gt;&amp;#x2026;&lt;/p&gt;
&lt;p&gt;&amp;#x2026;&lt;/p&gt;
&lt;p&gt;&amp;#x2026;&lt;/p&gt;
&lt;p&gt;A Google search again brings me to Wikipedia, which reveals that an organization called ACRO &lt;a href=&quot;http://en.wikipedia.org/wiki/Aviation_accidents_and_incidents#Aircraft_Crashes_Record_Office_.28ACRO.29&quot;&gt;records&lt;/a&gt; the number of airline fatalities each year. Unfortunately for my purposes, they include fatalities from cargo flights. After more Googling, I tracked down Boeing's &quot;&lt;a href=&quot;http://www.boeing.com/news/techissues/pdf/statsum.pdf&quot;&gt;Statistical Summary of Commercial Jet Airplane Accidents, 1959-2011&lt;/a&gt;,&quot; but that report excludes jets lighter than 60,000 pounds, and excludes crashes caused by hijacking or terrorism.&lt;/p&gt;
&lt;p&gt;It appears it would be a major research project to figure out the true answer to our question, but let's at least estimate it from the ACRO data. Luckily, ACRO has statistics on which percentage of accidents are from passenger and other kinds of flights, which I'll take as a proxy for which percentage of &lt;em&gt;fatalities&lt;/em&gt; are from different kinds of flights. According to &lt;a href=&quot;http://www.baaa-acro.com/Statistiques%20diverses.htm&quot;&gt;that page&lt;/a&gt;, 35.41% of accidents are from &quot;regular schedule&quot; flights, 7.75% of accidents are from &quot;private&quot; flights, 5.1% of accidents are from &quot;charter&quot; flights, and 4.02% of accidents are from &quot;executive&quot; flights. I think that captures what I had in mind as &quot;passenger-jet flights.&quot; So we'll guess that 52.28% of fatalities are from &quot;passenger-jet flights.&quot; I won't round this to 50% because we're not doing a Fermi estimate right now; we're trying to &lt;em&gt;check&lt;/em&gt; a Fermi estimate.&lt;/p&gt;
&lt;p&gt;According to ACRO's &lt;a href=&quot;http://www.baaa-acro.com/archives/Accidents.htm&quot;&gt;archives&lt;/a&gt;, there were 794 fatalities in 2012, 828 fatalities in 2011, and... well, from 1993-2012 there were a total of 28,021 fatalities. And 52.28% of that number is 14,649.&lt;/p&gt;
&lt;p&gt;So my estimate of 6000 was off by less than a factor of 3!&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h3&gt;Example 3: How much does the New York state government spends on K-12 education every year?&lt;/h3&gt;
&lt;p&gt;&lt;img src=&quot;http://i.imgur.com/N5zf5d3.jpg&quot; style=&quot;float: right; padding: 10px;&quot; alt=&quot;&quot;&gt;How might I estimate this? First I'll estimate the number of K-12 students in New York, and then I'll estimate how much this should cost.&lt;/p&gt;
&lt;p&gt;How many people live in New York? I seem to recall that NYC's greater metropolitan area is about 20 million people. That's probably most of the state's population, so I'll guess the total is about 30 million.&lt;/p&gt;
&lt;p&gt;How many of those 30 million people attend K-12 public schools? I can't remember what the United States' &lt;a href=&quot;http://en.wikipedia.org/wiki/Population_pyramid&quot;&gt;population pyramid&lt;/a&gt; looks like, but I'll guess that about 1/6 of Americans (and hopefully New Yorkers) attend K-12 at any given time. So that's 5 million kids in K-12 in New York. The number attending private schools probably isn't large enough to matter for factor-of-10 estimates.&lt;/p&gt;
&lt;p&gt;How much does a year of K-12 education cost for one child? Well, I've heard teachers don't get paid much, so after benefits and taxes and so on I'm guessing a teacher costs about $70,000 per year. How big are class sizes these days, 30 kids? By the spherical cow principle, that's about $2,000 per child, per year on teachers' salaries. But there are lots of other expenses: buildings, transport, materials, support staff, etc. And maybe some money goes to private schools or other organizations. Rather than estimate all those things, I'm just going to guess that about $10,000 is spent per child, per year.&lt;/p&gt;
&lt;p&gt;If that's right, then New York spends $50 billion per year on K-12 education.&lt;/p&gt;
&lt;p&gt;Last chance to make your own estimate!&lt;/p&gt;
&lt;p&gt;&amp;#x2026;&lt;/p&gt;
&lt;p&gt;&amp;#x2026;&lt;/p&gt;
&lt;p&gt;&amp;#x2026;&lt;/p&gt;
&lt;p&gt;Before I did the Fermi estimate, I had &lt;a href=&quot;http://www.juliagalef.com/&quot;&gt;Julia Galef&lt;/a&gt; check Google to find this statistic, but she didn't give me any hints about the number. Her two sources were &lt;a href=&quot;http://www.wolframalpha.com/input/?i=new+york+state+spending+education+k-12&quot;&gt;Wolfram Alpha&lt;/a&gt; and a &lt;a href=&quot;http://governor.ny.gov/citizenconnects/?q=content/web-chat-deputy-secretary-education-david-wakelyn&quot;&gt;web chat&lt;/a&gt; with New York's Deputy Secretary for Education, both of which put the figure at approximately $53 billion.&lt;/p&gt;
&lt;p&gt;Which is definitely within a factor of 10 from $50 billion. :)&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h3&gt;Example 4: How many plays of My Bloody Valentine's &quot;Only Shallow&quot; have been reported to last.fm?&lt;/h3&gt;
&lt;p&gt;&lt;img src=&quot;http://i.imgur.com/sr8R6T5.jpg&quot; style=&quot;float: right; padding: 10px;&quot; alt=&quot;&quot;&gt;&lt;a href=&quot;http://en.wikipedia.org/wiki/Last.fm&quot;&gt;Last.fm&lt;/a&gt; makes a record of every audio track you play, if you enable the relevant feature or plugin for the music software on your phone, computer, or other device. Then, the service can show you charts and statistics about your listening patterns, and make personalized music recommendations from them. My own charts are &lt;a href=&quot;http://www.last.fm/user/lukeprog/charts&quot;&gt;here&lt;/a&gt;. (Chuck Wild / &lt;a href=&quot;http://www.liquidmindmusic.com/&quot;&gt;Liquid Mind&lt;/a&gt; dominates my charts because I used to listen to that artist while sleeping.)&lt;/p&gt;
&lt;p&gt;My Fermi problem is: How many plays of &quot;&lt;a href=&quot;http://www.youtube.com/watch?v=FyYMzEplnfU&quot;&gt;Only Shallow&lt;/a&gt;&quot; have been reported to last.fm?&lt;/p&gt;
&lt;p&gt;My Bloody Valentine is a popular &quot;indie&quot; rock band, and &quot;Only Shallow&quot; is probably one of their most popular tracks. How can I estimate how many plays it has gotten on last.fm?&lt;/p&gt;
&lt;p&gt;What do I know that might help?&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;I know last.fm is popular, but I don't have a sense of whether they have 1 million users, 10 million users, or 100 million users. &lt;/li&gt;
&lt;li&gt;I accidentally saw on Last.fm's Wikipedia page that just over 50 billion track plays have been recorded. We'll consider that to be one piece of data I looked up to help with my estimate. &lt;/li&gt;
&lt;li&gt;I seem to recall reading that major music services like iTunes and Spotify have about 10 million tracks. Since last.fm records songs that people play from their private collections, whether or not they exist in popular databases, I'd guess that the total number of different tracks named in last.fm's database is an order of magnitude larger, for about 100 million tracks named in its database.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;I would guess that track plays obey a &lt;a href=&quot;http://en.wikipedia.org/wiki/Power_law&quot;&gt;power law&lt;/a&gt;, with the most popular tracks getting vastly more plays than tracks of average popularity. I'd also guess that there are maybe 10,000 tracks more popular than &quot;Only Shallow.&quot;&lt;/p&gt;
&lt;p&gt;Next, I simulated being good at math by having &lt;a href=&quot;/user/Qiaochu_Yuan/overview/&quot;&gt;Qiaochu Yuan&lt;/a&gt; show me how to do the calculation. I also allowed myself to use a calculator. Here's what we do:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Plays(rank) = C/(rank&lt;sup&gt;P&lt;/sup&gt;)&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;P is the exponent for the power law, and C is the proportionality constant. We'll guess that P is 1, a common power law exponent for empirical data. And we calculate C like so:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;C &amp;#x2248; [total plays]/ln(total songs) &amp;#x2248; 2.5 billion&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;So now, assuming the song's rank is 10,000, we have:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Plays(10&lt;sup&gt;4&lt;/sup&gt;) = 2.5&amp;#xD7;10&lt;sup&gt;9&lt;/sup&gt;/(10&lt;sup&gt;4&lt;/sup&gt;)&lt;/p&gt;
&lt;p&gt;Plays(&quot;Only Shallow&quot;) = 250,000&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;That seems high, but let's roll with it. Last chance to make your own estimate!&lt;/p&gt;
&lt;p&gt;&amp;#x2026;&lt;/p&gt;
&lt;p&gt;&amp;#x2026;&lt;/p&gt;
&lt;p&gt;...&lt;/p&gt;
&lt;p&gt;And when I &lt;a href=&quot;http://www.last.fm/music/My+Bloody+Valentine/_/Only+Shallow&quot;&gt;check the answer&lt;/a&gt;, I see that &quot;Only Shallow&quot; has about 2 million plays on last.fm.&lt;/p&gt;
&lt;p&gt;My answer was off by less than a factor of 10, which for a Fermi estimate is called &lt;em&gt;victory&lt;/em&gt;!&lt;/p&gt;
&lt;p&gt;Unfortunately, last.fm doesn't publish all-time track rankings or other data that might help me to determine which parts of my model were correct and incorrect.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h3&gt;Further examples&lt;/h3&gt;
&lt;p&gt;I focused on examples that are similar in structure to the kinds of quantities that entrepreneurs and CEOs might want to estimate, but of course there are all kinds of things one can estimate this way. Here's a sampling of Fermi problems featured in various books and websites on the subject:&lt;/p&gt;
&lt;p&gt;&lt;em&gt;&lt;a href=&quot;http://www.amazon.com/Guesstimation-Solving-Worlds-Problems-Cocktail/dp/0691129495/&quot;&gt;Guesstimation&lt;/a&gt;&lt;/em&gt; (2008): If all the humans in the world were crammed together, how much area would we require? What would be the mass of all 10&lt;sup&gt;8&lt;/sup&gt; MongaMillions lottery tickets? On average, how many people are airborne over the US at any given moment? How many cells are there in the human body? How many people in the world are picking their nose right now? What are the relative costs of fuel for NYC rickshaws and automobiles?&lt;/p&gt;
&lt;p&gt;&lt;em&gt;&lt;a href=&quot;http://www.amazon.com/Guesstimation-2-0-Solving-Todays-Problems/dp/069115080X/&quot;&gt;Guesstimation 2.0&lt;/a&gt; (2011): If we launched a trillion one-dollar bills into the atmosphere, what fraction of sunlight hitting the Earth could we block with those dollar bills? If a million monkeys typed randomly on a million typewriters for a year, what is the longest string of consecutive correct letters of *The Cat in the Hat&lt;/em&gt; (starting from the beginning) would they likely type? How much energy does it take to crack a nut? If an airline asked its passengers to urinate before boarding the airplane, how much fuel would the airline save per flight? What is the radius of the largest rocky sphere from which we can reach escape velocity by jumping?&lt;/p&gt;
&lt;p&gt;&lt;em&gt;&lt;a href=&quot;http://www.amazon.com/How-Many-Licks-Estimate-Anything/dp/0762435607/&quot;&gt;How Many Licks?&lt;/a&gt;&lt;/em&gt; (2009): What fraction of Earth's volume would a &lt;a href=&quot;http://en.wikipedia.org/wiki/Mole_(unit)&quot;&gt;mole&lt;/a&gt; of hot, sticky, chocolate-jelly doughnuts be? How many miles does a person walk in a lifetime? How many times can you outline the continental US in shoelaces? How long would it take to read every book in the library? How long can you shower and still make it more environmentally friendly than taking a bath?&lt;/p&gt;
&lt;p&gt;&lt;em&gt;&lt;a href=&quot;http://www.amazon.com/Ballparking-Practical-Impractical-Sports-Questions/dp/0762443456/&quot;&gt;Ballparking&lt;/a&gt;&lt;/em&gt; (2012): How many bolts are in the floor of the Boston Garden basketball court? How many lanes would you need for the outermost lane of a running track to be the length of a marathon? How hard would you have to hit a baseball for it to never land?&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;http://www.physics.umd.edu/perg/fermi/fermi.htm&quot;&gt;University of Maryland Fermi Problems Site&lt;/a&gt;: How many sheets of letter-sized paper are used by all students at the University of Maryland in one semester? How many blades of grass are in the lawn of a typical suburban house in the summer? How many golf balls can be fit into a typical suitcase?&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h3&gt;Conclusion&lt;/h3&gt;
&lt;p&gt;Fermi estimates can help you become more efficient in your day-to-day life, and give you increased confidence in the decisions you face. If you want to become proficient in making Fermi estimates, I recommend practicing them 30 minutes per day for three months. In that time, you should be able to make about (2 Fermis per day)&amp;#xD7;(90 days) = 180 Fermi estimates.&lt;/p&gt;
&lt;p&gt;If you'd like to write down your estimation attempts and then publish them here, please do so as a reply to &lt;a href=&quot;/lw/h5e/fermi_estimates/8ppa&quot;&gt;this comment&lt;/a&gt;. One Fermi estimate per comment, please!&lt;/p&gt;
&lt;p&gt;Alternatively, post your Fermi estimates to the &lt;a href=&quot;http://www.reddit.com/r/estimation/&quot;&gt;dedicated subreddit&lt;/a&gt;.&lt;/p&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/lw/h5e/fermi_estimates/#comments"&gt;104 comments&lt;/a&gt;
</description>
</item>
<item>
<title>Decision Theory FAQ</title>
<link>http://lesswrong.com/lw/gu1/decision_theory_faq/</link>
<guid isPermaLink="true">http://lesswrong.com/lw/gu1/decision_theory_faq/</guid>
<pubDate>Fri, 01 Mar 2013 01:15:55 +1100</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/lukeprog"&gt;lukeprog&lt;/a&gt;
&amp;bull;
45 votes
&amp;bull;
&lt;a href="http://lesswrong.com/lw/gu1/decision_theory_faq/#comments"&gt;455 comments&lt;/a&gt;
&lt;div&gt;&lt;p&gt;&lt;small&gt;Co-authored with &lt;a href=&quot;/user/crazy88/overview/&quot;&gt;crazy88&lt;/a&gt;. Please let us know when you find mistakes, and we'll fix them. Last updated 03-27-2013.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Contents&lt;/strong&gt;:&lt;/p&gt;
&lt;div id=&quot;TOC&quot;&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&quot;#what-is-decision-theory&quot;&gt;1. What is decision theory?&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#is-the-rational-decision-always-the-right-decision&quot;&gt;2. Is the rational decision always the right decision?&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#how-can-i-better-understand-a-decision-problem&quot;&gt;3. How can I better understand a decision problem?&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#how-can-i-measure-an-agents-preferences&quot;&gt;4. How can I measure an agent's preferences?&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&quot;#the-concept-of-utility&quot;&gt;4.1. The concept of utility&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#types-of-utility&quot;&gt;4.2. Types of utility&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#what-do-decision-theorists-mean-by-risk-ignorance-and-uncertainty&quot;&gt;5. What do decision theorists mean by &quot;risk,&quot; &quot;ignorance,&quot; and &quot;uncertainty&quot;?&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#how-should-i-make-decisions-under-ignorance&quot;&gt;6. How should I make decisions under ignorance?&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&quot;#the-dominance-principle&quot;&gt;6.1. The dominance principle&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#maximin-and-leximin&quot;&gt;6.2. Maximin and leximin&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#maximax-and-optimism-pessimism&quot;&gt;6.3. Maximax and optimism-pessimism&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#other-decision-principles&quot;&gt;6.4. Other decision principles&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#can-decisions-under-ignorance-be-transformed-into-decisions-under-uncertainty&quot;&gt;7. Can decisions under ignorance be transformed into decisions under uncertainty?&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#how-should-i-make-decisions-under-uncertainty&quot;&gt;8. How should I make decisions under uncertainty?&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&quot;#the-law-of-large-numbers&quot;&gt;8.1. The law of large numbers&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#the-axiomatic-approach&quot;&gt;8.2. The axiomatic approach&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#the-von-neumann-morgenstern-utility-theorem&quot;&gt;8.3. The Von Neumann-Morgenstern utility theorem&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#vnm-utility-theory-and-rationality&quot;&gt;8.4. VNM utility theory and rationality&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#objections-to-vnm-rationality&quot;&gt;8.5. Objections to VNM-rationality&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#should-we-accept-the-vnm-axioms&quot;&gt;8.6. Should we accept the VNM axioms?&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#does-axiomatic-decision-theory-offer-any-action-guidance&quot;&gt;9. Does axiomatic decision theory offer any action guidance?&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#how-does-probability-theory-play-a-role-in-decision-theory&quot;&gt;10. How does probability theory play a role in decision theory?&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&quot;#the-basics-of-probability-theory&quot;&gt;10.1. The basics of probability theory&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#bayes-theorem-for-updating-probabilities&quot;&gt;10.2. Bayes theorem for updating probabilities&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#how-should-probabilities-be-interpreted&quot;&gt;10.3. How should probabilities be interpreted?&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#what-about-newcombs-problem-and-alternative-decision-algorithms&quot;&gt;11. What about &quot;Newcomb's problem&quot; and alternative decision algorithms?&lt;/a&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&quot;#newcomblike-problems-and-two-decision-algorithms&quot;&gt;11.1. Newcomblike problems and two decision algorithms&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#benchmark-theory-bt&quot;&gt;11.2. Benchmark theory (BT)&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#timeless-decision-theory-tdt&quot;&gt;11.3. Timeless decision theory (TDT)&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#decision-theory-and-winning&quot;&gt;11.4. Decision theory and &amp;#x201C;winning&amp;#x201D;&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;/div&gt;
&lt;h2 id=&quot;what-is-decision-theory&quot;&gt;&lt;br&gt;&lt;/h2&gt;
&lt;h2&gt;&lt;a href=&quot;#what-is-decision-theory&quot;&gt;1. What is decision theory?&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;&lt;em&gt;Decision theory&lt;/em&gt;, also known as &lt;em&gt;rational choice theory&lt;/em&gt;, concerns the study of preferences, uncertainties, and other issues related to making &quot;optimal&quot; or &quot;rational&quot; choices. It has been discussed by economists, psychologists, philosophers, mathematicians, statisticians, and computer scientists.&lt;/p&gt;
&lt;p&gt;We can divide decision theory into three parts (&lt;a href=&quot;http://www.owlnet.rice.edu/~econ501/lectures/Decision_EU.pdf&quot;&gt;Grant &amp;amp; Zandt 2009&lt;/a&gt;; &lt;a href=&quot;http://www.amazon.com/dp/0521680433/&quot;&gt;Baron 2008&lt;/a&gt;). &lt;em&gt;Normative&lt;/em&gt; decision theory studies what an ideal agent (a perfectly rational agent, with infinite computing power, etc.) would choose. &lt;em&gt;Descriptive&lt;/em&gt; decision theory studies how non-ideal agents (e.g. humans) &lt;em&gt;actually&lt;/em&gt; choose. &lt;em&gt;Prescriptive&lt;/em&gt; decision theory studies how non-ideal agents can improve their decision-making (relative to the normative model) despite their imperfections.&lt;/p&gt;
&lt;p&gt;For example, one's &lt;em&gt;normative&lt;/em&gt; model might be &lt;a href=&quot;http://kleene.ss.uci.edu/lpswiki/index.php/Expected_Utility_Theory&quot;&gt;expected utility theory&lt;/a&gt;, which says that a rational agent chooses the action with the highest expected utility. Replicated results in psychology &lt;em&gt;describe&lt;/em&gt; humans repeatedly &lt;em&gt;failing&lt;/em&gt; to maximize expected utility in particular, &lt;a href=&quot;http://www.amazon.com/Predictably-Irrational-Revised-Expanded-Edition/dp/0061353248/&quot;&gt;predictable&lt;/a&gt; ways: for example, they make some choices based not on potential future benefits but on irrelevant past efforts (the &quot;&lt;a href=&quot;http://en.wikipedia.org/wiki/Sunk_costs&quot;&gt;sunk cost fallacy&lt;/a&gt;&quot;). To help people avoid this error, some theorists &lt;em&gt;prescribe&lt;/em&gt; some basic training in microeconomics, which has been shown to reduce the likelihood that humans will commit the sunk costs fallacy (&lt;a href=&quot;http://commonsenseatheism.com/wp-content/uploads/2012/08/Larrick-et-al-Teaching-the-use-of-cost-benefit-reasoning-in-everyday-life.pdf&quot;&gt;Larrick et al. 1990&lt;/a&gt;). Thus, through a coordination of normative, descriptive, and prescriptive research we can help agents to succeed in life by acting more in accordance with the normative model than they otherwise would.&lt;/p&gt;
&lt;p&gt;This FAQ focuses on normative decision theory. Good sources on descriptive and prescriptive decision theory include &lt;a href=&quot;http://www.amazon.com/Rationality-Reflective-Mind-Keith-Stanovich/dp/0195341147/&quot;&gt;Stanovich (2010)&lt;/a&gt; and &lt;a href=&quot;http://www.amazon.com/Rational-Choice-Uncertain-World-Psychology/dp/1412959039/&quot;&gt;Hastie &amp;amp; Dawes (2009)&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;Two related fields beyond the scope of this FAQ are &lt;a href=&quot;http://en.wikipedia.org/wiki/Game_theory&quot;&gt;game theory&lt;/a&gt; and &lt;a href=&quot;http://en.wikipedia.org/wiki/Social_choice_theory&quot;&gt;social choice theory&lt;/a&gt;. Game theory is the study of conflict and cooperation among multiple decision makers, and is thus sometimes called &quot;interactive decision theory.&quot; Social choice theory is the study of making a collective decision by combining the preferences of multiple decision makers in various ways.&lt;/p&gt;
&lt;p&gt;This FAQ draws heavily from two textbooks on decision theory: &lt;a href=&quot;http://www.amazon.com/Choices-An-Introduction-Decision-Theory/dp/0816614407/&quot;&gt;Resnik (1987)&lt;/a&gt; and &lt;a href=&quot;http://www.amazon.com/Introduction-Decision-Cambridge-Introductions-Philosophy/dp/0521716543/&quot;&gt;Peterson (2009)&lt;/a&gt;. It also draws from more recent results in decision theory, published in journals such as &lt;em&gt;&lt;a href=&quot;http://www.springerlink.com/content/0039-7857&quot;&gt;Synthese&lt;/a&gt;&lt;/em&gt; and &lt;em&gt;&lt;a href=&quot;http://www.springerlink.com/content/0040-5833&quot;&gt;Theory and Decision&lt;/a&gt;&lt;/em&gt;.&lt;/p&gt;
&lt;p&gt;&lt;a id=&quot;more&quot;&gt;&lt;/a&gt;&lt;/p&gt;
&lt;h2 id=&quot;is-the-rational-decision-always-the-right-decision&quot;&gt;&lt;a href=&quot;#is-the-rational-decision-always-the-right-decision&quot;&gt;2. Is the rational decision always the right decision?&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;No. Peterson (&lt;a href=&quot;http://www.amazon.com/Introduction-Decision-Cambridge-Introductions-Philosophy/dp/0521716543/&quot;&gt;2009&lt;/a&gt;, ch. 1) explains:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;[In 1700], King Carl of Sweden and his 8,000 troops attacked the Russian army [which] had about ten times as many troops... Most historians agree that the Swedish attack was irrational, since it was almost certain to fail... However, because of an unexpected blizzard that blinded the Russian army, the Swedes won...&lt;/p&gt;
&lt;/blockquote&gt;
&lt;blockquote&gt;
&lt;p&gt;Looking back, the Swedes' decision to attack the Russian army was no doubt right, since the &lt;em&gt;actual outcome&lt;/em&gt; turned out to be success. However, since the Swedes had no &lt;em&gt;good reason&lt;/em&gt; for expecting that they were going to win, the decision was nevertheless irrational.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;blockquote&gt;
&lt;p&gt;More generally speaking, we say that a decision is &lt;em&gt;right&lt;/em&gt; if and only if its actual outcome is at least as good as that of every other possible outcome. Furthermore, we say that a decision is &lt;em&gt;rational&lt;/em&gt; if and only if the decision maker [&lt;em&gt;aka&lt;/em&gt; the &quot;agent&quot;] chooses to do what she has most reason to do at the point in time at which the decision is made.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Unfortunately, we cannot know with certainty what the right decision is. Thus, the best we can do is to try to make &quot;rational&quot; or &quot;optimal&quot; decisions based on our preferences and incomplete information.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h2 id=&quot;how-can-i-better-understand-a-decision-problem&quot;&gt;&lt;a href=&quot;#how-can-i-better-understand-a-decision-problem&quot;&gt;3. How can I better understand a decision problem?&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;First, we must &lt;em&gt;formalize&lt;/em&gt; a decision problem. It usually helps to &lt;em&gt;visualize&lt;/em&gt; the decision problem, too.&lt;/p&gt;
&lt;p&gt;In decision theory, decision rules are only defined relative to a formalization of a given decision problem, and a formalization of a decision problem can be visualized in multiple ways. Here is an example from Peterson (&lt;a href=&quot;http://www.amazon.com/Introduction-Decision-Cambridge-Introductions-Philosophy/dp/0521716543/&quot;&gt;2009&lt;/a&gt;, ch. 2):&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Suppose... that you are thinking about taking out fire insurance on your home. Perhaps it costs $100 to take out insurance on a house worth $100,000, and you ask: Is it worth it?&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;The most common way to formalize a decision problem is to break it into states, acts, and outcomes. When facing a decision problem, the decision maker aims to choose the &lt;em&gt;act&lt;/em&gt; that will have the best &lt;em&gt;outcome&lt;/em&gt;. But the outcome of each act depends on the &lt;em&gt;state&lt;/em&gt; of the world, which is unknown to the decision maker.&lt;/p&gt;
&lt;p&gt;In this framework, speaking loosely, a state is a part of the world that is not an act (that can be performed now by the decision maker) or an outcome (the question of what, more precisely, states are is a complex question that is beyond the scope of this document). Luckily, not all states are relevant to a particular decision problem. We only need to take into account states that affect the agent's preference among acts. A simple formalization of the fire insurance problem might include only two states: the state in which your house doesn't (later) catch on fire, and the state in which your house &lt;em&gt;does&lt;/em&gt; (later) catch on fire.&lt;/p&gt;
&lt;p&gt;Presumably, the agent prefers some outcomes to others. Suppose the four conceivable outcomes in the above decision problem are: (1) House and $0, (2) House and -$100, (3) No house and $99,900, and (4) No house and $0. In this case, the decision maker might prefer outcome 1 over outcome 2, outcome 2 over outcome 3, and outcome 3 over outcome 4. (We'll discuss measures of value for outcomes in the next section.)&lt;/p&gt;
&lt;p&gt;An act is commonly taken to be a function that takes one set of the possible states of the world as input and gives a particular outcome as output. For the above decision problem we could say that if the act &quot;Take out insurance&quot; has the world-state &quot;Fire&quot; as its input, then it will give the outcome &quot;No house and $99,900&quot; as its output.&lt;/p&gt;
&lt;div class=&quot;figure&quot;&gt;&lt;img src=&quot;http://i.imgur.com/DhCAW.jpg&quot; alt=&quot;An outline of the states, acts and outcomes in the insurance case&quot;&gt;
&lt;p class=&quot;caption&quot;&gt;An outline of the states, acts and outcomes in the insurance case&lt;/p&gt;
&lt;/div&gt;
&lt;p&gt;Note that decision theory is concerned with &lt;em&gt;particular&lt;/em&gt; acts rather than &lt;em&gt;generic&lt;/em&gt; acts, e.g. &quot;sailing west in 1492&quot; rather than &quot;sailing.&quot; Moreover, the acts of a decision problem must be &lt;em&gt;alternative&lt;/em&gt; acts, so that the decision maker has to choose exactly &lt;em&gt;one&lt;/em&gt; act.&lt;/p&gt;
&lt;p&gt;Once a decision problem has been formalized, it can then be visualized in any of several ways.&lt;/p&gt;
&lt;p&gt;One way to visualize this decision problem is to use a &lt;em&gt;decision matrix&lt;/em&gt;:&lt;/p&gt;
&lt;table cellpadding=&quot;3&quot; cellspacing=&quot;5&quot; border=&quot;0&quot;&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&amp;#xA0;&lt;/td&gt;
&lt;td&gt;&lt;em&gt;Fire&lt;/em&gt;&lt;/td&gt;
&lt;td&gt;&lt;em&gt;No fire&lt;/em&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;em&gt;Take out insurance&lt;/em&gt;&lt;/td&gt;
&lt;td&gt;No house and $99,900&lt;/td&gt;
&lt;td&gt;House and -$100&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;em&gt;No insurance&lt;/em&gt;&lt;/td&gt;
&lt;td&gt;No house and $0&lt;/td&gt;
&lt;td&gt;House and $0&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;
&lt;p&gt;Another way to visualize this problem is to use a &lt;em&gt;decision tree&lt;/em&gt;:&lt;/p&gt;
&lt;p&gt;&lt;img src=&quot;http://commonsenseatheism.com/wp-content/uploads/2012/09/basic-decision-tree.gif&quot; alt=&quot;&quot;&gt;&lt;/p&gt;
&lt;p&gt;The square is a &lt;em&gt;choice node&lt;/em&gt;, the circles are &lt;em&gt;chance nodes&lt;/em&gt;, and the triangles are &lt;em&gt;terminal nodes&lt;/em&gt;. At the choice node, the decision maker chooses which branch of the decision tree to take. At the chance nodes, &lt;em&gt;nature&lt;/em&gt; decides which branch to follow. The triangles represent outcomes.&lt;/p&gt;
&lt;p&gt;Of course, we could add more branches to each choice node and each chance node. We could also add more choice nodes, in which case we are representing a &lt;em&gt;sequential&lt;/em&gt; decision problem. Finally, we could add probabilities to each branch, as long as the probabilities of all the branches extending from each single node sum to 1. And because a decision tree obeys the laws of probability theory, we can calculate the probability of any given node by multiplying the probabilities of all the branches preceding it.&lt;/p&gt;
&lt;p&gt;Our decision problem could also be represented as a &lt;em&gt;vector&lt;/em&gt; &amp;#x2014; an ordered list of mathematical objects that is perhaps most suitable for computers:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;[&lt;br&gt; [a&lt;sub&gt;1&lt;/sub&gt; = take out insurance,&lt;br&gt; a&lt;sub&gt;2&lt;/sub&gt; = do not];&lt;br&gt; [s&lt;sub&gt;1&lt;/sub&gt; = fire,&lt;br&gt; s&lt;sub&gt;2&lt;/sub&gt; = no fire];&lt;br&gt; [(a&lt;sub&gt;1&lt;/sub&gt;, s&lt;sub&gt;1&lt;/sub&gt;) = No house and $99,900,&lt;br&gt; (a&lt;sub&gt;1&lt;/sub&gt;, s&lt;sub&gt;2&lt;/sub&gt;) = House and -$100,&lt;br&gt; (a&lt;sub&gt;2&lt;/sub&gt;, s&lt;sub&gt;1&lt;/sub&gt;) = No house and $0,&lt;br&gt; (a&lt;sub&gt;2&lt;/sub&gt;, s&lt;sub&gt;2&lt;/sub&gt;) = House and $0]&lt;br&gt; ]&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;For more details on formalizing and visualizing decision problems, see &lt;a href=&quot;http://www.amazon.com/Introduction-Decision-Analysis-3rd-Edition/dp/0964793865/&quot;&gt;Skinner (1993)&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h2 id=&quot;how-can-i-measure-an-agents-preferences&quot;&gt;&lt;a href=&quot;#how-can-i-measure-an-agents-preferences&quot;&gt;4. How can I measure an agent's preferences?&lt;/a&gt;&lt;/h2&gt;
&lt;h3 id=&quot;the-concept-of-utility&quot;&gt;&lt;a href=&quot;#the-concept-of-utility&quot;&gt;4.1. The concept of utility&lt;/a&gt;&lt;/h3&gt;
&lt;p&gt;It is important not to measure an agent's preferences in terms of &lt;em&gt;objective&lt;/em&gt; value, e.g. monetary value. To see why, consider the absurdities that can result when we try to measure an agent's preference with money alone.&lt;/p&gt;
&lt;p&gt;Suppose you may choose between (A) receiving a million dollars &lt;em&gt;for sure&lt;/em&gt;, and (B) a 50% chance of winning either $3 million or nothing. The &lt;em&gt;expected monetary value&lt;/em&gt; (EMV) of your act is computed by multiplying the monetary value of each possible outcome by its probability. So, the EMV of choice A is (1)($1 million) = $1 million. The EMV of choice B is (0.5)($3 million) + (0.5)($0) = $1.5 million. Choice B has a higher expected monetary value, and yet many people would prefer the guaranteed million.&lt;/p&gt;
&lt;p&gt;Why? For many people, the difference between having $0 and $1 million is &lt;em&gt;subjectively&lt;/em&gt; much larger than the difference between having $1 million and $3 million, even if the latter difference is larger in dollars.&lt;/p&gt;
&lt;p&gt;To capture an agent's &lt;em&gt;subjective&lt;/em&gt; preferences, we use the concept of &lt;em&gt;utility&lt;/em&gt;. A &lt;em&gt;utility function&lt;/em&gt; assigns numbers to outcomes such that outcomes with higher numbers are preferred to outcomes with lower numbers. For example, for a particular decision maker &amp;#x2014; say, one who has no money &amp;#x2014; the utility of $0 might be 0, the utility of $1 million might be 1000, and the utility of $3 million might be 1500. Thus, the &lt;em&gt;expected utility&lt;/em&gt; (EU) of choice A is, for this decision maker, (1)(1000) = 1000. Meanwhile, the EU of choice B is (0.5)(1500) + (0.5)(0) = 750. In this case, the expected utility of choice A is greater than that of choice B, even though choice B has a greater expected monetary value.&lt;/p&gt;
&lt;p&gt;Note that those from the field of statistics who work on decision theory tend to talk about a &quot;loss function,&quot; which is simply an &lt;em&gt;inverse&lt;/em&gt; utility function. For an overview of decision theory from this perspective, see &lt;a href=&quot;http://www.amazon.com/Statistical-Decision-Bayesian-Analysis-Statistics/dp/1441930744/&quot;&gt;Berger (1985)&lt;/a&gt; and &lt;a href=&quot;http://www.amazon.com/Bayesian-Choice-Decision-Theoretic-Computational-Implementation/dp/0387715983/&quot;&gt;Robert (2001)&lt;/a&gt;. For a critique of some standard results in statistical decision theory, see &lt;a href=&quot;http://www.amazon.com/Probability-Theory-The-Logic-Science/dp/0521592712/&quot;&gt;Jaynes (2003, ch. 13)&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h3 id=&quot;types-of-utility&quot;&gt;&lt;a href=&quot;#types-of-utility&quot;&gt;4.2. Types of utility&lt;/a&gt;&lt;/h3&gt;
&lt;p&gt;An agent's utility function can't be directly observed, so it must be constructed &amp;#x2014; e.g. by asking them which options they prefer for a large set of pairs of alternatives (as on &lt;a href=&quot;http://www.whoishotter.com&quot;&gt;WhoIsHotter.com&lt;/a&gt;). The number that corresponds to an outcome's utility can convey different information depending on the &lt;em&gt;utility scale&lt;/em&gt; in use, and the utility scale in use depends on how the utility function is constructed.&lt;/p&gt;
&lt;p&gt;Decision theorists distinguish three kinds of utility scales:&lt;/p&gt;
&lt;ol style=&quot;list-style-type: decimal&quot;&gt;
&lt;li&gt;
&lt;p&gt;Ordinal scales (&quot;12 is better than 6&quot;). In an ordinal scale, preferred outcomes are assigned higher numbers, but the numbers don't tell us anything about the differences or ratios between the utility of different outcomes.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Interval scales (&quot;the difference between 12 and 6 equals that between 6 and 0&quot;). An interval scale gives us more information than an ordinal scale. Not only are preferred outcomes assigned higher numbers, but also the numbers accurately reflect the &lt;em&gt;difference&lt;/em&gt; between the utility of different outcomes. They do not, however, necessarily reflect the ratios of utility between different outcomes. If outcome A has utility 0, outcome B has utility 6, and outcome C has utility 12 on an interval scale, then we know that the difference in utility between outcomes A and B and between outcomes B and C is the same, but we can't know whether outcome B is &quot;twice as good&quot; as outcome A.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Ratio scales (&quot;12 is exactly &lt;em&gt;twice&lt;/em&gt; as valuable as 6&quot;). Numerical utility assignments on a ratio scale give us the most information of all. They accurately reflect preference rankings, differences, &lt;em&gt;and&lt;/em&gt; ratios. Thus, we can say that an outcome with utility 12 is exactly &lt;em&gt;twice&lt;/em&gt; as valuable to the agent in question as an outcome with utility 6.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;Note that neither &lt;em&gt;experienced utility&lt;/em&gt; (happiness) nor the notions of &quot;average utility&quot; or &quot;total utility&quot; discussed by utilitarian moral philosophers are the same thing as the &lt;em&gt;decision utility&lt;/em&gt; that we are discussing now to describe decision preferences. As the situation merits, we can be even more specific. For example, when discussing the type of decision utility used in an interval scale utility function constructed using Von Neumann &amp;amp; Morgenstern's axiomatic approach (see section 8), some people use the term &lt;em&gt;VNM-utility&lt;/em&gt;.&lt;/p&gt;
&lt;p&gt;Now that you know that an agent's preferences can be represented as a &quot;utility function,&quot; and that assignments of utility to outcomes can mean different things depending on the utility scale of the utility function, we are ready to think more formally about the challenge of making &quot;optimal&quot; or &quot;rational&quot; choices. (We will return to the problem of constructing an agent's utility function later, in section 8.3.)&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h2 id=&quot;what-do-decision-theorists-mean-by-risk-ignorance-and-uncertainty&quot;&gt;&lt;a href=&quot;#what-do-decision-theorists-mean-by-risk-ignorance-and-uncertainty&quot;&gt;5. What do decision theorists mean by &quot;risk,&quot; &quot;ignorance,&quot; and &quot;uncertainty&quot;?&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Peterson (&lt;a href=&quot;http://www.amazon.com/Introduction-Decision-Cambridge-Introductions-Philosophy/dp/0521716543/&quot;&gt;2009&lt;/a&gt;, ch. 1) explains:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;In decision theory, everyday terms such as &lt;em&gt;risk&lt;/em&gt;, &lt;em&gt;ignorance&lt;/em&gt;, and &lt;em&gt;uncertainty&lt;/em&gt; are used as technical terms with precise meanings. In decisions under risk the decision maker knows the probability of the possible outcomes, whereas in decisions under ignorance the probabilities are either unknown or non-existent. Uncertainty is either used as a synonym for ignorance, or as a broader term referring to both risk and ignorance.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;In this FAQ, a &quot;decision under ignorance&quot; is one in which probabilities are &lt;em&gt;not&lt;/em&gt; assigned to all outcomes, and a &quot;decision under uncertainty&quot; is one in which probabilities &lt;em&gt;are&lt;/em&gt; assigned to all outcomes. The term &quot;risk&quot; will be reserved for discussions related to utility.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h2 id=&quot;how-should-i-make-decisions-under-ignorance&quot;&gt;&lt;a href=&quot;#how-should-i-make-decisions-under-ignorance&quot;&gt;6. How should I make decisions under ignorance?&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;A decision maker faces a &quot;decision under ignorance&quot; when she (1) knows which acts she could choose and which outcomes they may result in, but (2) is unable to assign probabilities to the outcomes.&lt;/p&gt;
&lt;p&gt;(Note that many theorists think that all decisions under ignorance can be transformed into decisions under uncertainty, in which case this section will be irrelevant except for subsection 6.1. For details, see section 7.)&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h3 id=&quot;the-dominance-principle&quot;&gt;&lt;a href=&quot;#the-dominance-principle&quot;&gt;6.1. The dominance principle&lt;/a&gt;&lt;/h3&gt;
&lt;p&gt;To borrow an example from Peterson (&lt;a href=&quot;http://www.amazon.com/Introduction-Decision-Cambridge-Introductions-Philosophy/dp/0521716543/&quot;&gt;2009&lt;/a&gt;, ch. 3), suppose that Jane isn't sure whether to order hamburger or monkfish at a new restaurant. Just about any chef can make an edible hamburger, and she knows that monkfish is fantastic if prepared by a world-class chef, but she also recalls that monkfish is difficult to cook. Unfortunately, she knows too little about this restaurant to assign any probability to the prospect of getting good monkfish. Her decision matrix might look like this:&lt;/p&gt;
&lt;table cellpadding=&quot;3&quot; cellspacing=&quot;5&quot; border=&quot;0&quot;&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&amp;#xA0;&lt;/td&gt;
&lt;td&gt;&lt;em&gt;Good chef&lt;/em&gt;&lt;/td&gt;
&lt;td&gt;&lt;em&gt;Bad chef&lt;/em&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;em&gt;Monkfish&lt;/em&gt;&lt;/td&gt;
&lt;td&gt;good monkfish&lt;/td&gt;
&lt;td&gt;terrible monkfish&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;em&gt;Hamburger&lt;/em&gt;&lt;/td&gt;
&lt;td&gt;edible hamburger&lt;/td&gt;
&lt;td&gt;edible hamburger&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;em&gt;No main course&lt;/em&gt;&lt;/td&gt;
&lt;td&gt;hungry&lt;/td&gt;
&lt;td&gt;hungry&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;
&lt;p&gt;Here, decision theorists would say that the &quot;hamburger&quot; choice &lt;em&gt;dominates&lt;/em&gt; the &quot;no main course&quot; choice. This is because choosing the hamburger leads to a better outcome for Jane no matter which possible state of the world (good chef or bad chef) turns out to be true.&lt;/p&gt;
&lt;p&gt;This &lt;em&gt;dominance principle&lt;/em&gt; comes in two forms:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;em&gt;Weak dominance&lt;/em&gt;: One act is &lt;em&gt;more&lt;/em&gt; rational than another if (1) all its possible outcomes are at least as good as those of the other, and if (2) there is at least one possible outcome that is better than that of the other act.&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Strong dominance&lt;/em&gt;: One act is &lt;em&gt;more&lt;/em&gt; rational than another if all of its possible outcome are better than that of the other act.&lt;/li&gt;
&lt;/ul&gt;
&lt;div class=&quot;figure&quot;&gt;&lt;img src=&quot;http://i.imgur.com/7fU6U.jpg&quot; alt=&quot;A comparison of strong and weak dominance&quot;&gt;
&lt;p class=&quot;caption&quot;&gt;A comparison of strong and weak dominance&lt;/p&gt;
&lt;/div&gt;
&lt;p&gt;The dominance principle can also be applied to decisions under uncertainty (in which probabilities &lt;em&gt;are&lt;/em&gt; assigned to all the outcomes). If we assign probabilities to outcomes, it is still rational to choose one act over another act if all its outcomes are at least as good as the outcomes of the other act.&lt;/p&gt;
&lt;p&gt;However, the dominance principle only applies (non-controversially) when the agent&amp;#x2019;s acts are independent of the state of the world. So consider the decision of whether to steal a coat:&lt;/p&gt;
&lt;table cellpadding=&quot;3&quot; cellspacing=&quot;5&quot; border=&quot;0&quot;&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&amp;#xA0;&lt;/td&gt;
&lt;td&gt;&lt;em&gt;Charged with theft&lt;/em&gt;&lt;/td&gt;
&lt;td&gt;&lt;em&gt;Not charged with theft&lt;/em&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;em&gt;Theft&lt;/em&gt;&lt;/td&gt;
&lt;td&gt;Jail and coat&lt;/td&gt;
&lt;td&gt;Freedom and coat&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;em&gt;No theft&lt;/em&gt;&lt;/td&gt;
&lt;td&gt;Jail&lt;/td&gt;
&lt;td&gt;Freedom&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;
&lt;p&gt;In this case, stealing the coat dominates not doing so but isn&amp;#x2019;t necessarily the rational decision. After all, stealing increases your chance of getting charged with theft and might be irrational for this reason. So dominance doesn&amp;#x2019;t apply in cases like this where the state of the world is not independent of the agents act.&lt;/p&gt;
&lt;p&gt;On top of this, not all decision problems include an act that dominates all the others. Consequently additional principles are often required to reach a decision.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h3 id=&quot;maximin-and-leximin&quot;&gt;&lt;a href=&quot;#maximin-and-leximin&quot;&gt;6.2. Maximin and leximin&lt;/a&gt;&lt;/h3&gt;
&lt;p&gt;Some decision theorists have suggested the &lt;em&gt;maximin principle&lt;/em&gt;: if the worst possible outcome of one act is better than the worst possible outcome of another act, then the former act should be chosen. In Jane's decision problem above, the maximin principle would prescribe choosing the hamburger, because the worst possible outcome of choosing the hamburger (&quot;edible hamburger&quot;) is better than the worst possible outcome of choosing the monkfish (&quot;terrible monkfish&quot;) and is also better than the worst possible outcome of eating no main course (&quot;hungry&quot;).&lt;/p&gt;
&lt;p&gt;If the worst outcomes of two or more acts are equally good, the maximin principle tells you to be indifferent between them. But that doesn't seem right. For this reason, fans of the maximin principle often invoke the &lt;em&gt;lexical&lt;/em&gt; maximin principle (&quot;leximin&quot;), which says that if the worst outcomes of two or more acts are equally good, one should choose the act for which the &lt;em&gt;second worst&lt;/em&gt; outcome is best. (If that doesn't single out a single act, then the &lt;em&gt;third worst&lt;/em&gt; outcome should be considered, and so on.)&lt;/p&gt;
&lt;p&gt;Why adopt the leximin principle? Advocates point out that the leximin principle transforms a decision problem under ignorance into a decision problem under partial certainty. The decision maker doesn't know what the outcome will be, but they know what the worst possible outcome will be.&lt;/p&gt;
&lt;p&gt;But in some cases, the leximin rule seems clearly irrational. Imagine this decision problem, with two possible acts and two possible states of the world:&lt;/p&gt;
&lt;table cellpadding=&quot;3&quot; cellspacing=&quot;5&quot; border=&quot;0&quot;&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&amp;#xA0;&lt;/td&gt;
&lt;td&gt;s&lt;sub&gt;1&lt;/sub&gt;&lt;/td&gt;
&lt;td&gt;s&lt;sub&gt;2&lt;/sub&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;a&lt;sub&gt;1&lt;/sub&gt;&lt;/td&gt;
&lt;td&gt;$1&lt;/td&gt;
&lt;td&gt;$10,001.01&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;a&lt;sub&gt;2&lt;/sub&gt;&lt;/td&gt;
&lt;td&gt;$1.01&lt;/td&gt;
&lt;td&gt;$1.01&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;
&lt;p&gt;In this situation, the leximin principle prescribes choosing a&lt;sub&gt;2&lt;/sub&gt;. But most people would agree it is rational to risk losing out on a single cent for the chance to get an extra $10,000.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h3 id=&quot;maximax-and-optimism-pessimism&quot;&gt;&lt;a href=&quot;#maximax-and-optimism-pessimism&quot;&gt;6.3. Maximax and optimism-pessimism&lt;/a&gt;&lt;/h3&gt;
&lt;p&gt;The maximin and leximin rules focus their attention on the worst possible outcomes of a decision, but why not focus on the &lt;em&gt;best&lt;/em&gt; possible outcome? The &lt;em&gt;maximax principle&lt;/em&gt; prescribes that if the best possible outcome of one act is better than the best possible outcome of another act, then the former act should be chosen.&lt;/p&gt;
&lt;p&gt;More popular among decision theorists is the &lt;em&gt;optimism-pessimism rule&lt;/em&gt; (&lt;em&gt;aka&lt;/em&gt; the &lt;em&gt;alpha-index rule&lt;/em&gt;). The optimism-pessimism rule prescribes that one consider both the best and worst possible outcome of each possible act, and then choose according to one's degree of optimism or pessimism.&lt;/p&gt;
&lt;p&gt;Here's an example from Peterson (&lt;a href=&quot;http://www.amazon.com/Introduction-Decision-Cambridge-Introductions-Philosophy/dp/0521716543/&quot;&gt;2009&lt;/a&gt;, ch. 3):&lt;/p&gt;
&lt;table cellpadding=&quot;3&quot; cellspacing=&quot;5&quot; border=&quot;0&quot;&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&amp;#xA0;&lt;/td&gt;
&lt;td&gt;s&lt;sub&gt;1&lt;/sub&gt;&lt;/td&gt;
&lt;td&gt;s&lt;sub&gt;2&lt;/sub&gt;&lt;/td&gt;
&lt;td&gt;s&lt;sub&gt;3&lt;/sub&gt;&lt;/td&gt;
&lt;td&gt;s&lt;sub&gt;4&lt;/sub&gt;&lt;/td&gt;
&lt;td&gt;s&lt;sub&gt;5&lt;/sub&gt;&lt;/td&gt;
&lt;td&gt;s&lt;sub&gt;6&lt;/sub&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;a&lt;sub&gt;1&lt;/sub&gt;&lt;/td&gt;
&lt;td&gt;55&lt;/td&gt;
&lt;td&gt;18&lt;/td&gt;
&lt;td&gt;28&lt;/td&gt;
&lt;td&gt;10&lt;/td&gt;
&lt;td&gt;36&lt;/td&gt;
&lt;td&gt;100&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;a&lt;sub&gt;2&lt;/sub&gt;&lt;/td&gt;
&lt;td&gt;50&lt;/td&gt;
&lt;td&gt;87&lt;/td&gt;
&lt;td&gt;55&lt;/td&gt;
&lt;td&gt;90&lt;/td&gt;
&lt;td&gt;75&lt;/td&gt;
&lt;td&gt;70&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;
&lt;p&gt;We represent the decision maker's level of optimism on a scale of 0 to 1, where 0 is maximal pessimism and 1 is maximal optimism. For a&lt;sub&gt;1&lt;/sub&gt;, the worst possible outcome is 10 and the best possible outcome is 100. That is, min(a&lt;sub&gt;1&lt;/sub&gt;) = 10 and max(a&lt;sub&gt;1&lt;/sub&gt;) = 100. So if the decision maker is 0.85 optimistic, then the total value of a&lt;sub&gt;1&lt;/sub&gt; is (0.85)(100) + (1 - 0.85)(10) = 86.5, and the total value of a&lt;sub&gt;2&lt;/sub&gt; is (0.85)(90) + (1 - 0.85)(50) = 84. In this situation, the optimism-pessimism rule prescribes action a&lt;sub&gt;1&lt;/sub&gt;.&lt;/p&gt;
&lt;p&gt;If the decision maker's optimism is 0, then the optimism-pessimism rule collapses into the maximin rule because (0)(max(a&lt;sub&gt;i&lt;/sub&gt;)) + (1 - 0)(min(a&lt;sub&gt;i&lt;/sub&gt;)) = min(a&lt;sub&gt;i&lt;/sub&gt;). And if the decision maker's optimism is 1, then the optimism-pessimism rule collapses into the maximax rule. Thus, the optimism-pessimism rule turns out to be a generalization of the maximin and maximax rules. (Well, sort of. The minimax and maximax principles require only that we measure value on an ordinal scale, whereas the optimism-pessimism rule requires that we measure value on an interval scale.)&lt;/p&gt;
&lt;p&gt;The optimism-pessimism rule pays attention to both the best-case and worst-case scenarios, but is it rational to ignore all the outcomes in between? Consider this example:&lt;/p&gt;
&lt;table cellpadding=&quot;3&quot; cellspacing=&quot;5&quot; border=&quot;0&quot;&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&amp;#xA0;&lt;/td&gt;
&lt;td&gt;s&lt;sub&gt;1&lt;/sub&gt;&lt;/td&gt;
&lt;td&gt;s&lt;sub&gt;2&lt;/sub&gt;&lt;/td&gt;
&lt;td&gt;s&lt;sub&gt;3&lt;/sub&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;a&lt;sub&gt;1&lt;/sub&gt;&lt;/td&gt;
&lt;td&gt;1&lt;/td&gt;
&lt;td&gt;2&lt;/td&gt;
&lt;td&gt;100&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;a&lt;sub&gt;2&lt;/sub&gt;&lt;/td&gt;
&lt;td&gt;1&lt;/td&gt;
&lt;td&gt;99&lt;/td&gt;
&lt;td&gt;100&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;
&lt;p&gt;The maximum and minimum values for a&lt;sub&gt;1&lt;/sub&gt; and a&lt;sub&gt;2&lt;/sub&gt; are the same, so for every degree of optimism both acts are equally good. But it seems obvious that one should choose a&lt;sub&gt;2&lt;/sub&gt;.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h3 id=&quot;other-decision-principles&quot;&gt;&lt;a href=&quot;#other-decision-principles&quot;&gt;6.4. Other decision principles&lt;/a&gt;&lt;/h3&gt;
&lt;p&gt;Many other decision principles for dealing with decisions under ignorance have been proposed, including &lt;a href=&quot;http://teaching.ust.hk/~bee/papers/misc/Regret%20Theory%20An%20Alternative%20Theory%20of%20Rational%20Choice%20Under%20Uncertainty.pdf&quot;&gt;minimax regret&lt;/a&gt;, &lt;a href=&quot;http://www.amazon.com/Info-Gap-Decision-Theory-Second-Edition/dp/0123735521/&quot;&gt;info-gap&lt;/a&gt;, and &lt;a href=&quot;http://www.existential-risk.org/concept.pdf&quot;&gt;maxipok&lt;/a&gt;. For more details on making decisions under ignorance, see &lt;a href=&quot;http://www.amazon.com/Introduction-Decision-Cambridge-Introductions-Philosophy/dp/0521888379/&quot;&gt;Peterson (2009)&lt;/a&gt; and &lt;a href=&quot;http://www.dss.dpem.tuc.gr/pdf/Choice%20under%20complete%20uncertainty%20-%20axiomatic%20characterizati.pdf&quot;&gt;Bossert et al. (2000)&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;One queer feature of the decision principles discussed in this section is that they willfully disregard some information relevant to making a decision. Such a move could make sense when trying to find a decision algorithm that performs well under tight limits on available computation (&lt;a href=&quot;http://www.dss.dpem.tuc.gr/pdf/An%20axiomatic%20treatment%20of%20three%20qualitative%20decision%20criteri.pdf&quot;&gt;Brafman &amp;amp; Tennenholtz (2000)&lt;/a&gt;), but it's unclear why an &lt;em&gt;ideal&lt;/em&gt; agent with infinite computing power (fit for a &lt;em&gt;normative&lt;/em&gt; rather than a &lt;em&gt;prescriptive&lt;/em&gt; theory) should willfully disregard information.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h2 id=&quot;can-decisions-under-ignorance-be-transformed-into-decisions-under-uncertainty&quot;&gt;&lt;a href=&quot;#can-decisions-under-ignorance-be-transformed-into-decisions-under-uncertainty&quot;&gt;7. Can decisions under ignorance be transformed into decisions under uncertainty?&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Can decisions under ignorance be transformed into decisions under uncertainty? This would simplify things greatly, because there is near-universal agreement that decisions under uncertainty should be handled by &quot;maximizing expected utility&quot; (see section 11 for clarifications), whereas decision theorists still debate what should be done about decisions under ignorance.&lt;/p&gt;
&lt;p&gt;For &lt;a href=&quot;http://en.wikipedia.org/wiki/Bayesian_probability&quot;&gt;Bayesians&lt;/a&gt; (see section 10), &lt;em&gt;all&lt;/em&gt; decisions under ignorance are transformed into decisions under uncertainty (&lt;a href=&quot;http://www.amazon.com/Introduction-Bayesian-Inference-Decision-Edition/dp/0964793849/&quot;&gt;Winkler 2003&lt;/a&gt;, ch. 5) when the decision maker assigns an &quot;ignorance prior&quot; to each outcome for which they don't know how to assign a probability. (Another way of saying this is to say that a Bayesian decision maker never faces a decision under ignorance, because a Bayesian must always assign a prior probability to events.) One must then consider how to assign priors, an important debate among Bayesians (see section 10).&lt;/p&gt;
&lt;p&gt;Many non-Bayesian decision theorists also think that decisions under ignorance can be transformed into decisions under uncertainty due to something called the &lt;em&gt;principle of insufficient reason&lt;/em&gt;. The principle of insufficient reason prescribes that if you have literally &lt;em&gt;no&lt;/em&gt; reason to think that one state is more probable than another, then one should assign &lt;em&gt;equal&lt;/em&gt; probability to both states.&lt;/p&gt;
&lt;p&gt;One objection to the principle of insufficient reason is that it is very sensitive to how states are individuated. Peterson (&lt;a href=&quot;http://www.amazon.com/Introduction-Decision-Cambridge-Introductions-Philosophy/dp/0521716543/&quot;&gt;2009&lt;/a&gt;, ch. 3) explains:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Suppose that before embarking on a trip you consider whether to bring an umbrella or not. [But] you know nothing about the weather at your destination. If the formalization of the decision problem is taken to include only two states, viz. rain and no rain, [then by the principle of insufficient reason] the probability of each state will be 1/2. However, it seems that one might just as well go for a formalization that divides the space of possibilities into three states, viz. heavy rain, moderate rain, and no rain. If the principle of insufficient reason is applied to the latter set of states, their probabilities will be 1/3. In some cases this difference will affect our decisions. Hence, it seems that anyone advocating the principle of insufficient reason must [defend] the rather implausible hypothesis that there is only one correct way of making up the set of states.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;div class=&quot;figure&quot;&gt;&lt;img src=&quot;http://i.imgur.com/kXn03.jpg&quot; alt=&quot;An objection to the principle of insufficient reason&quot;&gt;
&lt;p class=&quot;caption&quot;&gt;An objection to the principle of insufficient reason&lt;/p&gt;
&lt;/div&gt;
&lt;p&gt;Advocates of the principle of insufficient reason might respond that one must consider &lt;em&gt;symmetric&lt;/em&gt; states. For example if someone gives you a die with &lt;em&gt;n&lt;/em&gt; sides and you have no reason to think the die is biased, then you should assign a probability of 1/&lt;em&gt;n&lt;/em&gt; to each side. But, Peterson notes:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;...not all events can be described in symmetric terms, at least not in a way that justifies the conclusion that they are equally probable. Whether Ann's marriage will be a happy one depends on her future emotional attitude toward her husband. According to one description, she could be either in love or not in love with him; then the probability of both states would be 1/2. According to another equally plausible description, she could either be deeply in love, a little bit in love or not at all in love with her husband; then the probability of each state would be 1/3.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h2 id=&quot;how-should-i-make-decisions-under-uncertainty&quot;&gt;&lt;a href=&quot;#how-should-i-make-decisions-under-uncertainty&quot;&gt;8. How should I make decisions under uncertainty?&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;A decision maker faces a &quot;decision under uncertainty&quot; when she (1) knows which acts she could choose and which outcomes they may result in, and she (2) assigns probabilities to the outcomes.&lt;/p&gt;
&lt;p&gt;Decision theorists generally agree that when facing a decision under uncertainty, it is rational to choose the act with the highest expected utility. This is the principle of &lt;em&gt;expected utility maximization&lt;/em&gt; (EUM).&lt;/p&gt;
&lt;p&gt;Decision theorists offer two kinds of justifications for EUM. The first has to do with the law of large numbers (see section 8.1). The second has to do with the axiomatic approach (see sections 8.2 through 8.6).&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h3 id=&quot;the-law-of-large-numbers&quot;&gt;&lt;a href=&quot;#the-law-of-large-numbers&quot;&gt;8.1. The law of large numbers&lt;/a&gt;&lt;/h3&gt;
&lt;p&gt;The &quot;law of large numbers,&quot; which states that &lt;em&gt;in the long run&lt;/em&gt;, if you face the same decision problem again and again and again, and you always choose the act with the highest expected utility, then you will almost certainly be better off than if you choose any other acts.&lt;/p&gt;
&lt;p&gt;There are two problems with using the law of large numbers to justify EUM. The first problem is that the world is ever-changing, so we rarely if ever face the same decision problem &quot;again and again and again.&quot; The law of large numbers says that if you face the same decision problem infinitely many times, then the probability that you could do better by not maximizing expected utility approaches zero. But you won't ever face the same decision problem infinitely many times! Why should you care what would happen if a certain condition held, if you know that condition will never hold?&lt;/p&gt;
&lt;p&gt;The second problem with using the law of large numbers to justify EUM has to do with a mathematical theorem known as &lt;em&gt;gambler's ruin&lt;/em&gt;. Imagine that you and I flip a fair coin, and I pay you $1 every time it comes up heads and you pay me $1 every time it comes up tails. We both start with $100. If we flip the coin enough times, one of us will face a situation in which the sequence of heads or tails is longer than we can afford. If a long-enough sequence of heads comes up, I'll run out of $1 bills with which to pay you. If a long-enough sequence of tails comes up, you won't be able to pay me. So in this situation, the law of large numbers guarantees that you will be better off in the long run by maximizing expected utility only if you start the game with an infinite amount of money (so that you never go broke), which is an unrealistic assumption. (For technical convenience, assume utility increases linearly with money. But the basic point holds without this assumption.)&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h3 id=&quot;the-axiomatic-approach&quot;&gt;&lt;a href=&quot;#the-axiomatic-approach&quot;&gt;8.2. The axiomatic approach&lt;/a&gt;&lt;/h3&gt;
&lt;p&gt;The other method for justifying EUM seeks to show that EUM can be derived from axioms that hold regardless of what happens in the long run.&lt;/p&gt;
&lt;p&gt;In this section we will review perhaps the most famous axiomatic approach, from &lt;a href=&quot;http://www.amazon.com/Economic-Behavior-Commemorative-Princeton-Editions/dp/0691130612/&quot;&gt;Von Neumann and Morgenstern (1947)&lt;/a&gt;. Other axiomatic approaches include &lt;a href=&quot;http://www.amazon.com/The-Foundations-Statistics-Leonard-Savage/dp/0486623491/&quot;&gt;Savage (1954)&lt;/a&gt;, &lt;a href=&quot;http://www.amazon.com/The-Logic-Decision-Richard-Jeffrey/dp/0226395820/&quot;&gt;Jeffrey (1983)&lt;/a&gt;, and &lt;a href=&quot;http://pages.stern.nyu.edu/~dbackus/Exotic/1Ambiguity/AnscombeAumann%20AMS%2063.pdf&quot;&gt;Anscombe &amp;amp; Aumann (1963)&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h3 id=&quot;the-von-neumann-morgenstern-utility-theorem&quot;&gt;&lt;a href=&quot;#the-von-neumann-morgenstern-utility-theorem&quot;&gt;8.3. The Von Neumann-Morgenstern utility theorem&lt;/a&gt;&lt;/h3&gt;
&lt;p&gt;The first decision theory axiomatization appeared in an appendix to the second edition of Von Neumann &amp;amp; Morgenstern's &lt;em&gt;&lt;a href=&quot;http://www.amazon.com/Economic-Behavior-Commemorative-Princeton-Editions/dp/0691130612/&quot;&gt;Theory of Games and Economic Behavior&lt;/a&gt;&lt;/em&gt; (1947). An important point to note up front is that, in this axiomatization, Von Neumann and Morgenstern take the options that the agent chooses between to not be acts, as we&amp;#x2019;ve defined them, but lotteries (where a lottery is a set of outcomes, each paired with a probability). As such, while discussing their axiomatization, we will talk of lotteries. (Despite making this distinction, acts and lotteries are closely related. Under the conditions of uncertainty that we are considering here, each act will be associated with some lottery and so preferences over lotteries could be used to determine preferences over acts, if so desired).&lt;/p&gt;
&lt;p&gt;The key feature of the Von Neumann and Morgenstern axiomatization is a proof that if a decision maker states her preferences over a set of lotteries, and if her preferences conform to a set of intuitive structural constraints (axioms), then we can construct a utility function (on an interval scale) from her preferences over lotteries and show that she acts &lt;em&gt;as if&lt;/em&gt; she maximizes expected utility with respect to that utility function.&lt;/p&gt;
&lt;p&gt;What are the axioms to which an agent's preferences over lotteries must conform? There are four of them.&lt;/p&gt;
&lt;ol style=&quot;list-style-type: decimal&quot;&gt;
&lt;li&gt;
&lt;p&gt;The &lt;em&gt;completeness axiom&lt;/em&gt; states that the agent must &lt;em&gt;bother to state a preference&lt;/em&gt; for each pair of lotteries. That is, the agent must prefer A to B, or prefer B to A, or be indifferent between the two.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;The &lt;em&gt;transitivity axiom&lt;/em&gt; states that if the agent prefers A to B and B to C, she must also prefer A to C.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;The &lt;em&gt;independence axiom&lt;/em&gt; states that, for example, if an agent prefers an apple to an orange, then she must also prefer the lottery [55% chance she gets an apple, otherwise she gets cholera] over the lottery [55% chance she gets an orange, otherwise she gets cholera]. More generally, this axiom holds that a preference must hold independently of the possibility of another outcome (e.g. cholera).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;The &lt;em&gt;continuity axiom&lt;/em&gt; holds that if the agent prefers A to B to C, then there exists a unique &lt;em&gt;p&lt;/em&gt; (probability) such that the agent is indifferent between [&lt;em&gt;p&lt;/em&gt;(A) + (1 - &lt;em&gt;p&lt;/em&gt;)(C)] and [outcome B with certainty].&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;The continuity axiom requires &lt;a href=&quot;http://www.youtube.com/watch?v=hSUsiA8dhKM&quot;&gt;more explanation&lt;/a&gt;. Suppose that A = $1 million, B = $0, and C = Death. If &lt;em&gt;p&lt;/em&gt; = 0.5, then the agent's two lotteries under consideration for the moment are:&lt;/p&gt;
&lt;ol style=&quot;list-style-type: decimal&quot;&gt;
&lt;li&gt;(0.5)($1M) + (1 - 0.5)(Death) [win $1M with 50% probability, die with 50% probability]&lt;/li&gt;
&lt;li&gt;(1)($0) [win $0 with certainty]&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;Most people would &lt;em&gt;not&lt;/em&gt; be indifferent between $0 with certainty and [50% chance of $1M, 50% chance of Death] &amp;#x2014; the risk of Death is too high! But if you have continuous preferences, there is &lt;em&gt;some&lt;/em&gt; probability &lt;em&gt;p&lt;/em&gt; for which you'd be indifferent between these two lotteries. Perhaps &lt;em&gt;p&lt;/em&gt; is very, very high:&lt;/p&gt;
&lt;ol style=&quot;list-style-type: decimal&quot;&gt;
&lt;li&gt;(0.999999)($1M) + (1 - 0.999999)(Death) [win $1M with 99.9999% probability, die with 0.0001% probability]&lt;/li&gt;
&lt;li&gt;(1)($0) [win $0 with certainty]&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;Perhaps now you'd be indifferent between lottery 1 and lottery 2. Or maybe you'd be &lt;em&gt;more&lt;/em&gt; willing to risk Death for the chance of winning $1M, in which case the &lt;em&gt;p&lt;/em&gt; for which you'd be indifferent between lotteries 1 and 2 is lower than 0.999999. As long as there is &lt;em&gt;some&lt;/em&gt; &lt;em&gt;p&lt;/em&gt; at which you'd be indifferent between lotteries 1 and 2, your preferences are &quot;continuous.&quot;&lt;/p&gt;
&lt;p&gt;Given this setup, Von Neumann and Morgenstern proved their theorem, which states that if the agent's preferences over lotteries obeys their axioms, then:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;The agent's preferences can be represented by a utility function that assigns higher utility to preferred lotteries.&lt;/li&gt;
&lt;li&gt;The agent acts in accordance with the principle of maximizing expected utility.&lt;/li&gt;
&lt;li&gt;All utility functions satisfying the above two conditions are &quot;positive linear transformations&quot; of each other. (Without going into the details: this is why VNM-utility is measured on an interval scale.)&lt;/li&gt;
&lt;/ul&gt;
&lt;h3 id=&quot;vnm-utility-theory-and-rationality&quot;&gt;&lt;br&gt;&lt;/h3&gt;
&lt;h3&gt;&lt;a href=&quot;#vnm-utility-theory-and-rationality&quot;&gt;8.4. VNM utility theory and rationality&lt;/a&gt;&lt;/h3&gt;
&lt;p&gt;An agent which conforms to the VNM axioms is sometimes said to be &quot;VNM-rational.&quot; But why should &quot;VNM-rationality&quot; constitute our notion of &lt;em&gt;rationality in general&lt;/em&gt;? How could VNM's result justify the claim that a rational agent maximizes expected utility when facing a decision under uncertainty? The argument goes like this:&lt;/p&gt;
&lt;ol style=&quot;list-style-type: decimal&quot;&gt;
&lt;li&gt;If an agent chooses lotteries which it prefers (in decisions under uncertainty), and if its preferences conform to the VNM axioms, then it is rational. Otherwise, it is irrational.&lt;/li&gt;
&lt;li&gt;If an agent chooses lotteries which it prefers (in decisions under uncertainty), and if its preferences conform to the VNM axioms, then it maximizes expected utility.&lt;/li&gt;
&lt;li&gt;Therefore, a rational agent maximizes expected utility (in decisions under uncertainty).&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;Von Neumann and Morgenstern proved premise 2, and the conclusion follows from premise 1 and 2. But why accept premise 1?&lt;/p&gt;
&lt;p&gt;Few people deny that it would be irrational for an agent to choose a lottery which it does not prefer. But why is it irrational for an agent's preferences to violate the VNM axioms? I will save that discussion for section 8.6.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h3 id=&quot;objections-to-vnm-rationality&quot;&gt;&lt;a href=&quot;#objections-to-vnm-rationality&quot;&gt;8.5. Objections to VNM-rationality&lt;/a&gt;&lt;/h3&gt;
&lt;p&gt;Several objections have been raised to Von Neumann and Morgenstern's result:&lt;/p&gt;
&lt;ol style=&quot;list-style-type: decimal&quot;&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;The VNM axioms are too strong&lt;/em&gt;. Some have argued that the VNM axioms are not self-evidently true. See section 8.6.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;The VNM system offers no action guidance&lt;/em&gt;. A VNM-rational decision maker cannot use VNM utility theory for action guidance, because she must state her preferences over lotteries at the start. But if an agent can state her preferences over lotteries, then she already knows which lottery to choose. (For more on this, see section 9.)&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;In the VNM system, utility is defined via preferences over lotteries rather than preferences over outcomes&lt;/em&gt;. To many, it seems odd to &lt;em&gt;define&lt;/em&gt; utility with respect to preferences over lotteries. Many would argue that utility should be defined in relation to preferences over &lt;em&gt;outcomes&lt;/em&gt; or &lt;em&gt;world-states&lt;/em&gt;, and that's not what the VNM system does. (Also see section 9.)&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;h3 id=&quot;should-we-accept-the-vnm-axioms&quot;&gt;&lt;br&gt;&lt;/h3&gt;
&lt;h3&gt;&lt;a href=&quot;#should-we-accept-the-vnm-axioms&quot;&gt;8.6. Should we accept the VNM axioms?&lt;/a&gt;&lt;/h3&gt;
&lt;p&gt;The VNM preference axioms define what it is for an agent to be VNM-rational. But why should we accept these axioms? Usually, it is argued that each of the axioms are &lt;em&gt;pragmatically justified&lt;/em&gt; because an agent which violates the axioms can face situations in which they are guaranteed end up worse off (from &lt;em&gt;their own&lt;/em&gt; perspective).&lt;/p&gt;
&lt;p&gt;In sections 8.6.1 and 8.6.2 I go into some detail about pragmatic justifications offered for the transitivity and completeness axioms. For more detail, including arguments about the justification of the other axioms, see &lt;a href=&quot;http://www.amazon.com/Introduction-Decision-Cambridge-Introductions-Philosophy/dp/0521888379/&quot;&gt;Peterson (2009, ch. 8)&lt;/a&gt; and &lt;a href=&quot;http://www.amazon.com/Foundations-Rational-Choice-Under-Risk/dp/0198774427/&quot;&gt;Anand (1993)&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4 id=&quot;the-transitivity-axiom&quot;&gt;&lt;a href=&quot;#the-transitivity-axiom&quot;&gt;8.6.1. The transitivity axiom&lt;/a&gt;&lt;/h4&gt;
&lt;p&gt;Consider the &lt;em&gt;money-pump argument&lt;/em&gt; in favor of the transitivity axiom (&quot;if the agent prefers A to B and B to C, she must also prefer A to C&quot;).&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Imagine that a friend offers to give you exactly one of her three... novels, x or y or z... [and] that your preference ordering over the three novels is... [that] you prefer x to y, and y to z, and z to x... [That is, your preferences are &lt;em&gt;cyclic&lt;/em&gt;, which is a type of &lt;em&gt;intransitive&lt;/em&gt; preference relation.] Now suppose that you are in possession of z, and that you are invited to swap z for y. Since you prefer y to z, rationality obliges you to swap. So you swap, and temporarily get y. You are then invited to swap y for x, which you do, since you prefer x to y. Finally, you are offered to &lt;em&gt;pay a small amount&lt;/em&gt;, say one cent, for swapping x for z. Since z is strictly [preferred to] x, even after you have paid the fee for swapping, rationality tells you that you should accept the offer. This means that you end up where you started, the only difference being that you now have one cent less. This procedure is thereafter iterated over and over again. After a billion cycles you have lost ten million dollars, for which you have got nothing in return. (&lt;a href=&quot;http://www.amazon.com/Introduction-Decision-Cambridge-Introductions-Philosophy/dp/0521716543/&quot;&gt;Peterson 2009&lt;/a&gt;, ch. 8)&lt;/p&gt;
&lt;/blockquote&gt;
&lt;div class=&quot;figure&quot;&gt;&lt;img src=&quot;http://i.imgur.com/45csd.jpg&quot; alt=&quot;An example of a money-pump argument&quot;&gt;
&lt;p class=&quot;caption&quot;&gt;An example of a money-pump argument&lt;/p&gt;
&lt;/div&gt;
&lt;p&gt;Similar arguments (e.g. &lt;a href=&quot;http://johanegustafsson.net/papers/a_money-pump_for_acyclic_intransitive_preferences.pdf&quot;&gt;Gustafsson 2010&lt;/a&gt;) aim to show that the other kind of intransitive preferences (acyclic preferences) are irrational, too.&lt;/p&gt;
&lt;p&gt;(Of course, pragmatic arguments need not be framed in monetary terms. We could just as well construct an argument showing that an agent with intransitive preferences can be &quot;pumped&quot; of all their happiness, or all their moral virtue, or all their Twinkies.)&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4 id=&quot;the-completeness-axiom&quot;&gt;&lt;a href=&quot;#the-completeness-axiom&quot;&gt;8.6.2. The completeness axiom&lt;/a&gt;&lt;/h4&gt;
&lt;p&gt;The completeness axiom (&quot;the agent must prefer A to B, or prefer B to A, or be indifferent between the two&quot;) is often attacked by saying that some goods or outcomes are incommensurable &amp;#x2014; that is, they cannot be compared. For example, must a rational agent be able to state a preference (or indifference) between money and human welfare?&lt;/p&gt;
&lt;p&gt;Perhaps the completeness axiom can be justified with a pragmatic argument. If you think it is rationally permissible to swap between two incommensurable goods, then one can construct a money pump argument in favor of the completeness axiom. But if you think it is &lt;em&gt;not&lt;/em&gt; rational to swap between incommensurable goods, then one cannot construct a money pump argument for the completeness axiom. (In fact, even if it is rational to swap between incommensurable goods, &lt;a href=&quot;http://personal.rhul.ac.uk/uhte/035/incomplete%20preferences.geb.pdf&quot;&gt;Mandler, 2005&lt;/a&gt; has demonstrated that an agent that allows their current choices to depend on the previous ones can avoid being money pumped.)&lt;/p&gt;
&lt;p&gt;And in fact, there is a popular argument &lt;em&gt;against&lt;/em&gt; the completeness axiom: the &quot;small improvement argument.&quot; For details, see &lt;a href=&quot;http://www.amazon.com/Incommensurability-Incomparability-Practical-Reason-Chang/dp/0674447565/&quot;&gt;Chang (1997)&lt;/a&gt; and &lt;a href=&quot;https://www.msb.se/Upload/Om%20MSB/Forskning/Projektrapporter/Peterson_artiklar/Small_Improvment_Argument.pdf&quot;&gt;Espinoza (2007)&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;Note that in &lt;a href=&quot;http://en.wikipedia.org/wiki/Revealed_preference&quot;&gt;revealed preference theory&lt;/a&gt;, according to which preferences are revealed through choice behavior, there is no room for incommensurable preferences because every choice always reveals a preference relation of &quot;better than,&quot; &quot;worse than,&quot; or &quot;equally as good as.&quot;&lt;/p&gt;
&lt;p&gt;Another proposal for dealing with the apparent incommensurability of some goods (such as money and human welfare) is the &lt;em&gt;multi-attribute approach&lt;/em&gt;:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;In a multi-attribute approach, each type of attribute is measured in the unit deemed to be most suitable for that attribute. Perhaps money is the right unit to use for measuring financial costs, whereas the number of lives saved is the right unit to use for measuring human welfare. The total value of an alternative is thereafter determined by aggregating the attributes, e.g. money and lives, into an overall ranking of available alternatives...&lt;/p&gt;
&lt;/blockquote&gt;
&lt;blockquote&gt;
&lt;p&gt;Several criteria have been proposed for choosing among alternatives with multiple attributes... [For example,] additive criteria assign weights to each attribute, and rank alternatives according to the weighted sum calculated by multiplying the weight of each attribute with its value... [But while] it is perhaps contentious to measure the utility of very different objects on a common scale, ...it seems equally contentious to assign numerical weights to attributes as suggested here....&lt;/p&gt;
&lt;/blockquote&gt;
&lt;blockquote&gt;
&lt;p&gt;[Now let us] consider a very general objection to multi-attribute approaches. According to this objection, there exist several equally plausible but different ways of constructing the list of attributes. Sometimes the outcome of the decision process depends on which set of attributes is chosen. (&lt;a href=&quot;http://www.amazon.com/Introduction-Decision-Cambridge-Introductions-Philosophy/dp/0521716543/&quot;&gt;Peterson 2009&lt;/a&gt;, ch. 8)&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;For more on the multi-attribute approach, see &lt;a href=&quot;http://www.amazon.com/Decisions-Multiple-Objectives-Preferences-Tradeoffs/dp/0521438837/&quot;&gt;Keeney &amp;amp; Raiffa (1993)&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4 id=&quot;the-allais-paradox&quot;&gt;&lt;a href=&quot;#the-allais-paradox&quot;&gt;8.6.3. The Allais paradox&lt;/a&gt;&lt;/h4&gt;
&lt;p&gt;Having considered the transitivity and completeness axioms, we can now turn to independence (a preference holds independently of considerations of other possible outcomes). Do we have any reason to reject this axiom? Here&amp;#x2019;s one reason to think we might: in a case known as the &lt;em&gt;Allais paradox&lt;/em&gt; &lt;a href=&quot;http://www.jstor.org/stable/1907921&quot;&gt;Allais (1953)&lt;/a&gt; it may seem reasonable to act in a way that contradicts independence.&lt;/p&gt;
&lt;p&gt;The Allais paradox asks us to consider two decisions (this version of the paradox is based on &lt;a href=&quot;/lw/my/the_allais_paradox/&quot;&gt;Yudkowsky (2008)&lt;/a&gt;).The first decision involves the choice between:&lt;/p&gt;
&lt;p&gt;(1A) A certain $24,000; and (1B) A 33/34 chance of $27,000 and a 1/34 chance of nothing.&lt;/p&gt;
&lt;p&gt;The second involves the choice between:&lt;/p&gt;
&lt;p&gt;(2A) A 34% chance of $24, 000 and a 66% chance of nothing; and (2B) A 33% chance of $27, 000 and a 67% chance of nothing.&lt;/p&gt;
&lt;p&gt;Experiments have shown that many people prefer (1A) to (1B) and (2B) to (2A). However, these preferences contradict independence. Option 2A is the same as [a 34% chance of option 1A and a 66% chance of nothing] while 2B is the same as [a 34% chance of option 1B and a 66% chance of nothing]. So independence implies that anyone that prefers (1A) to (1B) must also prefer (2A) to (2B).&lt;/p&gt;
&lt;p&gt;When this result was first uncovered, it was presented as evidence against the independence axiom. However, while the Allais paradox clearly reveals that independence fails as a &lt;em&gt;descriptive&lt;/em&gt; account of choice, it&amp;#x2019;s less clear what it implies about the normative account of rational choice that we are discussing in this document. As noted in &lt;a href=&quot;http://www.amazon.com/Introduction-Decision-Cambridge-Introductions-Philosophy/dp/0521716543/&quot;&gt;Peterson (2009, ch. 4)&lt;/a&gt;, however:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;[S]ince many people who have thought very hard about this example still feel that it would be rational to stick to the problematic preference pattern described above, there seems to be something wrong with the expected utility principle.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;However, Peterson then goes on to note that, many people, like the statistician Leonard Savage, argue that it is people&amp;#x2019;s preference in the Allais paradox that are in error rather than the independence axiom. If so, then the paradox seems to reveal the danger of relying too strongly on intuition to determine the form that should be taken by normative theories of rational.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4 id=&quot;the-ellsberg-paradox&quot;&gt;&lt;a href=&quot;#the-ellsberg-paradox&quot;&gt;8.6.4. The Ellsberg paradox&lt;/a&gt;&lt;/h4&gt;
&lt;p&gt;The Allais paradox is far from the only case where people fail to act in accordance with EUM. Another well-known case is the Ellsberg paradox (the following is taken from &lt;a href=&quot;http://www.amazon.com/Choices-An-Introduction-Decision-Theory/dp/0816614407/&quot;&gt;Resnik (1987)&lt;/a&gt;:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;An urn contains ninety uniformly sized balls, which are randomly distributed. Thirty of the balls are yellow, the remaining sixty are red or blue. We are not told how many red (blue) balls are in the urn &amp;#x2013; except that they number anywhere from zero to sixty. Now consider the following pair of situations. In each situation a ball will be drawn and we will be offered a bet on its color. In situation A we will choose between betting that it is yellow or that it is red. In situation B we will choose between betting that it is red or blue or that it is yellow or blue.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;If we guess the correct color, we will receive a payout of $100. In the Ellsberg paradox, many people bet &lt;em&gt;yellow&lt;/em&gt; in situation A and &lt;em&gt;red or blue&lt;/em&gt; in situation B. Further, many people make these decisions not because they are indifferent in both situations, and so happy to choose either way, but rather because they have a strict preference to choose in this manner.&lt;/p&gt;
&lt;div class=&quot;figure&quot;&gt;&lt;img src=&quot;http://i.imgur.com/tZKOsHx.jpg&quot; alt=&quot;The Ellsberg paradox&quot;&gt;
&lt;p class=&quot;caption&quot;&gt;The Ellsberg paradox&lt;/p&gt;
&lt;/div&gt;
&lt;p&gt;However, such behavior cannot be in accordance with EUM. In order for EUM to endorse a strict preference for choosing &lt;em&gt;yellow&lt;/em&gt; in situation A, the agent would have to assign a probability of more than 1/3 to the ball selected being blue. On the other hand, in order for EUM to endorse a strict preference for choosing &lt;em&gt;red or blue&lt;/em&gt; in situation B the agent would have to assign a probability of less than 1/3 to the selected ball being blue. As such, these decisions can&amp;#x2019;t be jointly endorsed by an agent following EUM.&lt;/p&gt;
&lt;p&gt;Those who deny that decisions making under ignorance can be transformed into decision making under uncertainty have an easy response to the Ellsberg paradox: as this case involves deciding under a situation of ignorance, it is irrelevant whether people&amp;#x2019;s decisions violate EUM in this case as EUM is not applicable to such situations.&lt;/p&gt;
&lt;p&gt;Those who believe that EUM provides a suitable standard for choice in such situations, however, need to find some other way of responding to the paradox. As with the Allais paradox, there is some disagreement about how best to do so. Once again, however, many people, including Leonard Savage, argue that EUM reaches the right decision in this case. It is our intuitions that are flawed (see again &lt;a href=&quot;http://www.amazon.com/Choices-An-Introduction-Decision-Theory/dp/0816614407/&quot;&gt;Resnik (1987)&lt;/a&gt; for a nice summary of Savage&amp;#x2019;s argument to this conclusion).&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4 id=&quot;the-st-petersburg-paradox&quot;&gt;&lt;a href=&quot;#the-st-petersburg-paradox&quot;&gt;8.6.5. The St Petersburg paradox&lt;/a&gt;&lt;/h4&gt;
&lt;p&gt;Another objection to the VNM approach (and to expected utility approaches generally), the &lt;a href=&quot;http://en.wikipedia.org/wiki/St._Petersburg_paradox&quot;&gt;St. Petersburg paradox&lt;/a&gt;, draws on the possibility of infinite utilities. The St. Petersburg paradox is based around a game where a fair coin is tossed until it lands heads up. At this point, the agent receives a prize worth 2&lt;sup&gt;n&lt;/sup&gt; utility, where &lt;em&gt;n&lt;/em&gt; is equal to the number of times the coin was tossed during the game. The so-called paradox occurs because the expected utility of choosing to play this game is infinite and so, according to a standard expected utility approach, the agent should be willing to pay any finite amount to play the game. However, this seems unreasonable. Instead, it seems that the agent should only be willing to pay a relatively small amount to do so. As such, it seems that the expected utility approach gets something wrong.&lt;/p&gt;
&lt;p&gt;Various responses have been suggested. Most obviously, we could avoid the paradox either by refusing to consider outcomes with a small enough probability or by limiting agents' utilities to finite values. However, in both cases, these moves seem ad hoc. It's unclear why we should set some limit to the amount of utility an agent can receive, and it seems unreasonable to discount outcomes entirely just because they're unlikely. Further, it is unclear whether the second of these approaches, limiting an agent's utility to some finite value, really solves the problem. After all, at it's core, the St. Petersburg paradox is not about infinite utilities but rather about cases where expected utility approaches seem to overvalue some choice, and such cases seem to exist even in finite cases. For example, if we let &lt;em&gt;L&lt;/em&gt; be a finite limit on utility we could consider the following scenario (from &lt;a href=&quot;http://www.amazon.com/Introduction-Decision-Cambridge-Introductions-Philosophy/dp/0521888379/&quot;&gt;Peterson, 2009, p. 85&lt;/a&gt;):&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;A fair coin is tossed until it lands heads up. The player thereafter receives a prize worth min {2&lt;sup&gt;n&lt;/sup&gt; &amp;#xB7; 10&lt;sup&gt;-100&lt;/sup&gt;, L} units of utility, where &lt;em&gt;n&lt;/em&gt; is the number of times the coin was tossed.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;In this case, even if an extremely low value is set for &lt;em&gt;L&lt;/em&gt; (say, 1), it seems that paying this amount to play the game is unreasonable. After all, as Peterson notes, about nine times out of ten an agent that plays this game will win no more than 8 &amp;#xB7; 10&lt;sup&gt;-100&lt;/sup&gt; utility. If paying 1 utility is, in fact, unreasonable in this case, then simply limiting an agent's utility to some finite value doesn't provide a defence of expected utility approaches. (Other problems abound. See &lt;a href=&quot;/lw/kd/pascals_mugging_tiny_probabilities_of_vast/&quot;&gt;Yudkowsky, 2007&lt;/a&gt; for an interesting finite problem and &lt;a href=&quot;http://philrsss.anu.edu.au/people-defaults/alanh/papers/vexing_expectations.pdf&quot;&gt;Nover &amp;amp; Hajek, 2004&lt;/a&gt; for a particularly perplexing problem with links to the St Petersburg paradox.)&lt;/p&gt;
&lt;p&gt;As it stands, there is no agreement about precisely what the St Petersburg paradox reveals. Some people accept one of the various resolutions of the case and so find the paradox unconcerning. Others think the paradox reveals a serious problem for expected utility theories. Still others think the paradox is unresolved but don't think that we should respond by abandoning expected utility theory.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h2 id=&quot;does-axiomatic-decision-theory-offer-any-action-guidance&quot;&gt;&lt;a href=&quot;#does-axiomatic-decision-theory-offer-any-action-guidance&quot;&gt;9. Does axiomatic decision theory offer any action guidance?&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;For the decision theories listed in section 8.2, it's often claimed the answer is &quot;no.&quot; To explain this, I must first examine some differences between &lt;em&gt;direct&lt;/em&gt; and &lt;em&gt;indirect&lt;/em&gt; approaches to axiomatic decision theory.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;http://www.amazon.com/Introduction-Decision-Cambridge-Introductions-Philosophy/dp/0521888379/&quot;&gt;Peterson (2009, ch. 4)&lt;/a&gt; explains:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;In the indirect approach, which is the dominant approach, the decision maker does not prefer a risky act [or lottery] to another &lt;em&gt;because&lt;/em&gt; the expected utility of the former exceeds that of the latter. Instead, the decision maker is asked to state a set of preferences over a set of risky acts... Then, if the set of preferences stated by the decision maker is consistent with a small number of structural constraints (axioms), it can be shown that her decisions can be described &lt;em&gt;as if&lt;/em&gt; she were choosing what to do by assigning numerical probabilities and utilities to outcomes and then maximising expected utility...&lt;/p&gt;
&lt;/blockquote&gt;
&lt;blockquote&gt;
&lt;p&gt;[In contrast] the direct approach seeks to generate preferences over acts from probabilities and utilities &lt;em&gt;directly&lt;/em&gt; assigned to outcomes. In contrast to the indirect approach, it is not assumed that the decision maker has access to a set of preferences over acts before he starts to deliberate.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;The axiomatic decision theories listed in section 8.2 all follow the indirect approach. These theories, it might be said, cannot offer any action guidance because they require an agent to state its preferences over acts &quot;up front.&quot; But an agent that states its preferences over acts already knows which act it prefers, so the decision theory can't offer any action guidance not already present in the agent's own stated preferences over acts.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;http://www.amazon.com/Introduction-Decision-Cambridge-Introductions-Philosophy/dp/0521888379/&quot;&gt;Peterson (2009, ch .10)&lt;/a&gt; gives a practical example:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;For example, a forty-year-old woman seeking advice about whether to, say, divorce her husband, is likely to get very different answers from the [two approaches]. The [indirect approach] will advise the woman to first figure out what her preferences are over a very large set of risky acts, including the one she is thinking about performing, and then just make sure that all preferences are consistent with certain structural requirements. Then, as long as none of the structural requirements is violated, the woman is free to do whatever she likes, no matter what her beliefs and desires actually are... The [direct approach] will [instead] advise the woman to first assign numerical utilities and probabilities to her desires and beliefs, and then aggregate them into a decision by applying the principle of maximizing expected utility.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Thus, it seems only the direct approach offers an agent any action guidance. But the direct approach is very recent (&lt;a href=&quot;http://www.amazon.com/Non-Bayesian-Decision-Theory-Beliefs-Desires/dp/9048179572/&quot;&gt;Peterson 2008&lt;/a&gt;; &lt;a href=&quot;http://commonsenseatheism.com/wp-content/uploads/2012/05/Cozic-Review-of-Non-Bayesian-Decision-Theory.pdf&quot;&gt;Cozic 2011&lt;/a&gt;), and only time will show whether it can stand up to professional criticism.&lt;/p&gt;
&lt;p&gt;Warning: Peterson's (&lt;a href=&quot;http://www.amazon.com/Non-Bayesian-Decision-Theory-Beliefs-Desires/dp/9048179572/&quot;&gt;2008&lt;/a&gt;) direct approach is confusingly called &quot;non-Bayesian decision theory&quot; despite assuming Bayesian probability theory.&lt;/p&gt;
&lt;p&gt;For other attempts to pull action guidance from normative decision theory, see &lt;a href=&quot;/lw/fu1/why_you_must_maximize_expected_utility/&quot;&gt;Fallenstein (2012)&lt;/a&gt; and &lt;a href=&quot;/lw/gap/a_fungibility_theorem/&quot;&gt;Stiennon (2013)&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h2 id=&quot;how-does-probability-theory-play-a-role-in-decision-theory&quot;&gt;&lt;a href=&quot;#how-does-probability-theory-play-a-role-in-decision-theory&quot;&gt;10. How does probability theory play a role in decision theory?&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;In order to calculate the expected utility of an act (or lottery), it is necessary to determine a probability for each outcome. In this section, I will explore some of the details of probability theory and its relationship to decision theory.&lt;/p&gt;
&lt;p&gt;For further introductory material to probability theory, see &lt;a href=&quot;http://www.amazon.com/Scientific-Reasoning-The-Bayesian-Approach/dp/081269578X/&quot;&gt;Howson &amp;amp; Urbach (2005)&lt;/a&gt;, &lt;a href=&quot;http://www.amazon.com/Probability-Random-Processes-Geoffrey-Grimmett/dp/0198572220/&quot;&gt;Grimmet &amp;amp; Stirzacker (2001)&lt;/a&gt;, and &lt;a href=&quot;http://www.amazon.com/Probabilistic-Graphical-Models-Principles-Computation/dp/0262013193/&quot;&gt;Koller &amp;amp; Friedman (2009)&lt;/a&gt;. This section draws heavily on &lt;a href=&quot;http://www.amazon.com/Introduction-Decision-Cambridge-Introductions-Philosophy/dp/0521888379/&quot;&gt;Peterson (2009, chs. 6 &amp;amp; 7)&lt;/a&gt; which provides a very clear introduction to probability in the context of decision theory.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h3 id=&quot;the-basics-of-probability-theory&quot;&gt;&lt;a href=&quot;#the-basics-of-probability-theory&quot;&gt;10.1. The basics of probability theory&lt;/a&gt;&lt;/h3&gt;
&lt;p&gt;Intuitively, a probability is a number between 0 or 1 that labels how likely an event is to occur. If an event has probability 0 then it is impossible and if it has probability 1 then it can't possibly be false. If an event has a probability between these values, then this event it is more probable the higher this number is.&lt;/p&gt;
&lt;p&gt;As with EUM, probability theory can be derived from a small number of simple axioms. In the probability case, there are three of these, which are named the Kolmogorov axioms after the mathematician Andrey Kolmogorov. The first of these states that probabilities are real numbers between 0 and 1. The second, that if a set of events are mutually exclusive and exhaustive then their probabilities should sum to 1. The third that if two events are mutually exclusive then the probability that one or the other of these events will occur is equal to the sum of their individual probabilities.&lt;/p&gt;
&lt;p&gt;From these three axioms, the remainder of probability theory can be derived. In the remainder of this section, I will explore some aspects of this broader theory.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h3 id=&quot;bayes-theorem-for-updating-probabilities&quot;&gt;&lt;a href=&quot;#bayes-theorem-for-updating-probabilities&quot;&gt;10.2. Bayes theorem for updating probabilities&lt;/a&gt;&lt;/h3&gt;
&lt;p&gt;From the perspective of decision theory, one particularly important aspect of probability theory is the idea of a conditional probability. These represent how probable something is given a piece of information. So, for example, a conditional probability could represent how likely it is that it will be raining, conditioning on the fact that the weather forecaster predicted rain. A powerful technique for calculating conditional probabilities is Bayes theorem (see &lt;a href=&quot;http://yudkowsky.net/rational/bayes&quot;&gt;Yudkowsky, 2003&lt;/a&gt; for a detailed introduction). This formula states that:&lt;/p&gt;
&lt;div class=&quot;figure&quot;&gt;&lt;img src=&quot;http://i.imgur.com/lTKXA.gif&quot; alt=&quot;P(A|B)=(P(B|A)P(A))/P(B)&quot;&gt;
&lt;p class=&quot;caption&quot;&gt;P(A|B)=(P(B|A)P(A))/P(B)&lt;/p&gt;
&lt;/div&gt;
&lt;p&gt;Bayes theorem is used to calculate the probability of some event, A, given some evidence, B. As such, this formula can be used to &lt;em&gt;update&lt;/em&gt; probabilities based on new evidence. So if you are trying to predict the probability that it will rain tomorrow and someone gives you the information that the weather forecaster predicted that it will do so then this formula tells you how to calculate a new probability that it will rain based on your existing information. The initial probability in such cases (before the information is factored into account) is called the &lt;em&gt;prior probability&lt;/em&gt; and the result of applying Bayes theorem is a new, &lt;em&gt;posterior probability&lt;/em&gt;.&lt;/p&gt;
&lt;div class=&quot;figure&quot;&gt;&lt;img src=&quot;http://i.imgur.com/vM0yW.jpg&quot; alt=&quot;Using Bayes theorem to update probabilities based on the evidence provided by a weather forecast&quot;&gt;
&lt;p class=&quot;caption&quot;&gt;Using Bayes theorem to update probabilities based on the evidence provided by a weather forecast&lt;/p&gt;
&lt;/div&gt;
&lt;p&gt;Bayes theorem can be seen as solving the problem of how to update prior probabilities based on new information. However, it leaves open the question of how to determine the prior probability in the first place. In some cases, there will be no obvious way to do so. One solution to this problem suggests that any reasonable prior can be selected. Given enough evidence, repeated applications of Bayes theorem will lead this prior probability to be updated to much the same posterior probability, even for people with widely different initial priors. As such, the initially selected prior is less crucial than it may at first seem.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h3 id=&quot;how-should-probabilities-be-interpreted&quot;&gt;&lt;a href=&quot;#how-should-probabilities-be-interpreted&quot;&gt;10.3. How should probabilities be interpreted?&lt;/a&gt;&lt;/h3&gt;
&lt;p&gt;There are two main views about what probabilities mean: objectivism and subjectivism. Loosely speaking, the objectivist holds that probabilities tell us something about the external world while the subjectivist holds that they tell us something about our beliefs. Most decision theorists hold a subjectivist view about probability. According to this sort of view, probabilities represent a subjective degrees of belief. So to say the probability of rain is 0.8 is to say that the agent under consideration has a high degree of belief that it will rain (see &lt;a href=&quot;http://www.amazon.com/Probability-Theory-The-Logic-Science/dp/0521592712/&quot;&gt;Jaynes, 2003&lt;/a&gt; for a defense of this view). Note that, according to this view, another agent in the same circumstance could assign a different probability that it will rain.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4 id=&quot;why-should-degrees-of-belief-following-the-laws-of-probability&quot;&gt;&lt;a href=&quot;#why-should-degrees-of-belief-following-the-laws-of-probability&quot;&gt;10.3.1. Why should degrees of belief follow the laws of probability?&lt;/a&gt;&lt;/h4&gt;
&lt;p&gt;One question that might be raised against the subjective account of probability is why, on this account, our degrees of belief should satisfy the Kolmogorov axioms. For example, why should our subjective degrees of belief in mutually exclusive, exhaustive events add to 1? One answer to this question shows that agents whose degrees of belief don&amp;#x2019;t satisfy these axioms will be subject to Dutch Book bets. These are bets where the agent will inevitably lose money. &lt;a href=&quot;http://www.amazon.com/Introduction-Decision-Cambridge-Introductions-Philosophy/dp/0521888379/&quot;&gt;Peterson (2009, ch. 7)&lt;/a&gt; explains:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Suppose, for instance, that you believe to degree 0.55 that at least one person from India will win a gold medal in the next Olympic Games (event G), and that your subjective degree of belief is 0.52 that no Indian will win a gold medal in the next Olympic Games (event &amp;#xAC;G). Also suppose that a cunning bookie offers you a bet on both of these events. The bookie promises to pay you $1 for each event that actually takes place. Now, since your subjective degree of belief that G will occur is 0.55 it would be rational to pay up to $1&amp;#xB7;0.55 = $0.55 for entering this bet. Furthermore, since your degree of belief in &amp;#xAC;G is 0.52 you should be willing to pay up to $0.52 for entering the second bet, since $1&amp;#xB7;0.52 = $0.52. However, by now you have paid $1.07 for taking on two bets that are certain to give you a payoff of $1 &lt;em&gt;no matter what happens&lt;/em&gt;...Certainly, this must be irrational. Furthermore, the reason why this is irrational is that your subjective degrees of belief violate the probability calculus.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;div class=&quot;figure&quot;&gt;&lt;img src=&quot;http://i.imgur.com/9xoLg.jpg&quot; alt=&quot;A Dutch Book argument&quot;&gt;
&lt;p class=&quot;caption&quot;&gt;A Dutch Book argument&lt;/p&gt;
&lt;/div&gt;
&lt;p&gt;It can be proven that an agent is subject to Dutch Book bets if, and only if, their degrees of belief violate the axioms of probability. This provides an argument for why degrees of beliefs should satisfy these axioms.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4 id=&quot;measuring-subjective-probabilities&quot;&gt;&lt;a href=&quot;#measuring-subjective-probabilities&quot;&gt;10.3.2. Measuring subjective probabilities&lt;/a&gt;&lt;/h4&gt;
&lt;p&gt;Another challenges raised by the subjective view is how we can measure probabilities. If these represent subjective degrees of belief there doesn&amp;#x2019;t seem to be an easy way to determine these based on observations of the world. However, a number of responses to this problem have been advanced, one of which is explained succinctly by &lt;a href=&quot;http://www.amazon.com/Introduction-Decision-Cambridge-Introductions-Philosophy/dp/0521888379/&quot;&gt;Peterson (2009, ch. 7)&lt;/a&gt;:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;The main innovations presented by... Savage can be characterised as systematic procedures for linking probability... to claims about objectively observable behavior, such as preference revealed in choice behavior. Imagine, for instance, that we wish to measure Caroline's subjective probability that the coin she is holding in her hand will land heads up the next time it is tossed. First, we ask her which of the following very generous options she would prefer.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;blockquote&gt;
&lt;p&gt;A: &quot;If the coin lands heads up you win a sports car; otherwise you win nothing.&quot;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;blockquote&gt;
&lt;p&gt;B: &quot;If the coin &lt;em&gt;does not&lt;/em&gt; land heads up you win a sports car; otherwise you win nothing.&quot;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;blockquote&gt;
&lt;p&gt;Suppose Caroline prefers A to B. We can then safely conclude that she thinks it is &lt;em&gt;more probable&lt;/em&gt; that the coin will land heads up rather than not. This follows from the assumption that Caroline prefers to win a sports car rather than nothing, and that her preference between uncertain prospects is entirely determined by her beliefs and desires with respect to her prospects of winning the sports car...&lt;/p&gt;
&lt;/blockquote&gt;
&lt;blockquote&gt;
&lt;p&gt;Next, we need to generalise the measurement procedure outlined above such that it allows us to always represent Caroline's degrees of belief with precise numerical probabilities. To do this, we need to ask Caroline to state preferences over a &lt;em&gt;much larger&lt;/em&gt; set of options and then &lt;em&gt;reason backwards&lt;/em&gt;... Suppose, for instance, that Caroline wishes to measure her subjective probability that her car worth $20,000 will be stolen within one year. If she considers $1,000 to be... the highest price she is prepared to pay for a gamble in which she gets $20,000 if the event S: &quot;The car stolen within a year&quot; takes place, and nothing otherwise, then Caroline's subjective probability for S is 1,000/20,000 = 0.05, given that she forms her preferences in accordance with the principle of maximising expected monetary value...&lt;/p&gt;
&lt;/blockquote&gt;
&lt;blockquote&gt;
&lt;p&gt;The problem with this method is that very few people form their preferences in accordance with the principle of maximising expected monetary value. Most people have a decreasing marginal utility for money...&lt;/p&gt;
&lt;/blockquote&gt;
&lt;blockquote&gt;
&lt;p&gt;Fortunately, there is a clever solution to [this problem]. The basic idea is to impose a number of structural conditions on preferences over uncertain options [e.g. the transitivity axiom]. Then, the subjective probability function is established by reasoning backwards while taking the structural axioms into account: Since the decision maker preferrred some uncertain options to others, and her preferences... satisfy a number of structure axioms, the decision maker behaves &lt;em&gt;as if&lt;/em&gt; she were forming her preferences over uncertain options by first assigning subjective probabilities and utilities to each option and thereafter maximising expected utility.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;blockquote&gt;
&lt;p&gt;A peculiar feature of this approach is, thus, that probabilities (and utilities) are derived from 'within' the theory. The decision maker does not prefer an uncertain option to another &lt;em&gt;because&lt;/em&gt; she judges the subjective probabilities (and utilities) of the outcomes to be more favourable than those of another. Instead, the... structure of the decision maker's preferences over uncertain options logically implies that they can be described &lt;em&gt;as if&lt;/em&gt; her choices were governed by a subjective probability function and a utility function...&lt;/p&gt;
&lt;/blockquote&gt;
&lt;blockquote&gt;
&lt;p&gt;...Savage's approach [seeks] to explicate subjective interpretations of the probability axioms by making certain claims about preferences over... uncertain options. But... why on earth should a theory of subjective probability involve assumptions about preferences, given that preferences and beliefs are separate entities? Contrary to what is claimed by [Savage and others], emotionally inert decision makers failing to muster any preferences at all... could certainly hold partial beliefs.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Other theorists, for example &lt;a href=&quot;http://www.amazon.com/Optimal-Statistical-Decisions-Classics-Library/dp/047168029X/&quot;&gt;DeGroot (1970)&lt;/a&gt;, propose other approaches:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;DeGroot's basic assumption is that decision makers can make &lt;em&gt;qualitative&lt;/em&gt; comparisons between pairs of events, and judge which one they think is most likely to occur. For example, he assumes that one can judge whether it is &lt;em&gt;more&lt;/em&gt;, &lt;em&gt;less&lt;/em&gt;, or &lt;em&gt;equally&lt;/em&gt; likely, according to one's own beliefs, that it will rain today in Cambridge than in Cairo. DeGroot then shows that if the agent's qualitative judgments are sufficiently fine-grained and satisfy a number of structural axioms, then [they can be described by a probability distribution]. So in DeGroot's... theory, the probability function is obtained by fine-tuning qualitative data, thereby making them quantitative.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h2 id=&quot;what-about-newcombs-problem-and-alternative-decision-algorithms&quot;&gt;&lt;br&gt;&lt;/h2&gt;
&lt;h2&gt;&lt;a href=&quot;#what-about-newcombs-problem-and-alternative-decision-algorithms&quot;&gt;11. What about &quot;Newcomb's problem&quot; and alternative decision algorithms?&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Saying that a rational agent &quot;maximizes expected utility&quot; is, unfortunately, not specific enough. There are a variety of decision algorithms which aim to maximize expected utility, and they give &lt;em&gt;different answers&lt;/em&gt; to some decision problems, for example &quot;Newcomb's problem.&quot;&lt;/p&gt;
&lt;p&gt;In this section, we explain these decision algorithms and show how they perform on Newcomb's problem and related &quot;Newcomblike&quot; problems.&lt;/p&gt;
&lt;p&gt;General sources on this topic include: &lt;a href=&quot;http://www.amazon.com/Paradoxes-Rationality-Cooperation-Prisoners-Newcombs/dp/0774802154/&quot;&gt;Campbell &amp;amp; Sowden (1985)&lt;/a&gt;, &lt;a href=&quot;http://kops.ub.uni-konstanz.de/bitstream/handle/urn:nbn:de:bsz:352-opus-5241/ledwig.pdf?sequence=1&quot;&gt;Ledwig (2000)&lt;/a&gt;, &lt;a href=&quot;http://www.amazon.com/Foundations-Decision-Cambridge-Probability-Induction/dp/0521063566/&quot;&gt;Joyce (1999)&lt;/a&gt;, and &lt;a href=&quot;http://intelligence.org/files/TDT.pdf&quot;&gt;Yudkowsky (2010)&lt;/a&gt;. &lt;a href=&quot;http://www.operalgo.com/PDF/Moertelmaier_Newcomblike_2013.pdf&quot;&gt;Moertelmaier (2013)&lt;/a&gt; discusses Newcomblike problems in the context of the agent-environment framework.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h3 id=&quot;newcomblike-problems-and-two-decision-algorithms&quot;&gt;&lt;a href=&quot;#newcomblike-problems-and-two-decision-algorithms&quot;&gt;11.1. Newcomblike problems and two decision algorithms&lt;/a&gt;&lt;/h3&gt;
&lt;p&gt;I'll begin with an exposition of several Newcomblike problems, so that I can refer to them in later sections. I'll also introduce our first two decision algorithms, so that I can show how one's choice of decision algorithm affects an agent's outcomes on these problems.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4 id=&quot;newcombs-problem&quot;&gt;&lt;a href=&quot;#newcombs-problem&quot;&gt;11.1.1. Newcomb's Problem&lt;/a&gt;&lt;/h4&gt;
&lt;p&gt;Newcomb's problem was formulated by the physicist &lt;a href=&quot;http://en.wikipedia.org/wiki/William_Newcomb&quot;&gt;William Newcomb&lt;/a&gt; but first published in &lt;a href=&quot;http://faculty.arts.ubc.ca/rjohns/nozick_newcomb.pdf&quot;&gt;Nozick (1969)&lt;/a&gt;. Below I present a version of it inspired by &lt;a href=&quot;http://intelligence.org/files/TDT.pdf&quot;&gt;Yudkowsky (2010)&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;A superintelligent machine named Omega visits Earth from another galaxy and shows itself to be very good at predicting events. This isn't because it has magical powers, but because it knows more science than we do, has billions of sensors scattered around the globe, and runs efficient algorithms for modeling humans and other complex systems with unprecedented precision &amp;#x2014; on an array of computer hardware the size of our moon.&lt;/p&gt;
&lt;p&gt;Omega presents you with two boxes. Box A is transparent and contains $1000. Box B is opaque and contains either $1 million or nothing. You may choose to take both boxes (called &quot;two-boxing&quot;), or you may choose to take only box B (called &quot;one-boxing&quot;). If Omega predicted you'll two-box, then Omega has left box B empty. If Omega predicted you'll one-box, then Omega has placed $1M in box B.&lt;/p&gt;
&lt;p&gt;By the time you choose, Omega has already left for its next game &amp;#x2014; the contents of box B won't change after you make your decision. Moreover, you've watched Omega play a thousand games against people like you, and on every occasion Omega predicted the human player's choice accurately.&lt;/p&gt;
&lt;p&gt;Should you one-box or two-box?&lt;/p&gt;
&lt;div class=&quot;figure&quot;&gt;&lt;img src=&quot;http://i.imgur.com/4MFhs.jpg&quot; alt=&quot;Newcomb&amp;#x2019;s problem&quot;&gt;
&lt;p class=&quot;caption&quot;&gt;Newcomb&amp;#x2019;s problem&lt;/p&gt;
&lt;/div&gt;
&lt;p&gt;Here's an argument for two-boxing. The $1M either &lt;em&gt;is&lt;/em&gt; or &lt;em&gt;is not&lt;/em&gt; in the box; your choice cannot affect the contents of box B now. So, you should two-box, because then you get $1K plus whatever is in box B. This is a straightforward application of the dominance principle (section 6.1). Two-boxing dominantes one-boxing.&lt;/p&gt;
&lt;p&gt;Convinced? Well, here's an argument for one-boxing. On all those earlier games you watched, everyone who two-boxed received $1K, and everyone who one-boxed received $1M. So you're almost certain that you'll get $1K for two-boxing and $1M for one-boxing, which means that to maximize your expected utility, you should one-box.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;http://faculty.arts.ubc.ca/rjohns/nozick_newcomb.pdf&quot;&gt;Nozick (1969)&lt;/a&gt; reports:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;I have put this problem to a large number of people... To almost everyone it is perfectly clear and obvious what should be done. The difficulty is that these people seem to divide almost evenly on the problem, with large numbers thinking that the opposing half is just being silly.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;This is not a &quot;merely verbal&quot; dispute (&lt;a href=&quot;http://philreview.dukejournals.org/content/120/4/515.short&quot;&gt;Chalmers 2011&lt;/a&gt;). Decision theorists have offered different &lt;em&gt;algorithms&lt;/em&gt; for making a choice, and they have different outcomes. Translated into English, the first algorithm (&lt;em&gt;evidential decision theory&lt;/em&gt; or EDT) says &quot;Take actions such that you would be glad to receive the news that you had taken them.&quot; The second algorithm (&lt;em&gt;causal decision theory&lt;/em&gt; or CDT) says &quot;Take actions which you expect to have a positive effect on the world.&quot;&lt;/p&gt;
&lt;p&gt;Many decision theorists have the intuition that CDT is right. But a CDT agent appears to &quot;lose&quot; on Newcomb's problem, ending up with $1000, while an EDT agent gains $1M. Proponents of EDT can ask proponents of CDT: &quot;If you're so smart, why aren't you rich?&quot; As &lt;a href=&quot;http://www-ihpst.univ-paris1.fr/fichiers/programmes/20/Spohn-One-Boxing3.pdf&quot;&gt;Spohn (2012)&lt;/a&gt; writes, &quot;this must be poor rationality that complains about the reward for irrationality.&quot; Or as &lt;a href=&quot;http://intelligence.org/files/TDT.pdf&quot;&gt;Yudkowsky (2010)&lt;/a&gt; argues:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;An expected utility maximizer should maximize &lt;em&gt;utility&lt;/em&gt; &amp;#x2014; not formality, reasonableness, or defensibility...&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;In response to EDT's apparent &quot;win&quot; over CDT on Newcomb's problem, proponents of CDT have presented similar problems on which a CDT agent &quot;wins&quot; and an EDT agent &quot;loses.&quot; Proponents of EDT, meanwhile, have replied with additional Newcomblike problems on which EDT wins and CDT loses. Let's explore each of them in turn.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4 id=&quot;evidential-and-causal-decision-theory&quot;&gt;&lt;a href=&quot;#evidential-and-causal-decision-theory&quot;&gt;11.1.2. Evidential and causal decision theory&lt;/a&gt;&lt;/h4&gt;
&lt;p&gt;First, however, we will consider our two decision algorithms in a little more detail.&lt;/p&gt;
&lt;p&gt;EDT can be described simply: according to this theory, agents should use conditional probabilities when determining the expected utility of different acts. Specifically, they should use the probability of the world being in each possible state conditioning on them carrying out the act under consideration. So in Newcomb&amp;#x2019;s problem they consider the probability that Box B contains $1 million or nothing conditioning on the evidence provided by their decision to one-box or two-box. This is how the theory formalizes the notion of an act providing good news.&lt;/p&gt;
&lt;p&gt;CDT is more complex, at least in part because it has been formulated in a variety of different ways and these formulations are equivalent to one another only if certain background assumptions are met. However, a good sense of the theory can be gained by considering the counterfactual approach, which is one of the more intuitive of these formulations. This approach utilizes the probabilities of certain counterfactual conditionals, which can be thought of as representing the causal influence of an agent&amp;#x2019;s acts on the state of the world. These conditionals take the form &amp;#x201C;if I were to carry out a certain act, then the world would be in a certain state.&quot; So in Newcomb&amp;#x2019;s problem, for example, this formulation of CDT considers the probability of the counterfactuals like &amp;#x201C;if I were to one-box, then Box B would contain $1 million&amp;#x201D; and, in doing so, considers the causal influence of one-boxing on the contents of the boxes.&lt;/p&gt;
&lt;p&gt;The same distinction can be made in formulaic terms. Both EDT and CDT agree that decision theory should be about maximizing expected utility where the expected utility of an act, A, given a set of possible outcomes, O, is defined as follows:&lt;/p&gt;
&lt;p&gt;&lt;img src=&quot;http://i.imgur.com/CSwK4.gif&quot; alt=&quot;expected utility formula&quot;&gt;.&lt;/p&gt;
&lt;p&gt;In this equation, V(A &amp;amp; O) represents the value to the agent of the combination of an act and an outcome. So this is the utility that the agent will receive if they carry out a certain act and a certain outcome occurs. Further, Pr&lt;sub&gt;A&lt;/sub&gt;O represents the probability of each outcome occurring on the supposition that the agent carries out a certain act. It is in terms of this probability that CDT and EDT differ. EDT uses the conditional probability, Pr(O|A), while CDT uses the probability of subjunctive conditionals, Pr(A &lt;img src=&quot;http://i.imgur.com/G8xec.gif&quot; alt=&quot;&quot;&gt; O).&lt;/p&gt;
&lt;p&gt;Using these two versions of the expected utility formula, it's possible to demonstrate in a formal manner why EDT and CDT give the advice they do in Newcomb's problem. To demonstrate this it will help to make two simplifying assumptions. First, we will presume that each dollar of money is worth 1 unit of utility to the agent (and so will presume that the agent's utility is linear with money). Second, we will presume that Omega is a perfect predictor of human actions so that if the agent two-boxes it provides definitive evidence that there is nothing in the opaque box and if the agent one-boxes it provides definitive evidence that there is $1 million in this box. Given these assumptions, EDT calculates the expected utility of each decision as follows:&lt;/p&gt;
&lt;div class=&quot;figure&quot;&gt;&lt;img src=&quot;http://i.imgur.com/aCe4Y.gif&quot; alt=&quot;EU for two-boxing according to EDT&quot;&gt;
&lt;p class=&quot;caption&quot;&gt;EU for two-boxing according to EDT&lt;/p&gt;
&lt;/div&gt;
&lt;div class=&quot;figure&quot;&gt;&lt;img src=&quot;http://i.imgur.com/vJtVr.gif&quot; alt=&quot;EU for one-boxing according to EDT&quot;&gt;
&lt;p class=&quot;caption&quot;&gt;EU for one-boxing according to EDT&lt;/p&gt;
&lt;/div&gt;
&lt;p&gt;Given that one-boxing has a higher expected utility according to these calculations, an EDT agent will one-box.&lt;/p&gt;
&lt;p&gt;On the other hand, given that the agent's decision doesn't causally influence Omega's earlier prediction, CDT will use the same probability regardless of whether you one or two box. The decision endorsed will be the same regardless of what probability we use so, to demonstrate the theory, we can simply arbitrarily assign an 0.5 probability that the opaque box has nothing in it and an 0.5 probability that it has one million dollars in it. CDT then calculates the expected utility of each decision as follows:&lt;/p&gt;
&lt;div class=&quot;figure&quot;&gt;&lt;img src=&quot;http://i.imgur.com/oyHGl.gif&quot; alt=&quot;EU for two-boxing according to CDT&quot;&gt;
&lt;p class=&quot;caption&quot;&gt;EU for two-boxing according to CDT&lt;/p&gt;
&lt;/div&gt;
&lt;div class=&quot;figure&quot;&gt;&lt;img src=&quot;http://i.imgur.com/7uX9t.gif&quot; alt=&quot;EU for one-boxing according to CDT&quot;&gt;
&lt;p class=&quot;caption&quot;&gt;EU for one-boxing according to CDT&lt;/p&gt;
&lt;/div&gt;
&lt;p&gt;Given that two-boxing has a higher expected utility according to these calculations, a CDT agent will two-box. This approach demonstrates the result given more informally in the previous section: CDT agents will two-box in Newcomb's problem and EDT agents will one box.&lt;/p&gt;
&lt;p&gt;As mentioned before, there are also alternative formulations of CDT. What are these? For example, David Lewis &lt;a href=&quot;http://www.tandfonline.com/doi/abs/10.1080/00048408112340011&quot;&gt;(1981)&lt;/a&gt; and Brian Skyrms &lt;a href=&quot;http://www.amazon.com/Causal-Necessity-Pragmatic-Investigation-Laws/dp/0300023391&quot;&gt;(1980)&lt;/a&gt; both present approaches that rely on the partition of the world into states to capture causal information, rather than counterfactual conditionals. On Lewis&amp;#x2019;s version of this account, for example, the agent calculates the expected utility of acts using their unconditional credence in states of the world that are &lt;em&gt;dependency hypotheses&lt;/em&gt;, which are descriptions of the possible ways that the world can depend on the agent&amp;#x2019;s actions. These dependency hypotheses intrinsically contain the required causal information.&lt;/p&gt;
&lt;p&gt;Other traditional approaches to CDT include the imaging approach of &lt;a href=&quot;http://commonsenseatheism.com/wp-content/uploads/2012/09/Sobel-Probability-Chance-and-Choice-a-Theory-of-Rational-Agency.pdf&quot;&gt;Sobel (1980)&lt;/a&gt; (also see &lt;a href=&quot;http://www.tandfonline.com/doi/abs/10.1080/00048408112340011&quot;&gt;Lewis 1981&lt;/a&gt;) and the unconditional expectations approach of Leonard Savage &lt;a href=&quot;http://www.amazon.com/Foundations-Statistics-Leonard-J-Savage/dp/0486623491&quot;&gt;(1954)&lt;/a&gt;. Those interested in the various traditional approaches to CDT would be best to consult Lewis &lt;a href=&quot;http://www.tandfonline.com/doi/abs/10.1080/00048408112340011&quot;&gt;(1981)&lt;/a&gt;, &lt;a href=&quot;http://plato.stanford.edu/entries/decision-causal/&quot;&gt;Weirich (2008)&lt;/a&gt;, and &lt;a href=&quot;http://www.amazon.com/Foundations-Decision-Cambridge-Probability-Induction/dp/0521063566/&quot;&gt;Joyce (1999)&lt;/a&gt;. More recently, work in computer science on a tool called causal Bayesian networks has led to an innovative approach to CDT that has received some recent attention in the philosophical literature (&lt;a href=&quot;http://www.amazon.com/Causality-Reasoning-Inference-Judea-Pearl/dp/0521773628&quot;&gt;Pearl 2000, ch. 4&lt;/a&gt; and &lt;a href=&quot;http://www-ihpst.univ-paris1.fr/fichiers/programmes/20/Spohn-One-Boxing3.pdf&quot;&gt;Spohn 2012&lt;/a&gt;).&lt;/p&gt;
&lt;p&gt;Now we return to an analysis of decision scenarios, armed with EDT and the counterfactual formulation of CDT.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4 id=&quot;medical-newcomb-problems&quot;&gt;&lt;a href=&quot;#medical-newcomb-problems&quot;&gt;11.1.3. Medical Newcomb problems&lt;/a&gt;&lt;/h4&gt;
&lt;p&gt;Medical Newcomb problems share a similar form but come in many variants, including Solomon's problem (&lt;a href=&quot;https://www.kellogg.northwestern.edu/research/math/papers/194.pdf&quot;&gt;Gibbard &amp;amp; Harper 1976&lt;/a&gt;) and the smoking lesion problem (&lt;a href=&quot;http://fitelson.org/few/few_05/egan.pdf&quot;&gt;Egan 2007&lt;/a&gt;). Below I present a variant called the &quot;chewing gum problem&quot; (&lt;a href=&quot;http://intelligence.org/files/TDT.pdf&quot;&gt;Yudkowsky 2010&lt;/a&gt;):&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Suppose that a recently published medical study shows that chewing gum seems to cause throat abscesses &amp;#x2014; an outcome-tracking study showed that of people who chew gum, 90% died of throat abscesses before the age of 50. Meanwhile, of people who do not chew gum, only 10% die of throat abscesses before the age of 50. The researchers, to explain their results, wonder if saliva sliding down the throat wears away cellular defenses against bacteria. Having read this study, would you choose to chew gum? But now a second study comes out, which shows that most gum-chewers have a certain gene, CGTA, and the researchers produce a table showing the following mortality rates:&lt;/p&gt;
&lt;/blockquote&gt;
&lt;blockquote&gt;
&lt;table border=&quot;0&quot;&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&amp;#xA0;&lt;/td&gt;
&lt;td&gt;CGTA present&lt;/td&gt;
&lt;td&gt;CGTA absent&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Chew Gum&lt;/td&gt;
&lt;td&gt;89% die&lt;/td&gt;
&lt;td&gt;8% die&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Don&amp;#x2019;t chew&lt;/td&gt;
&lt;td&gt;99% die&lt;/td&gt;
&lt;td&gt;11% die&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;
&lt;/blockquote&gt;
&lt;blockquote&gt;
&lt;p&gt;This table shows that whether you have the gene CGTA or not, your chance of dying of a throat abscess goes down if you chew gum. Why are fatalities so much higher for gum-chewers, then? Because people with the gene CGTA tend to chew gum and die of throat abscesses. The authors of the second study also present a test-tube experiment which shows that the saliva from chewing gum can kill the bacteria that form throat abscesses. The researchers hypothesize that because people with the gene CGTA are highly susceptible to throat abscesses, natural selection has produced in them a tendency to chew gum, which protects against throat abscesses. The strong correlation between chewing gum and throat abscesses is not because chewing gum causes throat abscesses, but because a third factor, CGTA, leads to chewing gum and throat abscesses.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;blockquote&gt;
&lt;p&gt;Having learned of this new study, would you choose to chew gum? Chewing gum helps protect against throat abscesses whether or not you have the gene CGTA. Yet a friend who heard that you had decided to chew gum (as people with the gene CGTA often do) would be quite alarmed to hear the news &amp;#x2014; just as she would be saddened by the news that you had chosen to take both boxes in Newcomb&amp;#x2019;s Problem. This is a case where [EDT] seems to return the wrong answer, calling into question the validity of the... rule &amp;#x201C;Take actions such that you would be glad to receive the news that you had taken them.&amp;#x201D; Although the news that someone has decided to chew gum is alarming, medical studies nonetheless show that chewing gum protects against throat abscesses. [CDT's] rule of &amp;#x201C;Take actions which you expect to have a positive physical effect on the world&amp;#x201D; seems to serve us better.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;One response to this claim, called the &lt;em&gt;tickle defense&lt;/em&gt; (&lt;a href=&quot;http://www.jstor.org/discover/10.2307/20115662?uid=3737536&amp;amp;uid=2129&amp;amp;uid=2&amp;amp;uid=70&amp;amp;uid=4&amp;amp;sid=21101205363271&quot;&gt;Eells, 1981&lt;/a&gt;), argues that EDT actually reaches the right decision in such cases. According to this defense, the most reasonable way to construe the &amp;#x201C;chewing gum problem&amp;#x201D; involves presuming that CGTA causes a desire (a mental &amp;#x201C;tickle&amp;#x201D;) which then causes the agent to be more likely to chew gum, rather than CGTA directly causing the action. Given this, if we presume that the agent already knows their own desires and hence already knows whether they&amp;#x2019;re likely to have the CGTA gene, chewing gum will not provide the agent with further bad news. Consequently, an agent following EDT will chew in order to get the good news that they have decreased their chance of getting abscesses.&lt;/p&gt;
&lt;p&gt;Unfortunately, the tickle defense fails to achieve its aims. In introducing this approach, Eells hoped that EDT could be made to mimic CDT but without an allegedly inelegant reliance on causation. However, &lt;a href=&quot;http://www.amazon.com/Taking-Chances-Cambridge-Probability-Induction/dp/0521038987&quot;&gt;Sobel (1994, ch. 2)&lt;/a&gt; demonstrated that the tickle defense failed to ensure that EDT and CDT would decide equivalently in all cases. On the other hand, those who feel that EDT originally got it right by one-boxing in Newcomb&amp;#x2019;s problem will be disappointed to discover that the tickle defense leads an agent to two-box in some versions of Newcomb&amp;#x2019;s problem and so solves one problem for the theory at the expense of introducing another.&lt;/p&gt;
&lt;p&gt;So just as CDT &amp;#x201C;loses&amp;#x201D; on Newcomb&amp;#x2019;s problem, EDT will &quot;lose&amp;#x201D; on Medical Newcomb problems (if the tickle defense fails) or will join CDT and &quot;lose&quot; on Newcomb&amp;#x2019;s Problem itself (if the tickle defense succeeds).&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4 id=&quot;newcombs-soda&quot;&gt;&lt;a href=&quot;#newcombs-soda&quot;&gt;11.1.4. Newcomb's soda&lt;/a&gt;&lt;/h4&gt;
&lt;p&gt;There are also similar problematic cases for EDT where the evidence provided by your decision relates not to a feature that you were born (or created) with but to some other feature of the world. One such scenario is the &lt;em&gt;Newcomb&amp;#x2019;s soda&lt;/em&gt; problem, introduced in &lt;a href=&quot;http://intelligence.org/files/TDT.pdf&quot;&gt;Yudkowsky (2010)&lt;/a&gt;:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;You know that you will shortly be administered one of two sodas in a double-blind clinical test. After drinking your assigned soda, you will enter a room in which you find a chocolate ice cream and a vanilla ice cream. The first soda produces a strong but entirely subconscious desire for chocolate ice cream, and the second soda produces a strong subconscious desire for vanilla ice cream. By &amp;#x201C;subconscious&amp;#x201D; I mean that you have no introspective access to the change, any more than you can answer questions about individual neurons firing in your cerebral cortex. You can only infer your changed tastes by observing which kind of ice cream you pick.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;blockquote&gt;
&lt;p&gt;It so happens that all participants in the study who test the Chocolate Soda are rewarded with a million dollars after the study is over, while participants in the study who test the Vanilla Soda receive nothing. But subjects who actually eat vanilla ice cream receive an additional thousand dollars, while subjects who actually eat chocolate ice cream receive no additional payment. You can choose one and only one ice cream to eat. A pseudo-random algorithm assigns sodas to experimental subjects, who are evenly divided (50/50) between Chocolate and Vanilla Sodas. You are told that 90% of previous research subjects who chose chocolate ice cream did in fact drink the Chocolate Soda, while 90% of previous research subjects who chose vanilla ice cream did in fact drink the Vanilla Soda. Which ice cream would you eat?&lt;/p&gt;
&lt;/blockquote&gt;
&lt;div class=&quot;figure&quot;&gt;&lt;img src=&quot;http://i.imgur.com/FAZnb.jpg&quot; alt=&quot;Newcomb&amp;#x2019;s soda&quot;&gt;
&lt;p class=&quot;caption&quot;&gt;Newcomb&amp;#x2019;s soda&lt;/p&gt;
&lt;/div&gt;
&lt;p&gt;In this case, an EDT agent will decide to eat chocolate ice cream as this would provide evidence that they drank the chocolate soda and hence that they will receive $1 million after the experiment. However, this seems to be the wrong decision and so, once again, the EDT agent &amp;#x201C;loses&amp;#x201D;.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4 id=&quot;bostroms-meta-newcomb-problem&quot;&gt;&lt;a href=&quot;#bostroms-meta-newcomb-problem&quot;&gt;11.1.5. Bostrom's meta-Newcomb problem&lt;/a&gt;&lt;/h4&gt;
&lt;p&gt;In response to attacks on their theory, the proponent of EDT can present alternative scenarios where EDT &amp;#x201C;wins&amp;#x201D; and it is CDT that &amp;#x201C;loses&amp;#x201D;. One such case is the &lt;em&gt;meta-Newcomb problem&lt;/em&gt; proposed in &lt;a href=&quot;http://www.nickbostrom.com/papers/newcomb.html&quot;&gt;Bostrom (2001)&lt;/a&gt;. Adapted to fit my earlier story about Omega the superintelligent machine (section 11.1.1), the problem runs like this: Either Omega has &lt;em&gt;already&lt;/em&gt; placed $1M or nothing in box B (depending on its prediction about your choice), or else Omega is watching as you choose and &lt;em&gt;after&lt;/em&gt; your choice it will place $1M into box B only if you have one-boxed. But you don't know which is the case. Omega makes its move before the human player's choice about half the time, and the rest of the time it makes its move &lt;em&gt;after&lt;/em&gt; the player's choice.&lt;/p&gt;
&lt;p&gt;But now suppose there is another superintelligent machine, Meta-Omega, who has a perfect track record of predicting both Omega's choices and the choices of human players. Meta-Omega tells you that either you will two-box and Omega will &quot;make its move&quot; &lt;em&gt;after&lt;/em&gt; you make your choice, or else you will one-box and Omega has &lt;em&gt;already&lt;/em&gt; made its move (and gone on to the next game, with someone else).&lt;/p&gt;
&lt;p&gt;Here, an EDT agent one-boxes and walks away with a million dollars. On the face of it, however, a CDT agent faces a dilemma: if she two-boxes then Omega's action depends on her choice, so the &quot;rational&quot; choice is to one-box. But if the CDT agent one-boxes, then Omega's action temporally precedes (and is thus physically independent of) her choice, so the &quot;rational&quot; action is to two-box. It might seem, then, that a CDT agent will be unable to reach any decision in this scenario. However, further reflection reveals that the issue is more complicated. According to CDT, what the agent ought to do in this scenario depends on their credences about their own actions. If they have a high credence that they will two-box, they ought to one-box and if they have a high credence that they will one-box, they ought to two box. Given that the agent's credences in their actions are not given to us in the description of the meta-Newcomb problem, the scenario is underspecified and it is hard to know what conclusions should be drawn from it.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4 id=&quot;the-psychopath-button&quot;&gt;&lt;a href=&quot;#the-psychopath-button&quot;&gt;11.1.6. The psychopath button&lt;/a&gt;&lt;/h4&gt;
&lt;p&gt;Fortunately, another case has been introduced where, according to CDT, what an agent ought to do depends on their credences about what they will do. This is the &lt;em&gt;psychopath button&lt;/em&gt;, introduced in &lt;a href=&quot;http://philreview.dukejournals.org/content/116/1/93.citation&quot;&gt;Egan (2007)&lt;/a&gt;:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Paul is debating whether to press the &amp;#x201C;kill all psychopaths&amp;#x201D; button. It would, he thinks, be much better to live in a world with no psychopaths. Unfortunately, Paul is quite confident that only a psychopath would press such a button. Paul very strongly prefers living in a world with psychopaths to dying. Should Paul press the button?&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Many people think Paul should not. After all, if he does so, he is almost certainly a psychopath and so pressing the button will almost certainly cause his death. This is also the response that an EDT agent will give. After all, pushing the button would provide the agent with the bad news that they are almost certainly a psychopath and so will die as a result of their action.&lt;/p&gt;
&lt;p&gt;On the other hand, if Paul is fairly certain that he is not a psychopath, then CDT will say that he ought to press the button. CDT will note that, given Paul&amp;#x2019;s confidence that he isn&amp;#x2019;t a psychopath, his decision will almost certainly have a positive impact as it will result in the death of all psychopaths and Paul&amp;#x2019;s survival. On the face of it, then, a CDT agent would decide inappropriately in this case by pushing the button. Importantly, unlike in the meta-Newcomb problem, the agent's credences about their own behavior are specified in Egan's full version of this scenario (in non-numeric terms, the agent thinks they're unlikely to be a psychopath and hence unlikely to press the button).&lt;/p&gt;
&lt;p&gt;However, in order to produce this problem for CDT, Egan made a number of assumptions about how an agent should decide when what they ought to do depends on what they think they will do. In response, alternative views about deciding in such cases have been advanced (particular in &lt;a href=&quot;http://www.jstor.org/discover/10.2307/40267481?uid=3737536&amp;amp;uid=2&amp;amp;uid=4&amp;amp;sid=21101299066461&quot;&gt;Arntzenius, 2008&lt;/a&gt; and &lt;a href=&quot;http://rd.springer.com/article/10.1007/s11229-011-0022-6&quot;&gt;Joyce, 2012&lt;/a&gt;). Given these factors, opinions are split about whether the psychopath button problem does in fact pose a challenge to CDT.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4 id=&quot;parfits-hitchhiker&quot;&gt;&lt;a href=&quot;#parfits-hitchhiker&quot;&gt;11.1.7. Parfit's hitchhiker&lt;/a&gt;&lt;/h4&gt;
&lt;p&gt;Not all decision scenarios are problematic for just one of EDT or CDT. There are also cases that can be presented where both an EDT agent and a CDT agent will both &quot;lose&quot;. One such case is &lt;em&gt;Parfit&amp;#x2019;s Hitchhiker&lt;/em&gt; (&lt;a href=&quot;http://www.amazon.com/Reasons-Persons-Oxford-Paperbacks-Parfit/dp/019824908X&quot;&gt;Parfit, 1984, p. 7&lt;/a&gt;):&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Suppose that I am driving at midnight through some desert. My car breaks down. You are a stranger, and the only other driver near. I manage to stop you, and I offer you a great reward if you rescue me. I cannot reward you now, but I promise to do so when we reach my home. Suppose next that I am &lt;em&gt;transparent&lt;/em&gt;, unable to deceive others. I cannot lie convincingly. Either a blush, or my tone of voice, always gives me away. Suppose, finally, that I know myself to be never self-denying. If you drive me to my home, it would be worse for me if I gave you the promised reward. Since I know that I never do what will be worse for me, I know that I shall break my promise. Given my inability to lie convincingly, you know this too. You do not believe my promise, and therefore leave me stranded in the desert.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;In this scenario the agent &quot;loses&quot; if they would later refuse to give the stranger the reward. However, both EDT agents and CDT agents will refuse to do so. After all, by this point the agent will already be safe so giving the reward can neither provide good news about, nor cause, their safety. So this seems to be a case where both theories &amp;#x201C;lose&amp;#x201D;.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4 id=&quot;transparent-newcombs-problem&quot;&gt;&lt;a href=&quot;#transparent-newcombs-problem&quot;&gt;11.1.8. Transparent Newcomb's problem&lt;/a&gt;&lt;/h4&gt;
&lt;p&gt;There are also other cases where both EDT and CDT &quot;lose&quot;. One of these is the &lt;em&gt;Transparent Newcomb's problem&lt;/em&gt; which, in at least one version, is due to &lt;a href=&quot;http://www.amazon.com/Good-Real-Demystifying-Paradoxes-Bradford/dp/0262042339&quot;&gt;Drescher (2006, p. 238-242)&lt;/a&gt;. This scenario is like the original Newcomb's problem but, in this case, both boxes are transparent so you can see their contents when you make your decision. Again, Omega has filled box A with $1000 and Box B with either $1 million or nothing based on a prediction of your behavior. Specifically, Omega has predicted how you would decide if you witnessed $1 million in Box B. If Omega predicted that you would one-box in this case, he placed $1 million in Box B. On the other hand, if Omega predicted that you would two-box in this case then he placed nothing in Box B.&lt;/p&gt;
&lt;p&gt;Both EDT and CDT agents will two-box in this case. After all, the contents of the boxes are determined and known so the agent's decision can neither provide good news about what they contain nor cause them to contain something desirable. As with two-boxing in the original version of Newcomb&amp;#x2019;s problem, many philosophers will endorse this behavior.&lt;/p&gt;
&lt;p&gt;However, it&amp;#x2019;s worth noting that Omega will almost certainly have predicted this decision and so filled Box B with nothing. CDT and EDT agents will end up with $1000. On the other hand, just as in the original case, the agent that one-boxes will end up with $1 million. So this is another case where both EDT and CDT &amp;#x201C;lose&amp;#x201D;. Consequently, to those that agree with the earlier comments (in section 11.1.1) that a decision theory shouldn't lead an agent to &quot;lose&quot;, neither of these theories will be satisfactory.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4 id=&quot;counterfactual-mugging&quot;&gt;&lt;a href=&quot;#counterfactual-mugging&quot;&gt;11.1.9. Counterfactual mugging&lt;/a&gt;&lt;/h4&gt;
&lt;p&gt;Another similar case, known as &lt;em&gt;counterfactual mugging&lt;/em&gt;, was developed in &lt;a href=&quot;/lw/3l/counterfactual_mugging/&quot;&gt;Nesov (2009)&lt;/a&gt;:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Imagine that one day, Omega comes to you and says that it has just tossed a fair coin, and given that the coin came up tails, it decided to ask you to give it $100. Whatever you do in this situation, nothing else will happen differently in reality as a result. Naturally you don't want to give up your $100. But see, the Omega tells you that if the coin came up heads instead of tails, it'd give you $10000, but only if you'd agree to give it $100 if the coin came up tails.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Should you give up the $100?&lt;/p&gt;
&lt;p&gt;Both CDT and EDT say no. After all, giving up your money neither provides good news about nor influences your chances of getting $10 000 out of the exchange. Further, this intuitively seems like the right decision. On the face of it, then, it is appropriate to retain your money in this case.&lt;/p&gt;
&lt;p&gt;However, presuming you take Omega to be perfectly trustworthy, there seems to be room to debate this conclusion. If you are the sort of agent that gives up the $100 in counterfactual mugging then you will tend to do better than the sort of agent that won&amp;#x2019;t give up the $100. Of course, in the particular case at hand you will lose but rational agents often lose in specific cases (as, for example, when such an agent loses a rational bet). It could be argued that what a rational agent should not do is be the type of agent that loses. Given that agents that refuse to give up the $100 are the type of agent that loses, there seem to be grounds to claim that counterfactual mugging is another case where both CDT and EDT act inappropriately.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4 id=&quot;prisoners-dilemma&quot;&gt;&lt;a href=&quot;#prisoners-dilemma&quot;&gt;11.1.10. Prisoner's dilemma&lt;/a&gt;&lt;/h4&gt;
&lt;p&gt;Before moving on to a more detailed discussion of various possible decision theories, I&amp;#x2019;ll consider one final scenario: the &lt;em&gt;prisoner&amp;#x2019;s dilemma&lt;/em&gt;. &lt;a href=&quot;http://www.amazon.com/Choices-An-Introduction-Decision-Theory/dp/0816614407/&quot;&gt;Resnik (1987, pp. 147-148 )&lt;/a&gt; outlines this scenario as follows:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Two prisoners...have been arrested for vandalism and have been isolated from each other. There is sufficient evidence to convict them on the charge for which they have been arrested, but the prosecutor is after bigger game. He thinks that they robbed a bank together and that he can get them to confess to it. He summons each separately to an interrogation room and speaks to each as follows: &quot;I am going to offer the same deal to your partner, and I will give you each an hour to think it over before I call you back. This is it: If one of you confesses to the bank robbery and the other does not, I will see to it that the confessor gets a one-year term and that the other guy gets a twenty-five year term. If you both confess, then it's ten years apiece. If neither of you confesses, then I can only get two years apiece on the vandalism charge...&quot;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;The decision matrix of each vandal will be as follows:&lt;/p&gt;
&lt;table cellpadding=&quot;3&quot; cellspacing=&quot;5&quot; border=&quot;0&quot;&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&amp;#xA0;&lt;/td&gt;
&lt;td&gt;&lt;em&gt;Partner confesses&lt;/em&gt;&lt;/td&gt;
&lt;td&gt;&lt;em&gt;Partner lies&lt;/em&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;em&gt;Confess&lt;/em&gt;&lt;/td&gt;
&lt;td&gt;10 years in jail&lt;/td&gt;
&lt;td&gt;1 year in jail&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;em&gt;Lie&lt;/em&gt;&lt;/td&gt;
&lt;td&gt;25 years in jail&lt;/td&gt;
&lt;td&gt;2 years in jail&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;
&lt;p&gt;Faced with this scenario, a CDT agent will confess. After all, the agent&amp;#x2019;s decision can&amp;#x2019;t influence their partner&amp;#x2019;s decision (they&amp;#x2019;ve been isolated from one another) and so the agent is better off confessing regardless of what their partner chooses to do. According to the majority of decision (and game) theorists, confessing is in fact the rational decision in this case.&lt;/p&gt;
&lt;p&gt;Despite this, however, an EDT agent may lie in a prisoner&amp;#x2019;s dilemma. Specifically, if they think that their partner is similar enough to them, the agent will lie because doing so will provide the good news that they will both lie and hence that they will both get two years in jail (good news as compared with the bad news that they will both confess and hence that they will get 10 years in jail).&lt;/p&gt;
&lt;p&gt;To many people, there seems to be something compelling about this line of reasoning. For example, &lt;a href=&quot;http://www.amazon.com/Metamagical-Themas-Questing-Essence-Pattern/dp/0465045669&quot;&gt;Douglas Hofstadter (1985, pp. 737-780)&lt;/a&gt; has argued that an agent acting &amp;#x201C;superrationally&amp;#x201D; would co-operate with other superrational agents for precisely this sort of reason: a superrational agent would take into account the fact that other such agents will go through the same thought process in the &lt;em&gt;prisoner&amp;#x2019;s dilemma&lt;/em&gt; and so make the same decision. As such, it is better that that the decision that both agents reach be to lie than that it be to confess. More broadly, it could perhaps be argued that a rational agent should lie in the &lt;em&gt;prisoner&amp;#x2019;s dilemma&lt;/em&gt; as long as they believe that they are similar enough to their partner that they are likely to reach the same decision.&lt;/p&gt;
&lt;div class=&quot;figure&quot;&gt;&lt;img src=&quot;http://i.imgur.com/fPUcm.jpg&quot; alt=&quot;An argument for cooperation in the prisoners&amp;#x2019; dilemma&quot;&gt;
&lt;p class=&quot;caption&quot;&gt;An argument for cooperation in the prisoners&amp;#x2019; dilemma&lt;/p&gt;
&lt;/div&gt;
&lt;p&gt;It is unclear, then, precisely what should be concluded from the prisoner&amp;#x2019;s dilemma. However, for those that are sympathetic to Hofstadter&amp;#x2019;s point or the line of reasoning appealed to by the EDT agent, the scenario seems to provide an additional reason to seek out an alternative theory to CDT.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h3 id=&quot;benchmark-theory-bt&quot;&gt;&lt;a href=&quot;#benchmark-theory-bt&quot;&gt;11.2. Benchmark theory (BT)&lt;/a&gt;&lt;/h3&gt;
&lt;p&gt;One recent response to the apparent failure of EDT to decide appropriately in medical Newcomb problems and CDT to decide appropriately in the psychopath button is Benchmark Theory (BT) which was developed in &lt;a href=&quot;http://www.springerlink.com/content/a66107137n821610/?MUD=MP&quot;&gt;Wedgwood (2011)&lt;/a&gt; and discussed further in &lt;a href=&quot;http://philreview.dukejournals.org/content/119/1/1.abstract&quot;&gt;Briggs (2010)&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;In English, we could think of this decision algorithm as saying that agents should decide so as to give their future self good news about how well off they are compared to how well off they could have been. In formal terms, BT uses the following formula to calculate the expected utility of an act, A:&lt;/p&gt;
&lt;p&gt;&lt;img src=&quot;http://i.imgur.com/fUjmj.gif&quot; alt=&quot;BT expected value formula&quot;&gt;.&lt;/p&gt;
&lt;p&gt;In other words, it uses the conditional probability, as in EDT but calculates the value differently (as indicated by the use of V&amp;#x2019; rather than V). V&amp;#x2019; is calculated relative to a benchmark value in order to give a comparative measure of value (both of the above sources go into more detail about this process).&lt;/p&gt;
&lt;p&gt;Taking the informal perspective, in the &lt;em&gt;chewing gum problem&lt;/em&gt;, BT will note that by chewing gum, the agent will always get the good news that they are comparatively better off than they could have been (because chewing gum helps control throat abscesses) whereas by not chewing, the agent will always get the bad news that they could have been comparatively better off by chewing. As such, a BT agent will chew in this scenario.&lt;/p&gt;
&lt;p&gt;Further, BT seems to reach what many consider to be the right decision in the &lt;em&gt;psychopath button&lt;/em&gt;. In this case, the BT agent will note that if they push the button they will get the bad news that they are almost certainly a psychopath and so that they would have been comparatively much better off by not pushing (as pushing will kill them). On the other hand, if they don&amp;#x2019;t push they will get the less bad news that they are almost certainly not a psychopath and so could have been comparatively a little better off it they had pushed the button (as this would have killed all the psychopaths but not them). So refraining from pushing the button gives the less bad news and so is the rational decision.&lt;/p&gt;
&lt;p&gt;On the face of it, then, there seem to be strong reasons to find BT compelling: it decides appropriately in these scenarios while, according to some people, EDT and CDT only decide appropriately in one or the other of them.&lt;/p&gt;
&lt;p&gt;Unfortunately, a BT agent will fail to decide appropriately in other scenarios. First, those that hold that one-boxing is the appropriate decision in Newcomb&amp;#x2019;s problem will immediately find a flaw in BT. After all, in this scenario two-boxing gives the good news that the agent did comparatively better than they could have done (because they gain the $1000 from Box A which is more than they would have received otherwise) while one-boxing brings the bad news that they did comparatively worse than they could have done (as they did not receive this money). As such, a BT agent will two-box in Newcomb&amp;#x2019;s problem.&lt;/p&gt;
&lt;p&gt;Further, &lt;a href=&quot;http://philreview.dukejournals.org/content/119/1/1.abstract&quot;&gt;Briggs (2010)&lt;/a&gt; argues, though &lt;a href=&quot;http://www.springerlink.com/content/a66107137n821610/?MUD=MP&quot;&gt;Wedgwood (2011)&lt;/a&gt; denies, that BT suffers from other problems. As such, even for those who support two-boxing in Newcomb&amp;#x2019;s problem, it could be argued that BT doesn&amp;#x2019;t represent an adequate theory of choice. It is unclear, then, whether BT is a desirable replacement to alternative theories.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h3 id=&quot;timeless-decision-theory-tdt&quot;&gt;&lt;a href=&quot;#timeless-decision-theory-tdt&quot;&gt;11.3. Timeless decision theory (TDT)&lt;/a&gt;&lt;/h3&gt;
&lt;p&gt;&lt;a href=&quot;http://intelligence.org/files/TDT.pdf&quot;&gt;Yudkowsky (2010)&lt;/a&gt; offers another decision algorithm, &lt;em&gt;timeless decision theory&lt;/em&gt; or TDT (see also &lt;a href=&quot;http://intelligence.org/files/Comparison.pdf&quot;&gt;Altair, 2013&lt;/a&gt;). Specifically, TDT is intended as an explicit response to the idea that a theory of rational choice should lead an agent to &amp;#x201C;win&amp;#x201D;. As such, it will appeal to those who think it is appropriate to one-box in Newcomb&amp;#x2019;s problem and chew in the chewing gum problem.&lt;/p&gt;
&lt;p&gt;In English, this algorithm can be approximated as saying that an agent ought to choose as if CDT were right but they were determining not their actual decision but rather the result of the abstract computation of which their decision is one concrete instance. Formalizing this decision algorithm would require a substantial document in its own right and so will not be carried out in full here. Briefly, however, TDT is built on top of causal Bayesian networks &lt;a href=&quot;http://www.amazon.com/Causality-Reasoning-Inference-Judea-Pearl/dp/0521773628&quot;&gt;(Pearl, 2000)&lt;/a&gt; which are graphs where the arrows represent causal influence. TDT supplements these graphs by adding nodes representing abstract computations and taking the abstract computation that determines an agent&amp;#x2019;s decision to be the object of choice rather than the concrete decision itself (see &lt;a href=&quot;http://intelligence.org/files/TDT.pdf&quot;&gt;Yudkowsky, 2010&lt;/a&gt; for a more detailed description).&lt;/p&gt;
&lt;p&gt;Returning to an informal discussion, an example will help clarify the form taken by TDT: imagine that two perfect replicas of a person are placed in identical rooms and asked to make the same decision. While each replica will make their own decision, in doing so, they will be carrying out the same computational process. As such, TDT will say that the replicas ought to act as if they are determining the result of this process and hence as if they are deciding the behavior of both copies.&lt;/p&gt;
&lt;p&gt;Something similar can be said about Newcomb&amp;#x2019;s problem. In this case it is almost like there is again a replica of the agent: Omega&amp;#x2019;s model of the agent that it used to predict the agent&amp;#x2019;s behavior. Both the original agent and this &amp;#x201C;replica&amp;#x201D; responds to the same abstract computational process as one another. In other words, both Omega&amp;#x2019;s prediction and the agent&amp;#x2019;s behavior are influenced by this process. As such, TDT advises the agent to act as if they are determining the result of this process and, hence, as if they can determine Omega&amp;#x2019;s box filling behavior. As such, a TDT agent will one-box in order to determine the result of this abstract computation in a way that leads to $1 million being placed in Box B.&lt;/p&gt;
&lt;p&gt;TDT also succeeds in other areas. For example, in the chewing gum problem there is no &amp;#x201C;replica&amp;#x201D; agent so TDT will decide in line with standard CDT and choose to chew gum. Further, in the prisoner&amp;#x2019;s dilemma, a TDT agent will lie if its partner is another TDT agent (or a relevantly similar agent). After all, in this case both agents will carry out the same computational process and so TDT will advise that the agent act as if they are determining this process and hence simultaneously determining both their own and their partner&amp;#x2019;s decision. If so then it is better for the agent that both of them lie than that both of them confess.&lt;/p&gt;
&lt;p&gt;However, despite its success, TDT also &amp;#x201C;loses&amp;#x201D; in some decision scenarios. For example, in counterfactual mugging, a TDT agent will not choose to give up the $100. This might seem surprising. After all, as with Newcomb&amp;#x2019;s problem, this case involves Omega predicting the agent&amp;#x2019;s behavior and hence involves a &amp;#x201C;replica&amp;#x201D;. However, this case differs in that the agent knows that the coin came up heads and so knows that they have nothing to gain by giving up the money.&lt;/p&gt;
&lt;p&gt;For those who feel that a theory of rational choice should lead an agent to &amp;#x201C;win&amp;#x201D;, then, TDT seems like a step in the right direction but further work is required if it is to &amp;#x201C;win&amp;#x201D; in the full range of decision scenarios.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h3 id=&quot;decision-theory-and-winning&quot;&gt;&lt;a href=&quot;#decision-theory-and-winning&quot;&gt;11.4. Decision theory and &amp;#x201C;winning&amp;#x201D;&lt;/a&gt;&lt;/h3&gt;
&lt;p&gt;In the previous section, I discussed TDT, a decision algorithm that could be advanced as replacements for CDT and EDT. One of the primary motivations for developing TDT is a sense that both CDT and EDT fail to reason in a desirable manner in some decision scenarios. However, despite acknowledging that CDT agents end up worse off in Newcomb's Problem, many (and perhaps the majority of) decision theorists are proponents of CDT. On the face of it, this may seem to suggest that these decision theorists aren't interested in developing a decision algorithm that &quot;wins&quot; but rather have some other aim in mind. If so then this might lead us to question the value of developing one-boxing decision algorithms.&lt;/p&gt;
&lt;p&gt;However, the claim that most decision theorists don&amp;#x2019;t care about finding an algorithm that &amp;#x201C;wins&amp;#x201D; mischaracterizes their position. After all, proponents of CDT tend to take the challenge posed by the fact that CDT agents &amp;#x201C;lose&amp;#x201D; in Newcomb's problem seriously (in the philosophical literature, it's often referred to as the &lt;em&gt;Why ain'cha rich?&lt;/em&gt; problem). A common reaction to this challenge is neatly summarized in &lt;a href=&quot;http://www.amazon.com/Foundations-Decision-Cambridge-Probability-Induction/dp/0521641640&quot;&gt;Joyce (1999, p. 153-154 )&lt;/a&gt; as a response to a hypothetical question about why, if two-boxing is rational, the CDT agent does not end up as rich as an agent that one-boxes:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Rachel has a perfectly good answer to the &quot;Why ain't you rich?&quot; question. &quot;I am not rich,&quot; she will say, &quot;because I am not the kind of person [Omega] thinks will refuse the money. I'm just not like you, Irene [the one-boxer]. Given that I know that I am the type who takes the money, and given that [Omega] knows that I am this type, it was reasonable of me to think that the $1,000,000 was not in [the box]. The $1,000 was the most I was going to get no matter what I did. So the only reasonable thing for me to do was to take it.&quot;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;blockquote&gt;
&lt;p&gt;Irene may want to press the point here by asking, &quot;But don't you wish you were like me, Rachel?&quot;... Rachel can and should admit that she &lt;em&gt;does&lt;/em&gt; wish she were more like Irene... At this point, Irene will exclaim, &quot;You've admitted it! It wasn't so smart to take the money after all.&quot; Unfortunately for Irene, her conclusion does not follow from Rachel's premise. Rachel will patiently explain that wishing to be a [one-boxer] in a Newcomb problem is not inconsistent with thinking that one should take the $1,000 &lt;em&gt;whatever type one is&lt;/em&gt;. When Rachel wishes she was Irene's type she is wishing for &lt;em&gt;Irene's options&lt;/em&gt;, not sanctioning her choice... While a person who knows she will face (has faced) a Newcomb problem might wish that she were (had been) the type that [Omega] labels a [one-boxer], this wish does not provide a reason for &lt;em&gt;being&lt;/em&gt; a [one-boxer]. It might provide a reason to try (before [the boxes are filled]) to change her type &lt;em&gt;if she thinks this might affect [Omega's] prediction&lt;/em&gt;, but it gives her no reason for doing anything other than taking the money once she comes to believes that she will be unable to influence what [Omega] does.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;In other words, this response distinguishes between the &lt;em&gt;winning decision&lt;/em&gt; and the &lt;em&gt;winning type of agent&lt;/em&gt; and claims that two-boxing is the winning decision in Newcomb&amp;#x2019;s problem (even if one-boxers are the winning type of agent). Consequently, insofar as decision theory is about determining which &lt;em&gt;decision&lt;/em&gt; is rational, on this account CDT reasons correctly in Newcomb&amp;#x2019;s problem.&lt;/p&gt;
&lt;p&gt;For those that find this response perplexing, an analogy could be drawn to the &lt;em&gt;chewing gum problem&lt;/em&gt;. In this scenario, there is near unanimous agreement that the rational decision is to chew gum. However, statistically, non-chewers will be better off than chewers. As such, the non-chewer could ask, &amp;#x201C;if you&amp;#x2019;re so smart, why aren&amp;#x2019;t you healthy?&amp;#x201D; In this case, the above response seems particularly appropriate. The chewers are less healthy not because of their decision but rather because they&amp;#x2019;re more likely to have an undesirable gene. Having good genes doesn&amp;#x2019;t make the non-chewer more rational but simply more lucky. The proponent of CDT simply makes a similar response to Newcomb&amp;#x2019;s problem: one-boxers aren&amp;#x2019;t richer because of their decision but rather because of the type of agent that they were when the boxes were filled.&lt;/p&gt;
&lt;p&gt;One final point about this response is worth noting. A proponent of CDT can accept the above argument but still acknowledge that, if given the choice before the boxes are filled, they would be rational to choose to modify themselves to be a one-boxing type of agent (as Joyce acknowledged in the above passage and as argued for in &lt;a href=&quot;http://www.jstor.org/stable/20118389&quot;&gt;Burgess, 2004&lt;/a&gt;). To the proponent of CDT, this is unproblematic: if we are sometimes rewarded not for the rationality of our decisions in the moment but for the type of agent we were at some past moment then it should be unsurprising that changing to a different type of agent might be beneficial.&lt;/p&gt;
&lt;p&gt;The response to this defense of two-boxing in Newcomb&amp;#x2019;s problem has been divided. Many find it compelling but others, like &lt;a href=&quot;http://link.springer.com/article/10.1007%2Fs10670-011-9355-2&quot;&gt;Ahmed and Price (2012)&lt;/a&gt; think it does not adequately address to the challenge:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;It is no use the causalist's whining that foreseeably, Newcomb problems do in fact reward irrationality, or rather CDT-irrationality. The point of the argument is that if everyone knows that the CDT-irrational strategy will in fact do better on average than the CDT-rational strategy, then it's rational to play the CDT-irrational strategy.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Given this, there seem to be two positions one could take on these issues. If the response given by the proponent of CDT is compelling, then we should be attempting to develop a decision theory that two-boxes on Newcomb&amp;#x2019;s problem. Perhaps the best theory for this role is CDT but perhaps it is instead BT, which many people think reasons better in the psychopath button scenario. On the other hand, if the response given by the proponents of CDT is not compelling, then we should be developing a theory that one-boxes in Newcomb&amp;#x2019;s problem. In this case, TDT, or something like it, seems like the most promising theory currently on offer.&lt;/p&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/lw/gu1/decision_theory_faq/#comments"&gt;455 comments&lt;/a&gt;
</description>
</item>
<item>
<title>New report: Intelligence Explosion Microeconomics</title>
<link>http://lesswrong.com/lw/hbd/new_report_intelligence_explosion_microeconomics/</link>
<guid isPermaLink="true">http://lesswrong.com/lw/hbd/new_report_intelligence_explosion_microeconomics/</guid>
<pubDate>Tue, 30 Apr 2013 09:14:58 +1000</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/Eliezer_Yudkowsky"&gt;Eliezer_Yudkowsky&lt;/a&gt;
&amp;bull;
43 votes
&amp;bull;
&lt;a href="http://lesswrong.com/lw/hbd/new_report_intelligence_explosion_microeconomics/#comments"&gt;220 comments&lt;/a&gt;
&lt;div&gt;&lt;p&gt;&lt;strong&gt;Summary&lt;/strong&gt;:&amp;#xA0;&lt;a href=&quot;http://intelligence.org/files/IEM.pdf&quot;&gt;Intelligence Explosion Microeconomics&lt;/a&gt;&amp;#xA0;(pdf) is 40,000 words&amp;#xA0;taking some initial steps toward tackling the key quantitative issue in the intelligence explosion, &quot;reinvestable returns on cognitive investments&quot;: what kind of returns can you get from an investment in cognition, can you reinvest it to make yourself even smarter, and does this process die out or blow up? This can be thought of as the compact and hopefully more coherent successor to the &lt;a href=&quot;http://wiki.lesswrong.com/wiki/The_Hanson-Yudkowsky_AI-Foom_Debate&quot;&gt;AI Foom Debate&lt;/a&gt; of a few years back.&lt;/p&gt;
&lt;p&gt;(Sample idea you haven't heard before: &amp;#xA0;The increase in hominid brain size over evolutionary time should be interpreted as evidence about increasing marginal fitness returns on brain size, presumably due to improved brain wiring algorithms; not as direct evidence about an intelligence scaling factor from brain size.)&lt;/p&gt;
&lt;p&gt;I hope that the open problems posed therein inspire further work by economists or economically literate modelers, interested specifically in the intelligence explosion &lt;em&gt;qua&lt;/em&gt;&amp;#xA0;cognitive intelligence rather than non-cognitive&amp;#xA0;'technological acceleration'. &amp;#xA0;MIRI has an intended-to-be-small-and-technical mailing list for such discussion. &amp;#xA0;In case it's not clear from context, I (Yudkowsky) am the author of the paper.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Abstract:&lt;/strong&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p style=&quot;padding-left: 30px;&quot;&gt;I. J. Good's thesis of the 'intelligence explosion' is that a sufficiently advanced machine intelligence could build a smarter version of itself, which could in turn build an even smarter version of itself, and that this process could continue enough to vastly exceed human intelligence. &amp;#xA0;As Sandberg (2010) correctly notes, there are several attempts to lay down return-on-investment formulas intended to represent sharp speedups in economic or technological growth, but very little attempt has been made to deal formally with I. J. Good's intelligence explosion thesis as such.&lt;/p&gt;
&lt;p style=&quot;padding-left: 30px;&quot;&gt;I identify the key issue as &lt;em&gt;returns on cognitive reinvestment&lt;/em&gt; - the ability to invest more computing power, faster computers, or improved cognitive algorithms to yield cognitive labor which produces larger brains, faster brains, or better mind designs. &amp;#xA0;There are many phenomena in the world which have been argued as evidentially relevant to this question, from the observed course of hominid evolution, to Moore's Law, to the competence over time of machine chess-playing systems, and many more. &amp;#xA0;I go into some depth on the sort of debates which then arise on how to interpret such evidence. &amp;#xA0;I propose that the next step forward in analyzing positions on the intelligence explosion would be to formalize return-on-investment curves, so that each stance can say formally which possible microfoundations they hold to be falsified by historical observations already made. &amp;#xA0;More generally, I pose multiple open questions of 'returns on cognitive reinvestment' or 'intelligence explosion microeconomics'. &amp;#xA0;Although such questions have received little attention thus far, they seem highly relevant to policy choices affecting the outcomes for Earth-originating intelligent life.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;The &lt;a href=&quot;https://docs.google.com/forms/d/1KElE2Zt_XQRqj8vWrc_rG89nrO4JtHWxIFldJ3IY_FQ/viewform&quot;&gt;&lt;strong&gt;dedicated mailing list&lt;/strong&gt;&lt;/a&gt;&amp;#xA0;will be small and restricted to technical discussants.&lt;a id=&quot;more&quot;&gt;&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;This topic was originally intended to be a sequence in&amp;#xA0;&lt;em&gt;Open Problems in Friendly AI,&lt;/em&gt;&amp;#xA0;but further work produced something compacted beyond where it could be easily broken up into subposts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Outline of contents:&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;1&lt;/strong&gt;: &amp;#xA0;Introduces the basic questions and the key quantitative issue of sustained reinvestable returns on cognitive investments.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;2&lt;/strong&gt;: &amp;#xA0;Discusses the basic language for talking about the intelligence explosion, and argues that we should pursue this project by looking for underlying microfoundations, not by pursuing analogies to allegedly similar historical events.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;3&lt;/strong&gt;: &amp;#xA0;Goes into detail on what I see as the main arguments for a fast intelligence explosion, constituting the bulk of the paper with the following subsections:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;3.1&lt;/strong&gt;:&lt;span style=&quot;white-space: pre;&quot;&gt; &lt;/span&gt;What the fossil record actually tells us about returns on brain size, given that most of the difference between Homo sapiens and Australopithecus was probably improved software.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;3.2&lt;/strong&gt;:&lt;span style=&quot;white-space: pre;&quot;&gt; &lt;/span&gt;How to divide credit for the human-chimpanzee performance gap between &quot;humans are individually smarter than chimpanzees&quot; and &quot;the hominid transition involved a one-time qualitative gain from being able to accumulate knowledge&quot;.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;3.3&lt;/strong&gt;:&lt;span style=&quot;white-space: pre;&quot;&gt; &lt;/span&gt;How returns on speed (serial causal depth) contrast with returns from parallelism; how faster thought seems to contrast with more thought. &amp;#xA0;Whether sensing and manipulating technologies are likely to present a bottleneck for faster thinkers, or how large of a bottleneck.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;3.4&lt;/strong&gt;:&amp;#xA0;&lt;span style=&quot;white-space: pre;&quot;&gt; &lt;/span&gt;How human populations seem to scale in problem-solving power; some reasons to believe that we scale inefficiently enough for it to be puzzling. &amp;#xA0;Garry Kasparov's chess match vs. The World, which Kasparov won.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;3.5&lt;/strong&gt;:&amp;#xA0;&lt;span style=&quot;white-space: pre;&quot;&gt; &lt;/span&gt;Some inefficiencies that might cumulate in an estimate of humanity's net computational efficiency on a cognitive problem.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;3.6&lt;/strong&gt;:&amp;#xA0;&lt;span style=&quot;white-space: pre;&quot;&gt; &lt;/span&gt;What the anthropological record actually tells us about cognitive returns on cumulative selection pressure, given that selection pressures were probably increasing over the course of hominid history. &amp;#xA0;How the observed history would be expected to look different, if there were in fact diminishing returns on cognition.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;3.7&lt;/strong&gt;:&amp;#xA0;&lt;span style=&quot;white-space: pre;&quot;&gt; &lt;/span&gt;How to relate the curves for evolutionary difficulty, human-engineering difficulty, and AI-engineering difficulty, considering that they are almost certainly different.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;3.8&lt;/strong&gt;:&amp;#xA0;&lt;span style=&quot;white-space: pre;&quot;&gt; &lt;/span&gt;Correcting for anthropic bias in trying to estimate the intrinsic 'difficulty 'of hominid-level intelligence just from observing that intelligence evolved here on Earth.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;3.9&lt;/strong&gt;:&amp;#xA0;&lt;span style=&quot;white-space: pre;&quot;&gt; &lt;/span&gt;The question of whether to expect a 'local' (one-project) FOOM or 'global' (whole economy) FOOM and how returns on cognitive reinvestment interact with that.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;3.10&lt;/strong&gt;:&amp;#xA0;&lt;span style=&quot;white-space: pre;&quot;&gt; &lt;/span&gt;The great open uncertainty about the minimal conditions for starting a FOOM; why I. J. Good's postulate of starting from 'ultraintelligence' is probably much too strong (sufficient, but very far above what is necessary).&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;3.11&lt;/strong&gt;:&amp;#xA0;&lt;span style=&quot;white-space: pre;&quot;&gt; &lt;/span&gt;The enhanced probability of unknown unknowns in the scenario, since a smarter-than-human intelligence will selectively seek out and exploit flaws or gaps in our current knowledge.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;4&lt;/strong&gt;: &amp;#xA0;A tentative methodology for formalizing theories of the intelligence explosion - a project of formalizing possible microfoundations and explicitly stating their alleged relation to historical experience, such that some possibilities can allegedly be falsified.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;5&lt;/strong&gt;: &amp;#xA0;Which open sub-questions seem both high-value and possibly answerable.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;6&lt;/strong&gt;: &amp;#xA0;Formally poses the Open Problem and mentions what it would take for MIRI itself to directly fund further work in this field.&lt;/p&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/lw/hbd/new_report_intelligence_explosion_microeconomics/#comments"&gt;220 comments&lt;/a&gt;
</description>
</item>
<item>
<title>We Don't Have a Utility Function</title>
<link>http://lesswrong.com/lw/h45/we_dont_have_a_utility_function/</link>
<guid isPermaLink="true">http://lesswrong.com/lw/h45/we_dont_have_a_utility_function/</guid>
<pubDate>Tue, 02 Apr 2013 03:49:10 +0000</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/nyan_sandwich"&gt;nyan_sandwich&lt;/a&gt;
&amp;bull;
42 votes
&amp;bull;
&lt;a href="http://lesswrong.com/lw/h45/we_dont_have_a_utility_function/#comments"&gt;123 comments&lt;/a&gt;
&lt;div&gt;&lt;p&gt;&lt;strong&gt;Related:&lt;/strong&gt; &lt;a href=&quot;/lw/ggm/pinpointing_utility/&quot;&gt;Pinpointing Utility&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;If I ever say &quot;my utility function&quot;, you could reasonably accuse me of &lt;a href=&quot;https://en.wikipedia.org/wiki/Cargo_cult_science&quot;&gt;cargo-cult&lt;/a&gt; rationality; trying to become more rational by superficially immitating the abstract rationalists we study makes about as much sense as building an air traffic control station out of grass to summon cargo planes.&lt;/p&gt;
&lt;p&gt;There are two ways an agent could be said to have a utility function:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;It could behave in accordance with the VNM axioms; always choosing in a sane and consistent manner, such that &quot;there exists a U&quot;. The agent need not have an explicit representation of U.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;It could have an explicit utility function that it tries to expected-maximize. The agent need not perfectly follow the VNM axioms all the time. (Real bounded decision systems will take shortcuts for efficiency and may not achieve perfect rationality, like how real floating point arithmetic isn't associative).&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;Neither of these is true of humans. Our behaviour and preferences are not consistent and sane enough to be VNM, and we are generally quite confused about what we even want, never mind having reduced it to a utility function. Nevertheless, you still see the occasional reference to &quot;my utility function&quot;.&lt;/p&gt;
&lt;p&gt;Sometimes &quot;my&quot; refers to &quot;abstract me who has solved moral philosophy and or become perfectly rational&quot;, which at least doesn't run afoul of the math, but is probably still wrong about the particulars of what such an abstract idealized self would actually want. But other times it's a more glaring error like using &quot;utility function&quot; as shorthand for &quot;entire self-reflective moral system&quot;, which may not even be VNMish.&lt;/p&gt;
&lt;p&gt;But this post isn't really about all the ways people misuse terminology, it's about where we're actually at on the whole problem for which a utility function might be the solution.&lt;/p&gt;
&lt;p&gt;As above, I don't think any of us have a utility function in either sense; we are not VNM, and we haven't worked out what we want enough to make a convincing attempt at trying. Maybe someone out there has a utility function in the second sense, but I doubt that it actually represents what they would want.&lt;/p&gt;
&lt;p&gt;Perhaps then we should speak of what we want in terms of &quot;terminal values&quot;? For example, I might say that it is a terminal value of mine that I should not murder, or that freedom from authority is good.&lt;/p&gt;
&lt;p&gt;But what does &quot;terminal value&quot; mean? Usually, it means that the value of something is not contingent on or derived from other facts or situations, like for example, I may value beautiful things in a way that is not derived from what they get me. The recursive chain of valuableness &lt;em&gt;terminates&lt;/em&gt; at some set of values.&lt;/p&gt;
&lt;p&gt;There's another connotation, though, which is that your terminal values are akin to &lt;em&gt;axioms&lt;/em&gt;; not subject to argument or evidence or derivation, and simply given, that there's no point in trying to reconcile them with people who don't share them. This is the meaning people are sometimes getting at when they explain failure to agree with someone as &quot;terminal value differences&quot; or &quot;different set of moral axioms&quot;. This is completely reasonable, if and only if that is in fact the nature of the beliefs in question.&lt;/p&gt;
&lt;p&gt;About two years ago, it very much felt like freedom from authority was a terminal value for me. Those hated authoritarians and fascists were simply &lt;em&gt;wrong&lt;/em&gt;, probably due to some fundamental neurological fault that could not be reasoned with. The very prototype of &quot;terminal value differences&quot;.&lt;/p&gt;
&lt;p&gt;And yet here I am today, having been reasoned out of that &quot;terminal value&quot;, such that I even appreciate a certain aesthetic in bowing to a strong leader.&lt;/p&gt;
&lt;p&gt;If that was a terminal value, I'm afraid the term has lost much of its meaning to me. If it was not, if even the most fundamental-seeming moral feelings are subject to argument, I wonder if there is any coherent sense in which I could be said to have terminal values at all.&lt;/p&gt;
&lt;p&gt;The situation here with &quot;terminal values&quot; is a lot like the situation with &quot;beliefs&quot; in other circles. Ask someone what they believe in most confidently, and they will take the opportunity to differentiate themselves from the opposing tribe on uncertain controversial issues; god exists, god does not exist, racial traits are genetic, race is a social construct. The pedant answer of course is that the sky is probably blue, and that that box over there is about a meter long.&lt;/p&gt;
&lt;p&gt;Likewise, ask someone for their terminal values, and they will take the opportunity to declare that those hated &lt;a href=&quot;/lw/gt/a_fable_of_science_and_politics/&quot;&gt;greens&lt;/a&gt; are utterly wrong on morality, and blueness is wired into their very core, rather than the obvious things like beauty and friendship being valuable, and paperclips not.&lt;/p&gt;
&lt;p&gt;So besides not having a utility function, those aren't your terminal values. I'd be suprised if even the most pedantic answer weren't subject to argument; I don't seem to have anything like a stable and non-negotiable value system at all, and I don't think that I am even especially confused relative to the rest of you.&lt;/p&gt;
&lt;p&gt;Instead of a nice consistent value system, we have a mess of intuitions and hueristics and beliefs that often contradict, fail to give an answer, and change with time and mood and memes. And that's all we have. One of the intuitions is that we want to fix this mess.&lt;/p&gt;
&lt;p&gt;People have tried to do this &quot;Moral Philosophy&quot; thing before, &lt;a href=&quot;/lw/9nm/terminal_bias/&quot;&gt;myself included&lt;/a&gt;, but it hasn't generally turned out well. We've made all kinds of overconfident leaps to what turn out to be unjustified conclusions (utilitarianism, egoism, hedonism, etc), or just ended up wallowing in confused despair.&lt;/p&gt;
&lt;p&gt;The zeroth step in solving a problem is to notice that we have a problem.&lt;/p&gt;
&lt;p&gt;The problem here, in my humble opinion, is that we have no idea what we are doing when we try to do Moral Philosophy. We need to go up a meta-level and get a handle on Moral MetaPhilosophy. What's the problem? What are the relevent knowns? What are the unknowns? What's the solution process?&lt;/p&gt;
&lt;p&gt;Ideally, we could do for Moral Philosphy approximately what Bayesian probability theory has done for Epistemology. My moral intuitions are a horrible mess, but so are my epistemic intuitions, and yet we more-or-less know what we are doing in epistemology. A problem like this has been solved before, and this one seems solvable too, if a bit harder.&lt;/p&gt;
&lt;p&gt;It might be that when we figure this problem out to the point where we can be said to have a consistent moral system with real terminal values, we will end up with a utility function, but on the other hand, we might not. Either way, let's keep in mind that we are still on rather shaky ground, and at least refrain from believing the confident declarations of moral wisdom that we so like to make.&lt;/p&gt;
&lt;p&gt;Moral Philosophy is an important problem, but the way is not clear yet.&lt;/p&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/lw/h45/we_dont_have_a_utility_function/#comments"&gt;123 comments&lt;/a&gt;
</description>
</item>
</channel>
</rss>