<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:media="http://search.yahoo.com/mrss/">
<channel>
<title>
Articles Tagged ‘economics’ - Less Wrong
</title> <link>http://lesswrong.com/</link>
<description></description>
<item>
<title>New report: Intelligence Explosion Microeconomics</title>
<link>http://lesswrong.com/lw/hbd/new_report_intelligence_explosion_microeconomics/</link>
<guid isPermaLink="true">http://lesswrong.com/lw/hbd/new_report_intelligence_explosion_microeconomics/</guid>
<pubDate>Tue, 30 Apr 2013 09:14:58 +1000</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/Eliezer_Yudkowsky"&gt;Eliezer_Yudkowsky&lt;/a&gt;
&amp;bull;
43 votes
&amp;bull;
&lt;a href="http://lesswrong.com/lw/hbd/new_report_intelligence_explosion_microeconomics/#comments"&gt;225 comments&lt;/a&gt;
&lt;div&gt;&lt;p&gt;&lt;strong&gt;Summary&lt;/strong&gt;:&amp;#xA0;&lt;a href=&quot;http://intelligence.org/files/IEM.pdf&quot;&gt;Intelligence Explosion Microeconomics&lt;/a&gt;&amp;#xA0;(pdf) is 40,000 words&amp;#xA0;taking some initial steps toward tackling the key quantitative issue in the intelligence explosion, &quot;reinvestable returns on cognitive investments&quot;: what kind of returns can you get from an investment in cognition, can you reinvest it to make yourself even smarter, and does this process die out or blow up? This can be thought of as the compact and hopefully more coherent successor to the &lt;a href=&quot;http://wiki.lesswrong.com/wiki/The_Hanson-Yudkowsky_AI-Foom_Debate&quot;&gt;AI Foom Debate&lt;/a&gt; of a few years back.&lt;/p&gt;
&lt;p&gt;(Sample idea you haven't heard before: &amp;#xA0;The increase in hominid brain size over evolutionary time should be interpreted as evidence about increasing marginal fitness returns on brain size, presumably due to improved brain wiring algorithms; not as direct evidence about an intelligence scaling factor from brain size.)&lt;/p&gt;
&lt;p&gt;I hope that the open problems posed therein inspire further work by economists or economically literate modelers, interested specifically in the intelligence explosion &lt;em&gt;qua&lt;/em&gt;&amp;#xA0;cognitive intelligence rather than non-cognitive&amp;#xA0;'technological acceleration'. &amp;#xA0;MIRI has an intended-to-be-small-and-technical mailing list for such discussion. &amp;#xA0;In case it's not clear from context, I (Yudkowsky) am the author of the paper.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Abstract:&lt;/strong&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p style=&quot;padding-left: 30px;&quot;&gt;I. J. Good's thesis of the 'intelligence explosion' is that a sufficiently advanced machine intelligence could build a smarter version of itself, which could in turn build an even smarter version of itself, and that this process could continue enough to vastly exceed human intelligence. &amp;#xA0;As Sandberg (2010) correctly notes, there are several attempts to lay down return-on-investment formulas intended to represent sharp speedups in economic or technological growth, but very little attempt has been made to deal formally with I. J. Good's intelligence explosion thesis as such.&lt;/p&gt;
&lt;p style=&quot;padding-left: 30px;&quot;&gt;I identify the key issue as &lt;em&gt;returns on cognitive reinvestment&lt;/em&gt; - the ability to invest more computing power, faster computers, or improved cognitive algorithms to yield cognitive labor which produces larger brains, faster brains, or better mind designs. &amp;#xA0;There are many phenomena in the world which have been argued as evidentially relevant to this question, from the observed course of hominid evolution, to Moore's Law, to the competence over time of machine chess-playing systems, and many more. &amp;#xA0;I go into some depth on the sort of debates which then arise on how to interpret such evidence. &amp;#xA0;I propose that the next step forward in analyzing positions on the intelligence explosion would be to formalize return-on-investment curves, so that each stance can say formally which possible microfoundations they hold to be falsified by historical observations already made. &amp;#xA0;More generally, I pose multiple open questions of 'returns on cognitive reinvestment' or 'intelligence explosion microeconomics'. &amp;#xA0;Although such questions have received little attention thus far, they seem highly relevant to policy choices affecting the outcomes for Earth-originating intelligent life.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;The &lt;a href=&quot;https://docs.google.com/forms/d/1KElE2Zt_XQRqj8vWrc_rG89nrO4JtHWxIFldJ3IY_FQ/viewform&quot;&gt;&lt;strong&gt;dedicated mailing list&lt;/strong&gt;&lt;/a&gt;&amp;#xA0;will be small and restricted to technical discussants.&lt;a id=&quot;more&quot;&gt;&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;This topic was originally intended to be a sequence in&amp;#xA0;&lt;em&gt;Open Problems in Friendly AI,&lt;/em&gt;&amp;#xA0;but further work produced something compacted beyond where it could be easily broken up into subposts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Outline of contents:&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;1&lt;/strong&gt;: &amp;#xA0;Introduces the basic questions and the key quantitative issue of sustained reinvestable returns on cognitive investments.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;2&lt;/strong&gt;: &amp;#xA0;Discusses the basic language for talking about the intelligence explosion, and argues that we should pursue this project by looking for underlying microfoundations, not by pursuing analogies to allegedly similar historical events.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;3&lt;/strong&gt;: &amp;#xA0;Goes into detail on what I see as the main arguments for a fast intelligence explosion, constituting the bulk of the paper with the following subsections:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;3.1&lt;/strong&gt;:&lt;span style=&quot;white-space: pre;&quot;&gt; &lt;/span&gt;What the fossil record actually tells us about returns on brain size, given that most of the difference between Homo sapiens and Australopithecus was probably improved software.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;3.2&lt;/strong&gt;:&lt;span style=&quot;white-space: pre;&quot;&gt; &lt;/span&gt;How to divide credit for the human-chimpanzee performance gap between &quot;humans are individually smarter than chimpanzees&quot; and &quot;the hominid transition involved a one-time qualitative gain from being able to accumulate knowledge&quot;.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;3.3&lt;/strong&gt;:&lt;span style=&quot;white-space: pre;&quot;&gt; &lt;/span&gt;How returns on speed (serial causal depth) contrast with returns from parallelism; how faster thought seems to contrast with more thought. &amp;#xA0;Whether sensing and manipulating technologies are likely to present a bottleneck for faster thinkers, or how large of a bottleneck.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;3.4&lt;/strong&gt;:&amp;#xA0;&lt;span style=&quot;white-space: pre;&quot;&gt; &lt;/span&gt;How human populations seem to scale in problem-solving power; some reasons to believe that we scale inefficiently enough for it to be puzzling. &amp;#xA0;Garry Kasparov's chess match vs. The World, which Kasparov won.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;3.5&lt;/strong&gt;:&amp;#xA0;&lt;span style=&quot;white-space: pre;&quot;&gt; &lt;/span&gt;Some inefficiencies that might cumulate in an estimate of humanity's net computational efficiency on a cognitive problem.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;3.6&lt;/strong&gt;:&amp;#xA0;&lt;span style=&quot;white-space: pre;&quot;&gt; &lt;/span&gt;What the anthropological record actually tells us about cognitive returns on cumulative selection pressure, given that selection pressures were probably increasing over the course of hominid history. &amp;#xA0;How the observed history would be expected to look different, if there were in fact diminishing returns on cognition.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;3.7&lt;/strong&gt;:&amp;#xA0;&lt;span style=&quot;white-space: pre;&quot;&gt; &lt;/span&gt;How to relate the curves for evolutionary difficulty, human-engineering difficulty, and AI-engineering difficulty, considering that they are almost certainly different.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;3.8&lt;/strong&gt;:&amp;#xA0;&lt;span style=&quot;white-space: pre;&quot;&gt; &lt;/span&gt;Correcting for anthropic bias in trying to estimate the intrinsic 'difficulty 'of hominid-level intelligence just from observing that intelligence evolved here on Earth.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;3.9&lt;/strong&gt;:&amp;#xA0;&lt;span style=&quot;white-space: pre;&quot;&gt; &lt;/span&gt;The question of whether to expect a 'local' (one-project) FOOM or 'global' (whole economy) FOOM and how returns on cognitive reinvestment interact with that.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;3.10&lt;/strong&gt;:&amp;#xA0;&lt;span style=&quot;white-space: pre;&quot;&gt; &lt;/span&gt;The great open uncertainty about the minimal conditions for starting a FOOM; why I. J. Good's postulate of starting from 'ultraintelligence' is probably much too strong (sufficient, but very far above what is necessary).&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;3.11&lt;/strong&gt;:&amp;#xA0;&lt;span style=&quot;white-space: pre;&quot;&gt; &lt;/span&gt;The enhanced probability of unknown unknowns in the scenario, since a smarter-than-human intelligence will selectively seek out and exploit flaws or gaps in our current knowledge.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;4&lt;/strong&gt;: &amp;#xA0;A tentative methodology for formalizing theories of the intelligence explosion - a project of formalizing possible microfoundations and explicitly stating their alleged relation to historical experience, such that some possibilities can allegedly be falsified.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;5&lt;/strong&gt;: &amp;#xA0;Which open sub-questions seem both high-value and possibly answerable.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;6&lt;/strong&gt;: &amp;#xA0;Formally poses the Open Problem and mentions what it would take for MIRI itself to directly fund further work in this field.&lt;/p&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/lw/hbd/new_report_intelligence_explosion_microeconomics/#comments"&gt;225 comments&lt;/a&gt;
</description>
</item>
<item>
<title>Is Sunk Cost Fallacy a Fallacy?</title>
<link>http://lesswrong.com/lw/9si/is_sunk_cost_fallacy_a_fallacy/</link>
<guid isPermaLink="true">http://lesswrong.com/lw/9si/is_sunk_cost_fallacy_a_fallacy/</guid>
<pubDate>Sat, 04 Feb 2012 15:33:49 +1100</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/gwern"&gt;gwern&lt;/a&gt;
&amp;bull;
16 votes
&amp;bull;
&lt;a href="http://lesswrong.com/lw/9si/is_sunk_cost_fallacy_a_fallacy/#comments"&gt;52 comments&lt;/a&gt;
&lt;div&gt;&lt;p&gt;I just finished the first draft of my essay, &lt;a href=&quot;http://www.gwern.net/Sunk%20cost&quot;&gt;&quot;Are Sunk Costs Fallacies?&quot;&lt;/a&gt;; there is still material I need to go through, but the bulk of the material is now there. The formatting is too gnarly to post here, so I ask everyone's forgiveness in clicking through.&lt;/p&gt;
&lt;p&gt;To summarize:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;sunk costs are probably issues in big organizations
&lt;ul&gt;
&lt;li&gt;but maybe not ones that can be helped&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;sunk costs are not issues in animals&lt;/li&gt;
&lt;li&gt;they appear to be in children &amp;amp; adults
&lt;ul&gt;
&lt;li&gt;but many apparent problems can be explained as part of a learning strategy&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;there are few clear indications sunk costs are genuine problems&lt;/li&gt;
&lt;li&gt;much of what we call 'sunk cost' looks like simple carelessness &amp;amp; thoughtlessness&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;(If any of that seems unlikely or absurd to you, click through. I've worked very hard to provide multiple citations where possible, and fulltext for practically everything.)&lt;/p&gt;
&lt;p&gt;I started this a while ago; but Luke/SIAI paid for much of the work, and that motivation plus academic library access made this essay more comprehensive than it would have been and finished months in advance.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/lw/9si/is_sunk_cost_fallacy_a_fallacy/#comments"&gt;52 comments&lt;/a&gt;
</description>
</item>
<item>
<title>Prediction is hard, especially of medicine</title>
<link>http://lesswrong.com/lw/8yp/prediction_is_hard_especially_of_medicine/</link>
<guid isPermaLink="true">http://lesswrong.com/lw/8yp/prediction_is_hard_especially_of_medicine/</guid>
<pubDate>Sat, 24 Dec 2011 07:34:13 +1100</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/gwern"&gt;gwern&lt;/a&gt;
&amp;bull;
46 votes
&amp;bull;
&lt;a href="http://lesswrong.com/lw/8yp/prediction_is_hard_especially_of_medicine/#comments"&gt;46 comments&lt;/a&gt;
&lt;div&gt;&lt;blockquote&gt;
&lt;p&gt;Summary: medical progress has been much slower than even recently predicted.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;In the &lt;a href=&quot;http://www.alcor.org/cryonics/cryonics8802.txt&quot;&gt;February&lt;/a&gt; and &lt;a href=&quot;http://www.alcor.org/cryonics/cryonics8803.txt&quot;&gt;March&lt;/a&gt; 1988 issues of &lt;a href=&quot;http://www.alcor.org/magazine/&quot;&gt;&lt;em&gt;Cryonics&lt;/em&gt;&lt;/a&gt;, &lt;a href=&quot;http://chronopause.com/&quot;&gt;Mike Darwin&lt;/a&gt; (&lt;a href=&quot;http://en.wikipedia.org/wiki/Mike_Darwin&quot;&gt;Wikipedia&lt;/a&gt;/&lt;a href=&quot;/user/mikedarwin/&quot;&gt;LessWrong&lt;/a&gt;) and Steve Harris published a two-part article &amp;#x201C;The Future of Medicine&amp;#x201D; attempting to forecast the medical state of the art for 2008. Darwin has republished it on the &lt;a href=&quot;http://tech.groups.yahoo.com/group/New_Cryonet/message/1691&quot;&gt;New_Cryonet&lt;/a&gt; email list.&lt;/p&gt;
&lt;p&gt;Darwin is a pretty savvy forecaster (who you will remember correctly predicting in 1981 in &lt;a href=&quot;http://www.alcor.org/cryonics/cryonics8201.txt&quot;&gt;&amp;#x201C;The High Cost of Cryonics&amp;#x201D;&lt;/a&gt;/&lt;a href=&quot;http://www.alcor.org/cryonics/cryonics8202.txt&quot;&gt;part 2&lt;/a&gt; ALCOR&amp;#x2019;s recent &lt;a href=&quot;/lw/8fe/cryonics_costs_given_estimates_are_low/&quot;&gt;troubles with grandfathering&lt;/a&gt;), so given my &lt;a href=&quot;http://www.gwern.net/Prediction%20markets&quot;&gt;standing interests&lt;/a&gt; in tracking predictions, I read it with great interest; but they still blew most of them, and not the ones we would prefer them to&amp;#x2019;ve.&lt;/p&gt;
&lt;p&gt;The full essay is ~10k words, so I will excerpt roughly half of it below; feel free to skip to the &lt;a href=&quot;#reactions&quot;&gt;reactions&lt;/a&gt; section and other links.&lt;/p&gt;
&lt;p&gt;&lt;a id=&quot;more&quot;&gt;&lt;/a&gt;&lt;/p&gt;
&lt;h1 id=&quot;the-future-of-medicine&quot;&gt;&lt;span class=&quot;header-section-number&quot;&gt;1&lt;/span&gt; The Future of Medicine&lt;/h1&gt;
&lt;h2 id=&quot;part-1&quot;&gt;&lt;span class=&quot;header-section-number&quot;&gt;1.1&lt;/span&gt; Part 1&lt;/h2&gt;
&lt;blockquote&gt;
&lt;p&gt;What we hope we are especially good at as cryonicists is predicting the future &amp;#x2014; particularly the future of medicine. After all, our lives depend upon it. Because that&amp;#x2019;s what cryonics is about &amp;#x2014; tomorrow&amp;#x2019;s medicine today. In order for cryonics to seem reasonable, in order for it to be reasonable, it is necessary to have some idea, at least in broad outline, of where medicine is going and of where it ultimately can go. I think that the cryonicists&amp;#x2019; record on this point in a broad sense has been very good.&lt;/p&gt;
&lt;p&gt;&amp;#x2026;One thing which is rarely seen in cryonics publications is an attempt to see the shape of things to come in the near or intermediate future. Oddly enough, that&amp;#x2019;s a far more difficult and dangerous undertaking than predicting ultimates. Nor is this a problem confined to cryonics or the future of medicine. Sadi Carnot (the founder of thermodynamics) could tell you all about the &amp;#x201C;perfect heat engine,&amp;#x201D; but would have no doubt had trouble giving you hard numbers on how well heat engines would be made to perform over the 20 years or so following publication of his work&amp;#x2026;.When I look over predictions made in the 1950&amp;#x2019;s or the 1960&amp;#x2019;s about the future of medicine and/or technology, I always chuckle about just how far afield these guys were. A good example is a list of predictions made by Herman Kahn which was summarized in CRYONICS REPORTS in August of 1967 (volume 2, #8). They are reproduced as Table 1 below. Read &amp;#x2019;em and weep &amp;#x2014; or laugh if you will!&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Table 1. Less Likely But Important Possibilities, from: &lt;em&gt;The Next 33 Years: A Framework For Speculation&lt;/em&gt;, by Herman Kahn and Anthony J. Weiner (1967) [predictions for 2000 AD]&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&amp;#x201C;True&amp;#x201D; artificial intelligence&lt;/li&gt;
&lt;li&gt;Practical use of sustained fusion to produce neutrons&lt;/li&gt;
&lt;li&gt;Artificial growth of new limbs and organs&lt;/li&gt;
&lt;li&gt;Room temperature superconductors&lt;/li&gt;
&lt;li&gt;Major use of rockets for transportation (either terrestrial or extraterrestrial)&lt;/li&gt;
&lt;li&gt;Effective chemical or biological treatment for most mental illnesses&lt;/li&gt;
&lt;li&gt;Almost complete control of marginal changes of heredity&lt;/li&gt;
&lt;li&gt;Suspended animation (for years or centuries)&lt;/li&gt;
&lt;li&gt;Practical materials with nearly &amp;#x201C;theoretical limit&amp;#x201D; strengths&lt;/li&gt;
&lt;li&gt;Conversion of mammals (humans?) to fluid breathers&lt;/li&gt;
&lt;li&gt;Direct input into human memory banks&lt;/li&gt;
&lt;li&gt;&amp;#x2026;&lt;/li&gt;
&lt;/ul&gt;
&lt;/blockquote&gt;
&lt;p&gt;&amp;#x2026;My personal perspective is one of being a hard-core cryonicist who was involved in clinical medicine for the better part of a decade. My biases about predicting the future could probably be summarized as follows: I have a lot of sympathy for the incrementalist view of progress - particularly in the highly regulated area of medicine. It&amp;#x2019;s regulated because it directly and powerfully touches people&amp;#x2019;s well-being and because it is not a very fault-tolerant area &amp;#x2014; mistakes are costly and since people like being alive (at least in the short run) they get edgy if an error separates them from their actuarial expectations.&lt;/p&gt;
&lt;p&gt;I thus believe that any predictions about the future of medicine have to include what I call the &amp;#x201C;space program factor&amp;#x201D; (SPF). By this I mean simply that progress in the space program would have proceeded far, far faster (and thus approximated more closely what was theoretically possible) if it were not a high-visibility project with lots of political and social overtones which make it fault-intolerant &amp;#x2014; if you could burn up as many astronauts as you do test pilots every month, it would cost a lot less to get where you&amp;#x2019;re going. First-shot fail-safe engineering is costly. Medicine suffers from the same kinds of problems &amp;#x2014; witness the FDA as both the solution and the problem.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h3 id=&quot;diagnostics&quot;&gt;&lt;span class=&quot;header-section-number&quot;&gt;1.1.1&lt;/span&gt; Diagnostics&lt;/h3&gt;
&lt;blockquote&gt;
&lt;p&gt;I foresee a veritable explosion of diagnostic techniques and procedures. A large number of illnesses which are poorly understood today will be well-characterized the next decade and will be easy to diagnose very early in their development or even before they develop because they will be found to have direct or indirect genetic causes. Fairly predictive tests for Alzheimer&amp;#x2019;s disease, schizophrenia, depression, some malignancies, heart disease, and most of the rest of the major killers and disablers will probably be in place by 2000 to 2010. Many if not most of these ailments will be assessable in terms of a very sophisticated genetic risk profile which it will be possible to generate in infancy or childhood (or in utero). A wide range of genetic probes for illness-generating genes should be available by the end of the century.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;A side-note: genetic associations have been a very fertile field for John Ioannidis, and a big study just blew away a bunch of &lt;a href=&quot;http://www.arts.cornell.edu/econ/dbenjamin/IQ-SNPs-PsychSci-20111205-accepted.pdf&quot;&gt;SNP-IQ correlations&lt;/a&gt;.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Real-time diagnosis will also be revolutionized by the turn of the century. The next 10 to 15 years will see increasing miniaturization of sensors and chemistry packages. Tissue probes or biosensors which can measure a wide array of biological and biochemical factors will be packaged in very small, very stable devices which hold calibration over prolonged periods of time (weeks to months to years) and which can easily be inserted into the patient&amp;#x2019;s body or tissues. For example, I foresee multi-sensor units mounted on very small needle or catheter tips which can be inserted intravenously, intracranially, intra-cerebrally, subcutaneously, and so on.&lt;/p&gt;
&lt;p&gt;These sensors will be able to give real-time measurements of blood gases, pH, electrolytes, enzyme levels, and a host of other biochemical parameters that now involve costly, time-consuming, and/or impossible &amp;#x201C;laboratory studies&amp;#x201D; requiring withdrawal of a sample and processing. Real-time biosensors will revolutionize acute care of critically ill patients.&lt;/p&gt;
&lt;p&gt;&amp;#x2026;The first generation of these devices should be in the marketplace somewhere between 1990 and 1995. More sophisticated instruments capable of a wider array of measurements will quickly follow. These sensors will also have a profound impact in acute stabilization of patients in a field setting. It will be possible for paramedical personnel to quickly and effectively insert such instruments in an acutely ill patient &amp;#x2014; a victim of cardiac arrest or trauma, and immediately and globally assess that patient&amp;#x2019;s condition, relaying that information to an expert (more on who that expert will be later).&lt;/p&gt;
&lt;p&gt;&amp;#x2026;Diagnostic imaging should rapidly come down to a battle between ultrasound and MRI (NMR; (nuclear) magnetic resonance imaging). Because ultrasound units owe their size and weight almost entirely to the computer that processes the information, the size and effectiveness of these units will change on the same rapid exponential curve as the size and power of computers. MRI is a technology which has some other physical limitations, but by the year 2000, even MRI units will be far smaller, less costly, and capable of far, far better results. Bedside units or &amp;#x201C;on floor&amp;#x201D; units (i.e., units in the ICU or CCU) may be available for repeated assessment of the patient&amp;#x2019;s condition. MRI and its grandchildren and cousins should in particular be expected to undergo considerable refinement. Metabolic MRI will also be in wider use, allowing for real-time evaluation of the metabolic and working state of patient&amp;#x2019;s hearts, brains and other organs. By 2000 to 2010 the cost and size of these units may be drastically reduced and they may be in field use for acute metabolic and structural evaluation of patients with trauma or in cardiac arrest.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;I recently learned that, besides the usual blame for &lt;a href=&quot;http://articles.baltimoresun.com/2011-02-22/news/bs-ed-health-costs-20110222_1_health-care-high-quality-care-mri&quot;&gt;increasing medical costs&lt;/a&gt;, some categories of doctors have been strenuously urged to &lt;a href=&quot;http://www.wired.com/magazine/2011/12/ff_causation/all/1#post-50632&quot;&gt;reduce MRI use&lt;/a&gt; as actively harmful.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;By the late 1990&amp;#x2019;s there should be an answer to this problem in the development of the Portable Doctor or Expert Medical Device (EMD). The EMD will be both a diagnostician and therapist integrated into one unit. In an emergency medical setting (either in an ambulance or in an ICU or CCU) this powerful computer will be directly coupled to a wide array of both simple and complex medical assessment devices&amp;#x2026;.EMDs will be a very hot item. Initially (i.e., the 1990&amp;#x2019;s) they will be confined to ambulances and the ICU, CCU, and specialty areas of the hospital, such as radiology and cardiology labs. But there will be powerful incentives for wider application of these devices. As computing capacity drops in cost and increases radically in sophistication (i.e., parallel processors, neural networks, truly massive memories, and so on) expert medical (and other) systems will see increasing application. There will be devices on the market such as a &amp;#x201C;Home Doctor&amp;#x201D; diagnostic program, which will basically be an internal medicine physician in a can.&lt;/p&gt;
&lt;p&gt;&amp;#x2026;After 2000, many people will probably have a small sensor array permanently implanted and coupled to telemetry equipment which can be activated to call for help or alert the person that trouble is brewing. People with a known risk of sudden health problems will be the first to use these kinds of devices. With the development of smaller and cheaper telemetry equipment (directly linked to large-antenna satellites), separate telemetry arrangements will disappear. Implantable, computer-controlled defibrillators are already a reality; analogous devices to deliver drugs in case of cardiac or brain infarct (stroke) will eventually become reality.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h3 id=&quot;resuscitation&quot;&gt;&lt;span class=&quot;header-section-number&quot;&gt;1.1.2&lt;/span&gt; Resuscitation&lt;/h3&gt;
&lt;blockquote&gt;
&lt;p&gt;Expect a shift back to open-chest heart massage and away from closed-chest massage in medical and perhaps even paramedical settings. Closed-chest CPR will be realized to be ineffective at maintaining cerebral viability and will be replaced by far more effective open chest methods. In paramedical (i.e., field) settings the emphasis will be on very rapid defibrillation &amp;#x2014; or actually &amp;#x201C;leaving the patient alone&amp;#x201D; until circulation can be effectively restored and medications given to inhibit reperfusion injury. Closed chest CPR and restarting circulation by laymen &amp;#x201C;in the field&amp;#x201D; will be realized to be doing more harm than good and there may well be a move away from field CPR, with laymen being instructed to leave the patient without circulation until it can be restarted adequately and under controlled conditions.&lt;/p&gt;
&lt;p&gt;By the late 1990&amp;#x2019;s, extended use of CPR will be a thing of the past and major metropolitan areas will have &amp;#x201C;death reversal units&amp;#x201D; (DRUs) in emergency rooms and perhaps even in larger paramedical units. The DRUs will employ rapid femoral cut-downs and blood-pump/oxygenator supported resuscitation to recover people who have suffered extended periods of ischemia (in the 30 minute to 1 hour range). CPR will be realized very often to be ineffective at recovering patients who are profoundly ischemic and the advent of pharmacologic intervention allowing for cerebral resuscitation will provide tremendous pressure for emergency rooms to develop the capability to very rapidly put an ischemic patient on bypass and completely and adequately support his circulatory and respiratory needs until his brain can recover and/or his heart can be repaired and restarted. An intermediate scenario would be the development of small, flexible impeller pumps that can be collapsed and passed through a large bore percutaneous catheter through the femoral artery and into the abdominal aorta. Such a pump (acting much like the propeller on an outboard boat motor) could then be used to supplement CPR, perhaps providing 2&amp;#x2013;3 liters per minute of cardiac output.&lt;/p&gt;
&lt;p&gt;&amp;#x2026;Another effect of drugs like the lazaroids and calcium channel blockers will be the more effective treatment of acute injuries to a wide range of tissues such as the spinal cord and brain. Much of the damage that occurs to these tissues is free radical related and can be inhibited by use of these drugs&amp;#x2026;Intervention into secondary inflammation will be most important in the brain and spinal cord. Deployment of these techniques will result in the salvage of many spinal cords that would be considered irreversibly injured by today&amp;#x2019;s medicine. There will be far, far fewer paraplegics. However, expect an increase in the number of permanently brain-injured patients and in the number of patients with &amp;#x201C;subtle&amp;#x201D; forms of cerebral injury resembling mild stroke or the cognitive or mood disorders seen in diseases like multiple sclerosis or acute head injury. These disease states will result because people with brain trauma who would have died acutely from secondary free radical mediated injury (cerebral edema and so on) will be saved with lazaroids and other cerebral rescue techniques.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h3 id=&quot;antibiotics&quot;&gt;&lt;span class=&quot;header-section-number&quot;&gt;1.1.3&lt;/span&gt; Antibiotics&lt;/h3&gt;
&lt;blockquote&gt;
&lt;p&gt;The next twenty years should see many powerful new antibiotics engineered directly from knowledge of the structure of the relevant microbial enzyme which it is desired to inhibit. Not only will these antibiotics be more powerful, but because they do not exist in nature, strain resistance will not so easily develop toward them as it has for the antibiotics of today.&lt;/p&gt;
&lt;p&gt;In addition, the next generation of antibiotics will include many which have been designed for effect against viruses, an area where medicine is presently largely powerless.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;The pharmaceutical industry and antibiotics have been a case-study in stagnation, failure, and diminishing marginal returns. There is &lt;a href=&quot;http://en.wikipedia.org/wiki/DRACO_%28antiviral%29&quot;&gt;only one&lt;/a&gt;, highly experimental, anti-viral that I have heard of. In a &lt;a href=&quot;http://tech.groups.yahoo.com/group/New_Cryonet/message/1719&quot;&gt;followup email&lt;/a&gt;, Darwin responded to someone else pointing out DRACO:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;span style=&quot;font-family: Arial; color: #000000; font-size: x-small;&quot;&gt; &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style=&quot;font-size: small;&quot;&gt;Finally, while Geoff cites this putative advance in antiviral drug therapy, the fact is that my prediction about a plethora of new and highly effective targeted molecular antimicrobials by 2008 was &lt;strong&gt;WRONG&lt;/strong&gt;. In fact, antibiotic research is all but dead, and there are virtually no fundamentally new antibiotics in the drug pipeline. This should scare the crap out of all us, because we are rapidly approaching complete antibiotic resistance with a number of common and highly lethal bugs, including staph (MRSA), streptococcus, &amp;#xA0;E. coli, pseudomonas and candida. It is only a matter of months to a few years, at most, &amp;#xA0;before completely antibiotic resistance staph and streptococcus emerge. Pharmaceutical companies have a large &lt;strong&gt;negative&lt;/strong&gt; incentive for developing new antimicrobials. At the cost of over a billion dollars a new drug (regulatory) and the high risk of withdrawal of the drug within 5 years (2 out of 3), as well as the near certainty of punishing litigation for adverse effects, antibiotics are not merely uneconomical to develop, they are fiscal suicide. Only drugs that will be chronically used by very large numbers of patients are now worth developing.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;&lt;span style=&quot;font-size: small;&quot;&gt;(This agrees with my own general impressions, which I didn't feel competent to baldly state.)&lt;br&gt;&lt;/span&gt;&lt;/p&gt;
&lt;h3 id=&quot;immunology-and-cancer&quot;&gt;&lt;span class=&quot;header-section-number&quot;&gt;1.1.4&lt;/span&gt; Immunology and cancer&lt;/h3&gt;
&lt;blockquote&gt;
&lt;p&gt;&amp;#x2026;Monoclonal and synthetic antibodies carrying toxins or regulatory molecules will be used to turn off or destroy the fraction of immune cells which initially respond and proliferate when a transplant is carried out. More widespread transplantation of tissues will be undertaken, including transplantation of limbs and scalp. Xenografts will be used increasingly in the mid to late 1990&amp;#x2019;s and it will not be uncommon for people to have pancreatic tissue from bovine or porcine sources and perhaps hearts, lungs, and livers from other animals. Expect the first workable transplants to be from great apes (chimps, gorillas, orangutans), with porcine and bovine grafts coming later.&lt;/p&gt;
&lt;p&gt;Immunology and immunotherapy will also be revolutionized by a far more complete understanding of the immune system resulting from the AIDS epidemic and basic research in the immunology of diseases such as multiple sclerosis and aging. The ability to rapidly and cheaply synthesize bioregulatory molecules will open up a wide array of therapeutic possibilities. Expect effective treatments for most autoimmune diseases (lupus, multiple sclerosis, myasthenia gravis, and so on) by the mid to late 1990&amp;#x2019;s. The mid to late 1990&amp;#x2019;s should also see the wider application of immunorestoratives for use with the aged and ill. Cancer therapy will improve considerably as a result of these advances as well as a result of selective targeting techniques. By the early to mid&amp;#x2013;1990s the first generations of monoclonal antibodies linked to chemotherapeutic agents or powerful natural toxins will be used against a few cancers.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h3 id=&quot;atherosclerosis&quot;&gt;&lt;span class=&quot;header-section-number&quot;&gt;1.1.5&lt;/span&gt; Atherosclerosis&lt;/h3&gt;
&lt;blockquote&gt;
&lt;p&gt;Atherosclerosis will undergo a very marked but nevertheless gradual reduction in frequency and severity of occurrence as physicians slowly become educated about what is already known and begin to use existing therapeutic modalities more aggressively. By the mid to late 1990&amp;#x2019;s it will be more widely understood that atherosclerosis can be reversed, and there will be wider use of drugs such as lovastatin to reduce serum cholesterol, coupled with sound dietary advice. However, even well into the late 1990&amp;#x2019;s and perhaps beyond, atherosclerotic disease (heart attack, stroke, ischemic limb disease, and so on) will continue to be a serious source of morbidity and mortality. By the late 1990&amp;#x2019;s, 2nd and 3rd generation therapies will be coming on-line which will be able to reverse atherosclerotic disease and more directly inhibit it&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h2 id=&quot;part-2&quot;&gt;&lt;span class=&quot;header-section-number&quot;&gt;1.2&lt;/span&gt; Part 2&lt;/h2&gt;
&lt;h3 id=&quot;anesthesia&quot;&gt;&lt;span class=&quot;header-section-number&quot;&gt;1.2.1&lt;/span&gt; Anesthesia&lt;/h3&gt;
&lt;blockquote&gt;
&lt;p&gt;Expect &amp;#x201C;modular&amp;#x201D; anesthesia by the 1990&amp;#x2019;s to the early 2000&amp;#x2019;s. The development of potent anxieolytics (anxiety removers) which do not depress consciousness and the development of total pain inhibitors will allow for complicated surgical procedures on conscious patients. Expect to see major thoracic and limb surgery on high risk patients (i.e., patients unable to tolerate anesthesia) using such agents.Major abdominal surgery requiring deep muscle relaxation will continue to require skeletal muscle paralysis and general anesthesia. However, expect new drugs in the market place in the late 1990&amp;#x2019;s which induce unconsciousness without respiratory or cardiac depression.&lt;/p&gt;
&lt;p&gt;Surgical and post surgical mortality will decrease sharply due to such anesthetics and the use of real-time physiological and biochemical monitoring during and after surgery using biosensors.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h3 id=&quot;surgery&quot;&gt;&lt;span class=&quot;header-section-number&quot;&gt;1.2.2&lt;/span&gt; Surgery&lt;/h3&gt;
&lt;blockquote&gt;
&lt;p&gt;&amp;#x2026;Catheters, laparascopes, and thorascopes with sensors, operating tools, and an impressive array of capabilities will be increasingly used. Abdominal surgery will shift more and more towards the use of the fiberoptic laparascope, endoscope, and laser as miniaturization of tools occurs and disease is diagnosed earlier. Early diagnosis will create the need for less drastic procedures.&lt;/p&gt;
&lt;p&gt;Fine-tuned repair of heart valves and blood vessels, and examination and biopsy of suspected abdominal and retroperitoneal lesions will be early candidates for application of this technology.&lt;/p&gt;
&lt;p&gt;&amp;#x2026;In contrast to therapeutic surgery, the frequency of cosmetic surgery will probably increase dramatically as techniques are refined and prosthetics improve in quality and drop in cost. As people live longer, and stay productive longer as well, they will increasingly turn to medicine to maintain not only their health but their appearance. Cosmetic surgery will experience a boom until such time as the fundamental mechanisms underlying the aging process can be brought under control.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h3 id=&quot;geriatrics&quot;&gt;&lt;span class=&quot;header-section-number&quot;&gt;1.2.3&lt;/span&gt; Geriatrics&lt;/h3&gt;
&lt;blockquote&gt;
&lt;p&gt;Advances will be slow here, but significant. Expect increasing understanding and application of trophic factors and bioregulatory compounds. Early candidates for rejuvenation will be the immune system and other stem cell systems or systems with higher cell turnover. By the early decades of 2000, significant rejuvenation and geroprophylaxis of skin, bone, immune, and other &amp;#x201C;high turn- over&amp;#x201D; tissues will be possible as the natural regulatory molecules which control these systems are understood and applied.&lt;/p&gt;
&lt;p&gt;&amp;#x2026;By the early years of the 21st century the first generation of compounds effective at &amp;#x201C;rejuvenating&amp;#x201D; (i.e., restoring some degree of normal maintenance and repair to existing brain cells) the central nervous system will be available. These drugs will work by turning on protein synthesis and stimulating natural repair mechanisms.&lt;/p&gt;
&lt;p&gt;However, pathologies of the brain and other non-dividing tissues (renal, cardiac, and musculoskeletal system) will continue to be major sources of morbidity and mortality over the next two decades. As atherosclerosis and immune-related disorders are dealt with more effectively, expect an increasing shift of morbidity and mortality to central nervous system-related causes. Beyond 2000 this may be treated to a limited extent with fetal transplant&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p id=&quot;psychiatry-behavior&quot;&gt;We all know how well this has worked out. More troubling is that in some respects, we appear &lt;em&gt;further&lt;/em&gt; from any solutions or treatments than before; while resveratrol did well in &lt;a href=&quot;http://pipeline.corante.com/archives/2011/11/10/resveratrol_in_humans_results_of_a_controlled_trial.php&quot;&gt;a recent human trial&lt;/a&gt;, the sirtuin research that seemed so promising &lt;a href=&quot;https://www.sciencemag.org/content/334/6060/1194&quot;&gt;has been battered&lt;/a&gt; by null results and failures to replicate. And anti-aging drugs have their own methodological difficulties; from the &lt;a href=&quot;http://tech.groups.yahoo.com/group/New_Cryonet/message/1719&quot;&gt;followup email&lt;/a&gt;:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;span style=&quot;font-size: small;&quot;&gt;Antiaging drugs are unlikely to be free of adverse effects. In fact, it seems very likely that they will be burdened with many adverse effects and that they will even kill a minority of people who use them. The common perception is that antiaging drugs will make people super fit, healthier and more resistant to disease. And yet, in calorie restriction and effective antiaging drug studies there is emerging evidence that slowing aging comes at the cost of interfering with fundamental processes that make organisms fitter for both reproduction and for surviving in a hostile environment. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style=&quot;font-size: small;&quot;&gt;Consider the putative antaging drug rapamycin. It seems likely that rapamycin interferes with senesence by affecting the PI3-kinase and TOR: PIKTORing cell growth pathways. This almost certainly means that in some individuals there will serious and even lethal side effects - cancer being one of them. [Persons with a history of promiscuity, and thus a heavy burden of chronic viral infection, and those with certain &quot;unfavorable&quot; genotypes will likely be at very high risk.] But, beyond cancer, interfering with these fundamental and deeply evolutionarily conserved pathways is likely to cause a range of adverse effects that negatively (and possibly irreversibly) impact normal body functions, such as energy level, cognition, sexual performance, and so on.. While some people are now using rapamycin as an antiaging drug...&lt;/span&gt;&lt;span style=&quot;font-size: small;&quot;&gt;it is virtually inconceivable that any major pharmaceutical company anywhere in the world would (or will) market such a drug for &quot;normal&quot; aging. This is important to understand because it gives us basic insight into what will almost certainly be a major barrier to the development and marketing of antiaging drugs: they will necessarily be used by large numbers of people over the course of many decades (and thus millions of drug/person years) and they are incredibly unlikely to be free of adverse, and sometimes even lethal side effects.&lt;/span&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h3&gt;&lt;span class=&quot;header-section-number&quot;&gt;1.2.4&lt;/span&gt; Psychiatry &amp;amp; Behavior&lt;/h3&gt;
&lt;blockquote&gt;
&lt;p&gt;Diagnosis by brain scanning (metabolic MRI) and chemical analysis of cerebrospinal fluids will be commonplace in 20 years. As neuroregulatory compounds are better understood and as the biochemistry underlying mental disorders is elucidated there will be more effective treatments. Expect 2nd and 3rd generation drugs and combinations thereof for treatment of depression and psychosis by the late 1990&amp;#x2019;s. There will probably be several very effective therapeutic agents for compulsive disorders in the marketplace by the early to mid 1990&amp;#x2019;s.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;From the previously quoted &lt;a href=&quot;http://tech.groups.yahoo.com/group/New_Cryonet/message/1719&quot;&gt;followup email&lt;/a&gt;:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;span style=&quot;font-family: Arial; color: #000000; font-size: x-small;&quot;&gt; &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style=&quot;font-size: small;&quot;&gt;Similarly, psychiatric drugs (which &lt;em&gt;are&lt;/em&gt; typically chronically used) are no longer economical to develop and market because of the litigation costs associated with them. Widespread chronic use of &lt;strong&gt;any&lt;/strong&gt; drug means that the likelihood of adverse conditions that were impossible to detect in the testing phase of the drug development process are almost certain to emerge.&amp;#xA0;Statistics rule in drug development, and a Phase III study that lasts a year and enrolls 5,000 patients is simply not adequately powered to predict what will happen when 5 million patients take&amp;#xA0;a drug for 20 years! The only way to get that data is to do&lt;strong&gt;&amp;#xA0;that&lt;/strong&gt; study. And therein lies a&amp;#xA0;powerful caution about antiaging drugs. These drugs will likely need to be taken&amp;#xA0;starting in young adulthood, or in middle age, at latest,&amp;#xA0;and they will need to be taken for a lifetime. Indeed, if they are effective, for a longer lifetime than any but a few super-centenarians &amp;#xA0;has previously lived. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h3 id=&quot;implants-prosthetics&quot;&gt;&lt;span class=&quot;header-section-number&quot;&gt;1.2.5&lt;/span&gt; Implants &amp;amp; Prosthetics&lt;/h3&gt;
&lt;blockquote&gt;
&lt;p&gt;Early spectacular applications will be small vessel prostheses (wide use by the early to mid 1990&amp;#x2019;s) for use in traumatized and atherosclerotic limbs and organs and venous prostheses (mid to late 1990&amp;#x2019;s) for use in treating traumatic injuries and deep vein incompetence (which results in varicosities, chronic pain, and edema-related skin changes in the leg, often leading to non-healing ulcers or limb loss). Another application of non-thrombogenic surfaces will be a practical artificial heart and more widespread use of extracorporeal support for infants, trauma and cardiac arrest victims, and others where anticoagulation provides a major barrier to the use of artificial circulation.&lt;/p&gt;
&lt;p&gt;&amp;#x2026;Good synthetic bone and skin should be available by the late 1990&amp;#x2019;s to early 2000&amp;#x2019;s. Good red cell and plasma substitutes (synthetic blood) should be seen increasing in clinical use throughout the early 1990&amp;#x2019;s and in frequent use by the late 1990&amp;#x2019;s to early 2000&amp;#x2019;s.&lt;/p&gt;
&lt;p&gt;There will be steady improvement in other synthetic materials such as hip, knee, and other joints, as well as in other less dramatic materials such as connective tissue replacements. Expect a slow replacement of prosthetic approaches to therapy as natural repair and regeneration processes are better understood and utilized. Expect to see synthetic connective tissue products for tendon repair which contain bioregulatory molecules (BRMs) that stimulate tendon regeneration. Artificial tendons made of both synthetic and/or natural materials will come into use in the late 1980&amp;#x2019;s to early 1990&amp;#x2019;s. In short, expect stunning advances in tissue replacement technology for all tissues that have primarily structural function and which are not complicated chemical processing plants, such as the liver or kidneys, or mechanically active such as the heart. In addition to connective tissue and bone, a candidate for early (late 1980&amp;#x2019;s to early 1990&amp;#x2019;s) replacement is the cornea. Expect evolution in biocompatible materials to allow for replacement of the cornea with an appropriate plastic, much like the lens of the eye is already replaced with polymer inserts.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h3 id=&quot;hemodialysis&quot;&gt;&lt;span class=&quot;header-section-number&quot;&gt;1.2.6&lt;/span&gt; Hemodialysis&lt;/h3&gt;
&lt;blockquote&gt;
&lt;p&gt;Advances in hemodialysis will also be very incremental. There may be a gradual shift to peritoneal dialysis (PD) if good drugs to block glucosylation of proteins and inhibit cholesterol deposition are available. The major problem with PD today is that it raises blood sugars to astronomical levels, causing diabetic-like side effects. Inhibition of these side effects may lead to renewed application of this modality.&lt;/p&gt;
&lt;p&gt;Direct changes in dialysis are likely to be along the lines of better membrane materials which allow for transport of wastes not currently removable by conventional dialysis and nonthrombogenic surfaces which will reduce the need for anticoagulation. The use of BRMs such as erythropoetin to treat anemia and bone growth factors to treat dialysis bone disease will help to improve the quality and quantity of patient&amp;#x2019;s lives on dialysis.&lt;/p&gt;
&lt;p&gt;Perhaps the biggest advance in this area will be advances in immunology and infectious disease treatment. The ability to administer BRMs to stimulate immune function and improve general health should act to extend dialysis patients&amp;#x2019; lives considerably.&lt;/p&gt;
&lt;p&gt;&amp;#x2026;Of course, the biggest improvement in the life expectancy and health of dialysis patients will probably come in the form of the increasing use of transplantation and its application to a wider age range of patients with better long term results.&lt;/p&gt;
&lt;p&gt;The most striking revolution in prosthetics will probably occur in dentistry. Expect a whole family of new materials to enter the dental operatory. A workable vaccine against streptococcus mutans should be available by the mid to late 1990&amp;#x2019;s, greatly reducing the incidence of tooth decay by eliminating the major class of mouth organisms that cause it. Similar advances in prevention and in treatment of gum disease can be expected as well, although probably not as soon. Repairing dental defects will also be revolutionized by the introduction of good, tough, and reliable polymers which will replace metallic amalgams. By the late 1990&amp;#x2019;s to early 2000&amp;#x2019;s biocompatible ceramics and coated polymers will be available that will allow for workable single tooth and multitooth gum-implanted prostheses.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h3 id=&quot;organ-preservation&quot;&gt;&lt;span class=&quot;header-section-number&quot;&gt;1.2.7&lt;/span&gt; Organ Preservation&lt;/h3&gt;
&lt;blockquote&gt;
&lt;p&gt;Ever since the work of people like Mazur, Fahy, and Pegg was published, it has become pretty clear what the constraints are on long term viable cryopreservation of organs: don&amp;#x2019;t form any significant amount of ice; it injures mechanically and it injures chemically. The problem is that water loves to turn into ice when it&amp;#x2019;s cooled below 0&amp;#xF8;C. To circumvent this, a lot of very drastic changes have to be made in the system. Whenever you attempt to make a drastic change in a complicated, interdependent living system &amp;#x2014; like replacing half the water in it with industrial chemicals &amp;#x2014; you are in for trouble. The trouble will come in the form of a very tight or narrow window for success: everything will have to be &amp;#x201C;just right.&amp;#x201D;&lt;/p&gt;
&lt;p&gt;This is where current vitrification technology is now. The existence of such a tight window means that vitrification of large masses will be a technological tour-de-force requiring very sophisticated computer controlled perfusion equipment and exotic and very costly high pressure chambers. Quality control and reliable storage and rewarming of organs will be very costly and difficult.&lt;/p&gt;
&lt;p&gt;The future holds the possibility of developing better solute systems which vitrify more easily and which are less toxic (have a wider window for success). It is difficult to predict the pace of advance in this area since it will be arrived at by a mixture of empirical methods and theoretical insights. A big determining factor will be luck. Will the NIH and the Red Cross continue to fund such efforts? And, more to the point, will technological advances in other areas of organ preservation obviate the need for them? If we were betting men, we&amp;#x2019;d put our dollars on the latter rather than on the former. Major advances in organ preservation (as opposed to cell and tissue preservation) over the next decade will probably be in three areas: 1) Extended hypothermic storage of organs in the 2 to 3 weeks range; 2) Extended normothermic or room temperature storage of organs in the weeks to months range and; 3) mixtures of the above two modalities which yield similar available time courses of storage.&lt;/p&gt;
&lt;p&gt;&amp;#x2026;The next 5 to 10 years should also see major advances in our understanding of the effects of deep hypothermia on the tissues and organs of non-hibernating mammals. These advances should be readily translatable into better flush and perfusion storage techniques for organs. A good understanding of lipid metabolism and mechanisms of cell swelling in deep hypothermia may allow for preservation of organs in the 2&amp;#xF8;C to 10&amp;#xF8;C temperature range for periods of several months &amp;#x2014; thus definitively ending the need for long term solid state preservation of transplantable organs.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h3 id=&quot;other-approaches-to-organ-preservation&quot;&gt;&lt;span class=&quot;header-section-number&quot;&gt;1.2.8&lt;/span&gt; Other Approaches to Organ Preservation&lt;/h3&gt;
&lt;blockquote&gt;
&lt;p&gt;One possibility for a major advance over the next two decades is room temperature or hypothermic preservation of organs or organisms using metabolic inhibitors. There have been tantalizing clues in the examination of a wide variety of estivators (animals which go into states of profoundly reduced metabolism at normal temperatures, such as the African lungfish, which can shut off metabolism at temperatures in the range of 30&amp;#xF8;C to 40&amp;#xF8;C) that anti-metabolite compounds exist which may be able to induce states of profoundly reduced metabolism at ambient (i.e., 70&amp;#xF8;F) temperatures.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h3 id=&quot;genetic-therapy&quot;&gt;&lt;span class=&quot;header-section-number&quot;&gt;1.2.9&lt;/span&gt; Genetic therapy&lt;/h3&gt;
&lt;blockquote&gt;
&lt;p&gt;Expect very gradual application of this technology. Early candidates for gene replacement will be in storage diseases such as Lesch-Nyhan, Tay-Sachs, and other &amp;#x201C;single enzyme missing&amp;#x201D; disorders. Later applications will include treatments for hypercholesterolemia, some forms of hypertension, and other congenital missing enzyme syndromes. Very late applications (2000 or later) may be in the treatment of a wide range of mental illnesses and cancers.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h3 id=&quot;prevention&quot;&gt;&lt;span class=&quot;header-section-number&quot;&gt;1.2.10&lt;/span&gt; Prevention&lt;/h3&gt;
&lt;blockquote&gt;
&lt;p&gt;The principal lesson is the lesson of the impact of calorie restriction on overall health, well-being, and lifespan. The basic message here is &amp;#x201C;you are what you eat.&amp;#x201D; In terms of treating atherosclerotic disease, the role of prevention is already clear. By reducing fat intake and decreasing serum cholesterol to below 150 mg/dl, most atherosclerotic disease can be avoided. Similarly, basic changes in nutrition such as trace element and vitamin supplementation can greatly reduce the number of late onset malignancies. Eliminating smoking will also be a major factor in achieving this end&amp;#x2026;.Calorie restriction achieved by means of education and therapeutic agents seems the next big area of preventics to be explored by medicine. Expect the development of truly effective anorectics for treatment of gross obesity and eating disorders by the late 1980&amp;#x2019;s and then secondary use of these for treatment of mild obesity and weight control in the normal middle aged. Products with reduced calories employing fat substitutes such as sucrose polyester should also be entering the marketplace in the early 1990&amp;#x2019;s and these will help to reduce the calorie load further.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h3 id=&quot;the-downside&quot;&gt;&lt;span class=&quot;header-section-number&quot;&gt;1.2.11&lt;/span&gt; The Downside&lt;/h3&gt;
&lt;blockquote&gt;
&lt;p&gt;A little information is a dangerous thing, and sometimes a lot of information can be an even more dangerous thing. The reason is that progress in therapeutics, which is relatively difficult, always lags far behind progress in diagnosis, which is relatively easy. This imbalance results in a tension which forces premature treatment which often does more harm than good. It is well to note that each new diagnostic modality brings with it a flood of new information which will at first be grossly misused before anyone understands what it means (Harris&amp;#x2019;s Law of Diagnostics Advance).&lt;/p&gt;
&lt;p&gt;A recent example of this sort of thing is the EKG machine, which for the first time showed that many seemingly normal people had strange cardiac rhythms, some of which were seen also around the time people died suddenly of heart problems. Because of this association, for the last 15 years, a number of very powerful drugs have been used to treat people with such rhythms. Many drug-induced fatalities resulted. Unfortunately, only now is it beginning to be understood that most people with good heart function are less in danger from such rhythms than they are from the drugs used to treat them &amp;#x2014; a finding of little consolation to the people already killed by the drugs.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;And on to the economics:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;There is a second downside to advanced medicine, of course, besides the danger, that is the cost of &amp;#x201C;middlingly advanced technology&amp;#x201D; (such as what we&amp;#x2019;ll see in the next fifty years) in a society takes a socialistic view of health care. Such as ours.&lt;/p&gt;
&lt;p&gt;Non-molecular technology is expensive. It should be obvious to the reader, with a bit of thought that in a world of non-molecular technology, the potential demand for medical care as technology advances, is (for all intents and purposes) Iinfinite. In America, we have adopted the unfortunate policy of letting everyone pay for everyone else&amp;#x2019;s medical care, which has had exactly the same result as if we had let everyone pool their money and pay for each other&amp;#x2019;s lunch: everyone orders lobster. We have paid for the lobster only by spreading the costs around to places where they are not obvious. For instance, when you buy an American car, you pay more money for the health care costs of the people who built it than you do the steel that goes into it. This kind of thing can continue very subtly and very insidiously until a very large fraction of the gross national product is eaten up by health care costs. (In our country, it is already 11% and rising).&lt;/p&gt;
&lt;p&gt;One day, you may find that you have had to forego your family vacation in order to buy Granny that new AUTODOC which measures 245 different chemicals in her blood every minute and transmits all of the results to Medical Multivac in Bethesda. Of course you may not realize this: all you will know is that the vacation went because money is so tight, taxes are so high, and inflation is so bad. But your money went to Granny nevertheless. The only answer to this problem, short of nanotechnology, is rationing.&lt;/p&gt;
&lt;p&gt;But rationing itself becomes the last great social cost of advanced medical technology under socialism, because history shows that it is never done on an individual (person by person) basis. When people do not pay for their own medical care, no one (not doctors, families, or the government) has ever been willing to make the decision of who should benefit from a given technology, and who should not. Therefore, all systems of rationing to control medical costs ultimately have come down in the past to rationing technology across the board&amp;#x2026;So all of the rosy predictions made in this article must be tempered with the &amp;#x201C;social&amp;#x201D; realities that medicine will have to deal with in the next 20 years. Many of the advances we have discussed may simply not materialize because we are not wealthy enough to afford them collectively. That will be a great tragedy.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h1 id=&quot;reactions&quot;&gt;&lt;span class=&quot;header-section-number&quot;&gt;2&lt;/span&gt; Reactions&lt;/h1&gt;
&lt;p&gt;On reading all the foregoing, I commented: that was a depressing read. As far as I can tell, they were dead on about the dismal economics, somewhat right about the diagnostics, and fairly wrong about everything else. Which is better than the old predictions listed, only one of which struck me as obviously right (but in a useless way, who actually uses &lt;a href=&quot;http://en.wikipedia.org/wiki/Fluorocarbon&quot;&gt;perfluorocarbons&lt;/a&gt; for &lt;a href=&quot;http://en.wikipedia.org/wiki/Liquid_breathing&quot;&gt;liquid breathing&lt;/a&gt;?).&lt;/p&gt;
&lt;p&gt;To which Darwin said:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;At the time I wrote it I kept saying to myself, almost none of this stuff is going to happen in 20 years - not here anyway. However, I &lt;em&gt;have&lt;/em&gt; to come up with something.&lt;/p&gt;
&lt;p&gt;Ironically, in the area of cerebral resuscitation, where I am a supposed &amp;#x201C;expert,&amp;#x201D; I tried very hard to be realistic and to be both accurate and precise. That was arguably the area where I did the worst - exactly the way other experts fare when they try to predict the future of their fields&amp;#x2026;So, here I am, 24 &lt;em&gt;years&lt;/em&gt; out from making those predictions and I read the &lt;em&gt;crap&lt;/em&gt; posted on Less Wrong and on Cryonet and I don&amp;#x2019;t whether to scream in rage and anger, or weep. How is possible to reach and convince this new generation of cryonics &amp;#x201C;passivists&amp;#x201D; that Yudkowsky and Alcor are breeding and make them understand that progress will continue to be unacceptably slow unless the &lt;em&gt;system&lt;/em&gt; itself is changed?&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;See also &lt;em&gt;Fight Aging!&lt;/em&gt;&amp;#x2019;s post, &lt;a href=&quot;http://www.fightaging.org/archives/2011/12/overestimating-the-near-future.php&quot;&gt;&amp;#x201C;Overestimating the Near Future&amp;#x201D;&lt;/a&gt;:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&amp;#x2026;Many of the specific predictions in the article were in fact demonstrated in the laboratory to some degree, and were technically feasible to develop as commercial products by the year 2000, and in some cases earlier but at much greater expense. Certainly there are partial hits for many of the predictions by 2010, in the sense of it being possible, somewhat demonstrated, or in the early stages of being shown to be a practical goal. Yet the regulatory environment in much of the developed world essentially rules out any form of adventurous, rapid, highly competitive development in clinical medicine - such as exists in the electrical engineering, computing, and other worlds. We are cursed therefore with the passage of many years between a new medical technology being demonstrated possible and then attempted in the marketplace &amp;#x2026; if it ever makes it to the marketplace at all. &lt;a href=&quot;https://www.opencures.org/content/open-cures-speed-clinical-development-longevity-science&quot;&gt;This must change&lt;/a&gt; if we are to see significant progress.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Darwin comments there:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;I&amp;#x2019;ve been going over my original manuscript and surfing the web for specific applications (approved or in process) which meet the criteria of my predictions of 24 years ago. While many of my &amp;#x201C;lesser&amp;#x201D; predictions are in fact being realized (often in ways totally unforeseen by us when we wrote the article) overall it is a profoundly depressing experience.&lt;/p&gt;
&lt;p&gt;Perhaps nowhere has that been more true than in the areas of aging and cerebral resuscitation - two fields of endeavor I&amp;#x2019;ve spent a lifetime working on, or intimately involved with those who are. In 1999, &lt;a href=&quot;http://www.cryonet.org/cgi-bin/dsp.cgi?msg=11238&quot;&gt;we announced&lt;/a&gt; that we had achieved repeatable recovery of dogs following 16+ minutes of whole body noromothermic cardiac arrest with no neurological deficit. The enabling molecules and techniques (principally a combination of melatonin, alpha-phenyl-n-tert-butyl-nitrone (PBN), and mild post-cardiac arrest therapeutic hypothermia) all seemed eminently applicable in the (then) immediate future. Indeed, an analog of PBN, 2,3-dihydroxy&amp;#x2013;6-nitro&amp;#x2013;7-sulfamoyl-benzo(F)quinoxaline (NBQX) had passed Phase I and II clinical trials for the treatment of stroke with flying colors, and seemed destined for approval.&lt;/p&gt;
&lt;p&gt;That was 13 years, ago, and there is still not a single drug available (approved or otherwise) anywhere in the world to treat cerebral ischemia-reperfusion injury - the real killer in cardiac arrest and stroke! Do a &lt;a href=&quot;http://www.ncbi.nlm.nih.gov/pubmed?term=melatonin%20cerebral%20ischemia&quot;&gt;literature search&lt;/a&gt; on Pubmed for melatonin + cerebral ischemia and you will get ~130 hits - almost all of them dramatically positive. Melatonin is a naturally occurring bioregulatory molecule which is inexpensive and freely available as an over the counter &amp;#x201C;nutrient.&amp;#x201D; Even as a stand alone molecule, melatonin is powerfully protective in both global and regional cerebral ischemia, and yet no human application has been forthcoming. It&amp;#x2019;s been 15 years since &lt;a href=&quot;http://patft.uspto.gov/netacgi/nph-Parser?Sect1=PTO2&amp;amp;Sect2=HITOFF&amp;amp;p=1&amp;amp;u=%2Fnetahtml%2FPTO%2Fsearch-bool.html&amp;amp;r=6&amp;amp;f=G&amp;amp;l=50&amp;amp;co1=AND&amp;amp;d=PTXT&amp;amp;s1=melatonin&amp;amp;s2=federowicz&amp;amp;OS=melatonin+AND+federowicz&amp;amp;RS=melatonin+AND+federowicz&quot;&gt;our patent on melatonin and other cerebroprotective molecules&lt;/a&gt; was issued, 17 years since the patent was applied for, and over 20 years since I made the discovery! Indeed, mild therapeutic hypothermia, made the supposed standard of care for post cardiac arrest neuroinjury nearly a decade ago, is still largely ignored and is used well in only a handful of hospitals worldwide.&lt;/p&gt;
&lt;p&gt;What kind of black irony is it that I live in terror of stroke and cardiac arrest (for both myself and my loved ones) and yet the very molecules I discovered to combat them are as unavailable as if they had never been found? Change? Yes, change is certainly needed, and soon.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h1 id=&quot;further-reading&quot;&gt;&lt;span class=&quot;header-section-number&quot;&gt;3&lt;/span&gt; Further reading&lt;/h1&gt;
&lt;p&gt;Previous Darwin-related posts:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&quot;/lw/6vq/on_the_unpopularity_of_cryonics_life_sucks_but_at/&quot;&gt;&amp;#x201C;On the unpopularity of cryonics: life sucks, but at least then you die&amp;#x201D;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;/lw/6me/alcor_finances/&quot;&gt;&amp;#x201C;ALCOR finances&amp;#x201D;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;/lw/7vv/mike_darwin_on_kurzweil_technooptimisim_and/&quot;&gt;&amp;#x201C;Mike Darwin on Kurzweil, Techno-Optimism, and Delusional Stances on Cryonics&amp;#x201D;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;/lw/7zp/mike_darwin_on_steve_jobss_hypocritical_stance/&quot;&gt;&amp;#x201C;Mike Darwin on Steve Jobs&amp;#x2019;s hypocritical stance towards death&amp;#x201D;&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;See also Tyler Cowen's &lt;em&gt;The Great Stagnation&lt;/em&gt; and &lt;a href=&quot;/lw/7xm/peter_thiel_warns_of_upcoming_and_current/&quot;&gt;&amp;#x201C;Peter Thiel warns of upcoming (and current) stagnation&amp;#x201D;&lt;/a&gt;.&lt;/p&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/lw/8yp/prediction_is_hard_especially_of_medicine/#comments"&gt;46 comments&lt;/a&gt;
</description>
</item>
<item>
<title>A Crash Course in the Neuroscience of Human Motivation</title>
<link>http://lesswrong.com/lw/71x/a_crash_course_in_the_neuroscience_of_human/</link>
<guid isPermaLink="true">http://lesswrong.com/lw/71x/a_crash_course_in_the_neuroscience_of_human/</guid>
<pubDate>Sat, 20 Aug 2011 07:15:05 +1000</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/lukeprog"&gt;lukeprog&lt;/a&gt;
&amp;bull;
107 votes
&amp;bull;
&lt;a href="http://lesswrong.com/lw/71x/a_crash_course_in_the_neuroscience_of_human/#comments"&gt;89 comments&lt;/a&gt;
&lt;div&gt;&lt;p&gt;&lt;small&gt;[&lt;a href=&quot;http://commonsenseatheism.com/wp-content/uploads/2011/08/Muehlhauser-A-Crash-Course-in-the-Neuroscience-of-Human-Motivation-08-20-2011.pdf&quot;&gt;PDF&lt;/a&gt; of this article updated Aug. 23, 2011]&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;[&lt;a href=&quot;#preface&quot;&gt;skip to preface&lt;/a&gt;] &lt;/small&gt;&lt;/p&gt;
&lt;p&gt;Whenever I write a new article for Less Wrong, I'm pulled in two opposite directions.&lt;/p&gt;
&lt;p&gt;One force pulls me toward writing short, exciting posts with lots of brain candy and just &lt;em&gt;one main point&lt;/em&gt;. Eliezer has done that kind of thing very well many times: see &lt;a href=&quot;/lw/i3/making_beliefs_pay_rent_in_anticipated_experiences/&quot;&gt;Making Beliefs Pay Rent&lt;/a&gt;, &lt;a href=&quot;/lw/im/hindsight_devalues_science/&quot;&gt;Hindsight Devalues Science&lt;/a&gt;, &lt;a href=&quot;/lw/oj/probability_is_in_the_mind/&quot;&gt;Probability is in the Mind&lt;/a&gt;,&amp;#xA0;&lt;a href=&quot;/lw/nu/taboo_your_words/&quot;&gt;Taboo Your Words&lt;/a&gt;, &lt;a href=&quot;/lw/oi/mind_projection_fallacy/%20%20%20%20&quot;&gt;Mind Projection Fallacy&lt;/a&gt;,&amp;#xA0;&lt;a href=&quot;/lw/iq/guessing_the_teachers_password/&quot;&gt;Guessing the Teacher's Password&lt;/a&gt;, &lt;a href=&quot;/lw/ka/hold_off_on_proposing_solutions/&quot;&gt;Hold Off on Proposing Solutions&lt;/a&gt;, &lt;a href=&quot;/lw/jb/applause_lights/&quot;&gt;Applause Lights&lt;/a&gt;, &lt;a href=&quot;/lw/of/dissolving_the_question/&quot;&gt;Dissolving the Question&lt;/a&gt;, and &lt;a href=&quot;/lw/no/how_an_algorithm_feels_from_inside/&quot;&gt;many&lt;/a&gt; &lt;a href=&quot;/lw/he/knowing_about_biases_can_hurt_people/&quot;&gt;more&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;Another force pulls me toward writing long, factually dense posts that fill in as many of the pieces of a particular argument in one fell swoop as possible. This is largely because I want to write about the cutting edge of human knowledge but I keep realizing that the inferential gap is &lt;a href=&quot;/lw/kg/expecting_short_inferential_distances&quot;&gt;larger than I had anticipated&lt;/a&gt;, and I want to fill in that inferential gap quickly so I can get to the cutting edge.&lt;/p&gt;
&lt;p&gt;For example, I had to draw on dozens of Eliezer's posts just to &lt;a href=&quot;http://wiki.lesswrong.com/wiki/No-Nonsense_Metaethics&quot;&gt;say&lt;/a&gt; I was &lt;em&gt;heading toward&lt;/em&gt;&amp;#xA0;&lt;a href=&quot;http://wiki.lesswrong.com/wiki/No-Nonsense_Metaethics&quot;&gt;my metaethics sequence&lt;/a&gt;. I've also published 21 new posts (many of them quite long and heavily researched) written specifically because I need to refer to them in my metaethics sequence.&lt;sup&gt;1&lt;/sup&gt;&amp;#xA0;I tried to make these posts interesting and useful on their own, but my primary motivation for writing them was that I need them for my metaethics sequence.&lt;/p&gt;
&lt;p&gt;And now I've written only&amp;#xA0;&lt;em&gt;four posts&lt;/em&gt;&lt;sup&gt;2&lt;/sup&gt;&amp;#xA0;in my metaethics sequence and already the inferential gap to my next post in that sequence is huge again. :(&lt;/p&gt;
&lt;p&gt;So I'd like to try an experiment. I won't do it often, but I want to try it at least once. Instead of writing 20 more short posts between now and the next post in my metaethics sequence, I'll attempt to fill in a big chunk of the inferential gap to my next metaethics post in one fell swoop by writing a long tutorial post (&lt;em&gt;a la&lt;/em&gt;&amp;#xA0;Eliezer's tutorials on &lt;a href=&quot;http://yudkowsky.net/rational/bayes&quot;&gt;Bayes' Theorem&lt;/a&gt; and &lt;a href=&quot;http://yudkowsky.net/rational/technical&quot;&gt;technical explanation&lt;/a&gt;).&lt;sup&gt;3&lt;/sup&gt;&lt;/p&gt;
&lt;p&gt;So if you're not up for a 20-page tutorial on human motivation, this post isn't for you, but I hope you're glad I bothered to write it for the sake of others. If you &lt;em&gt;are&lt;/em&gt;&amp;#xA0;in the mood for a 20-page tutorial on human motivation, please proceed.&lt;/p&gt;
&lt;p&gt;&lt;a id=&quot;more&quot;&gt;&lt;/a&gt;&lt;/p&gt;
&lt;p align=&quot;center&quot;&gt;&lt;img src=&quot;http://commonsenseatheism.com/wp-content/uploads/2011/08/HumanMotivation_title.png&quot; alt=&quot;&quot;&gt;&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Who knows what I want to do? Who knows what anyone wants to do? How can you&amp;#xA0;be sure about something like that? Isn&amp;#x2019;t it all a question of brain chemistry, signals&amp;#xA0;going back and forth, electrical energy in the cortex? How do you know whether&amp;#xA0;something is really what you want to do or just some kind of nerve impulse in the&amp;#xA0;brain. Some minor little activity takes place somewhere in this unimportant place in&amp;#xA0;one of the brain hemispheres and suddenly I want to go to Montana or I don&amp;#x2019;t want&amp;#xA0;to go to Montana.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p align=&quot;right&quot;&gt;- Don DeLillo, &lt;em&gt;&lt;a href=&quot;http://www.amazon.com/White-Noise-Penguin-Classics-Deluxe/dp/0143105981/&quot;&gt;White Noise&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;
&lt;h4&gt;&lt;a name=&quot;Preface&quot;&gt;&lt;/a&gt;Preface&lt;/h4&gt;
&lt;p&gt;How do we value things, and choose between options? Philosophers, economists, and psychologists have long tried to answer these questions. But human behavior continues to defy our most subtle models of it, and the algorithms producing our behavior remained hidden in a black box.&lt;/p&gt;
&lt;p&gt;But now, neuroscientists are directly measuring the neurons whose firing rates encode value and produce our choices. We know a lot more about the neuroscience of human motivation than you might think. Now we can peer directly into the black box of human motivation, and begin (dimly) to read our own source code.&lt;/p&gt;
&lt;p&gt;The neuroscience of human motivation has implications for philosophy of mind and action, for &lt;a href=&quot;/lw/3nn/scientific_selfhelp_the_state_of_our_knowledge/&quot;&gt;scientific self-help&lt;/a&gt;, and for &lt;a href=&quot;http://wiki.lesswrong.com/wiki/No-Nonsense_Metaethics&quot;&gt;metaethics&lt;/a&gt; and &lt;a href=&quot;http://en.wikipedia.org/wiki/Friendly_artificial_intelligence&quot;&gt;Friendly AI&lt;/a&gt;. (We &lt;a href=&quot;/lw/5sk/inferring_our_desires/&quot;&gt;don't really know what we want&lt;/a&gt;, and looking directly at the algorithms that produce human wanting might help in solving this mystery.)&lt;/p&gt;
&lt;p&gt;So, I wrote a crash course in the neuroscience of human motivation.&lt;/p&gt;
&lt;p&gt;The purpose of this document is not to &lt;em&gt;argue&lt;/em&gt;&amp;#xA0;for any of the conclusions presented within it. That would require not a long blog post but instead a couple 500-page books &amp;#x2014; say, &lt;em&gt;&lt;a href=&quot;http://www.amazon.com/Foundations-Neuroeconomic-Analysis-Paul-Glimcher/dp/0199744254/&quot;&gt;Foundations of Neuroeconomic Analysis&lt;/a&gt;&lt;/em&gt;&amp;#xA0;and &lt;em&gt;&lt;a href=&quot;http://www.amazon.com/Handbook-Reward-Decision-Making-Jean-Claude/dp/0123746205/&quot;&gt;Handbook of Reward and Decision Making&lt;/a&gt;&lt;/em&gt;&amp;#xA0;(my two greatest sources for this post).&lt;sup&gt;4&lt;/sup&gt;&lt;/p&gt;
&lt;p&gt;Instead, I merely want to summarize the current mainstream scientific picture on the neuroscience of human motivation, explain some of the concepts it uses, and tell a few stories about how our current picture of human motivation developed.&lt;/p&gt;
&lt;p&gt;As you read this, I hope that many questions and objections will come to mind, because it's &lt;em&gt;not&lt;/em&gt;&amp;#xA0;the full story. That's why I went to the trouble of linking to PDFs of almost all my sources (see &lt;a href=&quot;#References&quot;&gt;References&lt;/a&gt;): so you can check the full data and the full arguments yourself if you like.&lt;/p&gt;
&lt;p&gt;This document is long. You may prefer to read it in sections.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4&gt;Contents:&lt;/h4&gt;
&lt;ol&gt;
&lt;li&gt;&lt;a href=&quot;#FolkPsychology&quot;&gt;Folk Psychology&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#NeoclassicalEconomics&quot;&gt;Neoclassical Economics&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#BehaviorismAnd&quot;&gt;Behaviorism and Reinforcement Learning&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#ReinforcementLearning&quot;&gt;Reinforcement Learning and Decision Theory&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#TheTurn&quot;&gt;The Turn to the Brain&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#HebbianLearning&quot;&gt;Hebbian Learning&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#ExpectedUtility&quot;&gt;Expected Utility in Neurons&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#RealTime&quot;&gt;Real-Time Updates to Expected Utility&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#ArgmaxAnd&quot;&gt;Argmax and Reservation Price&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#RandomUtility&quot;&gt;Random Utility&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#Discounting&quot;&gt;Discounting&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#RelativeAnd&quot;&gt;Relative and Absolute Utility&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#Normalization&quot;&gt;Normalization&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#AreActions&quot;&gt;Are Actions Choices?&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#ThePrimate&quot;&gt;The Primate Choice Mechanism: A Brief Review&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#MarginalUtility&quot;&gt;Marginal Utility and Reference Dependence&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#ValuationIn&quot;&gt;Valuation in the Brain&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#SummaryAnd&quot;&gt;Summary and Research Directions&lt;/a&gt;&lt;/li&gt;
&lt;/ol&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&quot;#Notes&quot;&gt;Notes&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;#References&quot;&gt;References&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4&gt;&lt;a name=&quot;FolkPsychology&quot;&gt;&lt;/a&gt;Folk Psychology&lt;/h4&gt;
&lt;p&gt;There are these things called 'humans' on planet Earth. They undergo metabolism and cell growth. They produce waste. They maintain homeostasis. They reproduce. They move. They communicate. Sometimes they have &lt;a href=&quot;http://i.imgur.com/WLYH1.jpg&quot;&gt;pillow fights&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;Some of these human processes are 'automatic', like cell growth and breathing. Other processes are 'intentional' or 'willed', like moving and communicating and having pillow fights. We call these latter processes&amp;#xA0;&lt;em&gt;intentional actions&lt;/em&gt;, or simply &lt;em&gt;actions&lt;/em&gt;. Sometimes we're not sure &lt;a href=&quot;/lw/o0/where_to_draw_the_boundary/&quot;&gt;where to draw the line&lt;/a&gt; between automatic processes and actions, but this should become clearer as we learn more. In the meantime, we ask...&lt;/p&gt;
&lt;p&gt;How can we &lt;a href=&quot;http://yudkowsky.net/rational/technical&quot;&gt;explain&lt;/a&gt;&amp;#xA0;human actions?&lt;/p&gt;
&lt;p&gt;One popular explanation is '&lt;a href=&quot;http://en.wikipedia.org/wiki/Folk_psychology&quot;&gt;folk psychology&lt;/a&gt;.' Folk psychology posits that we humans have beliefs and desires, and that &lt;em&gt;we are motivated to do what we believe will fulfill our desires&lt;/em&gt;.&lt;/p&gt;
&lt;p&gt;I desire to eat a cookie. I believe I can fulfill that desire if I walk to the kitchen and put one of the cookies there into my mouth. So I am motivated to walk to the kitchen and put a cookie in my mouth.&lt;/p&gt;
&lt;p&gt;Of course there are complications. For example I have multiple desires. Suppose I desire to eat a cookie and believe there are cookies in the kitchen. But I also desire to remain sitting comfortably in the living room. Can I satisfy &lt;em&gt;both&lt;/em&gt;&amp;#xA0;desires? I also believe that if I nicely ask my friend in the kitchen to bring me a cookie, she will. So I ask her to bring me a cookie and I begin to eat it, without having to leave the comfy living room sofa. We still explain my behavior with constructs like 'beliefs' and 'desires', but we consider more than one of each to do so.&lt;/p&gt;
&lt;p&gt;Most of us use folk psychology every day to successfully predict human behavior. I believe that my friend &lt;em&gt;desires&lt;/em&gt;&amp;#xA0;to do nice things for me on occasion if they're not too much trouble, and I believe that my friend, once I tell her I want a cookie, will&amp;#xA0;&lt;em&gt;believe&lt;/em&gt;&amp;#xA0;she can be nice to me without much trouble if she brings me a cookie from the kitchen. So, I predict that my friend will bring me a cookie when I ask her. So I ask her, and &lt;em&gt;behold&lt;/em&gt;!&amp;#xA0;My prediction was correct. I am happily eating a cookie on the sofa.&lt;/p&gt;
&lt;p&gt;But folk psychology (FP) faces some problems.&lt;sup&gt;5&lt;/sup&gt; Consider its context in history:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;The presumed domain of FP used to be much larger than it is now. In primitive cultures, the behavior of most of the elements of nature were understood in intentional terms. The wind could know anger, the moon jealousy, the river generosity&amp;#x2026; These were not metaphors&amp;#x2026; the animistic approach to nature has dominated our history, and it is only in the last two or three thousand years that we have restricted FP&amp;#x2019;s literal application to the domain of the higher animals.&lt;/p&gt;
&lt;p&gt;[Even still,] the FP of the Greeks is essentially the FP we uses today&amp;#x2026; This is a very long period of stagnation and infertility for any theory to display, especially when faced with such an enormous backlog of anomalies and mysteries in its own explanatory domain&amp;#x2026; To use Imre Lakatos&amp;#x2019; terms, FP is a stagnant or degenerating research program, and has been for millennia.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Consider also its prospects for inter-theoretic reduction:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;If we approach homo sapiens from the perspective of natural history and the physical sciences, we can tell a coherent story of its constitution, development, and behavioral capacities which encompasses particle physics, atomic and molecular theory, organic chemistry, evolutionary theory, biology, physiology, and materialistic neuroscience. The story, though still radically incomplete, is already extremely powerful, outperforming FP at many points even in its own domain. And it is deliberately&amp;#x2026; coherent with the rest of our developing world picture. In short, the greatest theoretical synthesis in [history] is currently in our hands&amp;#x2026;&lt;/p&gt;
&lt;p&gt;But FP is no part of this growing synthesis. Its intentional categories stand magnificently alone, without visible prospect of reduction to that larger corpus. A successful reduction cannot be ruled out, in my view, but FP&amp;#x2019;s explanatory impotence and long stagnation inspire little faith that its categories will find themselves neatly reflected in the framework of neuroscience. On the contrary, one is reminded of how alchemy must have looked as elemental chemistry was taking form, how Aristotelean cosmology must have looked as classical mechanics was being articulated, or how the vitalist conception of life must have looked as organic chemistry marched forward.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Finally, consider the problem of &lt;em&gt;habit&lt;/em&gt;. I sit at my computer and want to type my name, 'Luke.' However, I have just used a special program to switch the function of the keys labeled L and P so that they will input the &lt;em&gt;other&lt;/em&gt;&amp;#xA0;character instead (so that I can play a prank on my friend, who will be using my computer shortly). I &lt;em&gt;believe&lt;/em&gt;&amp;#xA0;that typing the key labeled L will input P instead, but nevertheless when I type my name my fingers fall into their familiar habit and I end up typing my name as 'Puke.' My act of typing was intentional, and yet I &lt;em&gt;didn't&lt;/em&gt; do what I believed would fulfill my desire to type my name.&lt;/p&gt;
&lt;p&gt;Folk psychology faces both successes and failures in explaining human action. Hopefully we can do better.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4&gt;&lt;a name=&quot;NeoclassicalEconomics&quot;&gt;&lt;/a&gt;Neoclassical Economics&lt;/h4&gt;
&lt;p&gt;Folk psychology was updated and quantified by &lt;a href=&quot;http://www.econlib.org/library/Enc1/NeoclassicalEconomics.html&quot;&gt;neoclassical economics&lt;/a&gt;. To summarize:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;One [assumption of] neoclassical economics is &quot;rationality,&quot; in which individuals are said to choose alternatives that maximize expected utilities. In particular, the neoclassical view is that individuals rank all possible alternatives according to how much satisfaction they will bring and then choose the alternative that [they expect] will bring the most satisfaction or utility...&lt;sup&gt;6&lt;/sup&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Let's review this notion of maximizing expected utility. Suppose I can choose one of two boxes sitting before me, red and blue. There is a 10% chance the red box contains a million dollars, and a 90% chance it contains nothing. As for the blue box, I am certain it contains $10,000. The 'expected value' of choosing the red box is (0.1 &amp;#xD7; $1,000,000) + (0.9 &amp;#xD7; $0), which is equal to $100,000. The expected value of choosing the blue box is !1&amp;#xA0;&amp;#xD7; $10,000), or $10,000. An agent that chose whatever had the highest expected value would choose the red box, which has 10 times the expected value of the blue box ($100,000 vs. $10,000).&lt;/p&gt;
&lt;p&gt;But humans don't value things only according to their dollar value. A million dollars might have 10 times the &lt;em&gt;objective&lt;/em&gt;&amp;#xA0;value&amp;#xA0;of $100,000, but it might have less than 10 times the &lt;em&gt;subjective&lt;/em&gt;&amp;#xA0;value&amp;#xA0;of $100,000 because after $100,000 you only care a little how much &lt;em&gt;more&lt;/em&gt;&amp;#xA0;wealthy you are.&lt;/p&gt;
&lt;p&gt;Or, you might be &lt;a href=&quot;http://en.wikipedia.org/wiki/Risk_aversion&quot;&gt;risk averse&lt;/a&gt;. You might prefer a sure thing to something that is uncertain. So a 10% chance of a million dollars might be worth &lt;em&gt;less&lt;/em&gt; &amp;#x2014; in subjective value &amp;#x2014; than a 100% chance of $10,000. If you are risk averse you might choose the blue box because it has higher expected &lt;em&gt;subjective&lt;/em&gt; value even though it has lower expected &lt;em&gt;objective&lt;/em&gt; value.&lt;/p&gt;
&lt;p&gt;We call objective value simply 'value'. We call subjective value 'utility.'&lt;/p&gt;
&lt;p&gt;Neoclassical economics quantifies folk psychology by measuring the strength of belief with &lt;em&gt;probability&lt;/em&gt; and by measuring the strength of desire with &lt;em&gt;utility&lt;/em&gt;. It then says that humans act so as to maximize &lt;em&gt;expected utility&lt;/em&gt;, a measure that combines the utility of particular thing with your subjective probability of getting it.&lt;sup&gt;7&lt;/sup&gt;&lt;/p&gt;
&lt;p&gt;This neoclassical model of human behavior has faced many challenges, and is regularly revised in the face of new evidence.&lt;sup&gt;8&lt;/sup&gt; For example, Loewenstein (1987) found that if students were asked to place a value on the opportunity to kiss a celebrity of their choice 1-5 days in the future, they placed the highest value on a kiss in 3 days. This didn't fit any existing neoclassical models of utility, but was explained in 2001 when Caplin &amp;amp; Leahy (2001) incorporated &quot;anticipatory feeling&quot; into the neoclassical model, explaining that the students got some utility from&amp;#xA0;&lt;em&gt;anticipating&lt;/em&gt;&amp;#xA0;the kiss with the celebrity (but also, as usual, discounted the utility of a reward the further away it was in the future), and this is why they didn't want the kiss right away.&lt;/p&gt;
&lt;p&gt;Keep in mind that economists don't argue that we &lt;em&gt;actually&lt;/em&gt;&amp;#xA0;compute the expected utility of each option before us and then choose the best one, but that we always act &quot;as if&quot; we were doing that.&lt;sup&gt;9&lt;/sup&gt;&lt;/p&gt;
&lt;p&gt;But sometimes we don't even act &quot;as if&quot; we are obeying the axioms of neoclassical economics. For example, the &lt;a name=&quot;independence&quot;&gt;&lt;/a&gt;independence axiom of expected utility theory &lt;a href=&quot;/lw/71x/a_crash_course_in_the_neuroscience_of_human/4o7t&quot;&gt;says&lt;/a&gt; that if you prefer an apple over an orange, then you must prefer the Gamble A (72% chance you get an apple, otherwise you get a cat) over the Gamble B (72% chance you get an orange, otherwise you get a cat). But Allais (1953) found that subjects &lt;em&gt;do&lt;/em&gt;&amp;#xA0;violate this basic assumption under some conditions.&lt;/p&gt;
&lt;p&gt;Such violations of the basic axioms of neoclassical economics led to the development of &lt;a href=&quot;http://en.wikipedia.org/wiki/Behavioral_economics&quot;&gt;behavioral economics&lt;/a&gt;&amp;#xA0;and theories like Kahneman and Tversky's (1979) &lt;a href=&quot;/lw/6kf/prospect_theory_a_framework_for_understanding/&quot;&gt;prospect theory&lt;/a&gt;,&lt;sup&gt;10&lt;/sup&gt;&amp;#xA0;which transcends some assumptions of the neoclassical model. But these new theories don't fit the data perfectly, either.&lt;sup&gt;11&lt;/sup&gt;&lt;/p&gt;
&lt;p&gt;The models of human motivation we've surveyed so far are conceptually related to &lt;a href=&quot;http://en.wikipedia.org/wiki/Decision_theory&quot;&gt;decision theory&lt;/a&gt;&amp;#xA0;(beliefs and desires, or probabilities and utilities), so I'll call them 'decision-theoretic models' of human motivation. We'll discuss decision-theoretic models again when we finally get to the topic of neuroscience, but for now I want to discuss a different approach to motivation.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4&gt;&lt;a name=&quot;BehaviorismAnd&quot;&gt;&lt;/a&gt;Behaviorism and Reinforcement Learning&lt;/h4&gt;
&lt;p&gt;While neoclassical economists formulated expected utility theory, behaviorist psychologists developed a different set of explanations for human action. Though behaviorists were &lt;a href=&quot;/lw/sr/the_comedy_of_behaviorism/&quot;&gt;wrong&lt;/a&gt;&amp;#xA0;when they said that science can't talk about mental activity or mental states, you &lt;a href=&quot;/lw/6i5/behaviorism_beware_anthropomorphizing_humans/&quot;&gt;can charitably think of&lt;/a&gt; behaviorists as playing a game of &lt;a href=&quot;/lw/nu/taboo_your_words/&quot;&gt;Rationalist's Taboo&lt;/a&gt; with constructs of folk psychology like &quot;want&quot; or &quot;fear&quot; in order to get at phenomena more appropriate for quantification in&amp;#xA0;&lt;a href=&quot;http://yudkowsky.net/rational/technical&quot;&gt;technical explanation&lt;/a&gt;. Also, the behaviorist approach led to 'reinforcement learning', an important concept in the neuroscience of human motivation.&lt;/p&gt;
&lt;p&gt;Before I explain reinforcement learning, let's recall&amp;#xA0;&lt;a href=&quot;/lw/6iu/basics_of_animal_reinforcement/&quot;&gt;operant conditioning&lt;/a&gt;:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Stick a pigeon in a box with a lever and some associated machinery (a &quot;Skinner box&quot;). The pigeon wanders around, does various things, and eventually hits the lever. Delicious sugar water squirts out. The pigeon continues wandering about and eventually hits the lever again. Another squirt of delicious sugar water. Eventually it percolates into its tiny pigeon brain that maybe pushing this lever makes sugar water squirt out. It starts pushing the lever more and more, each push continuing to convince it that yes, this is a good idea.&lt;/p&gt;
&lt;p&gt;Consider a second, less lucky pigeon. It, too, wanders about in a box and eventually finds a lever. It pushes the lever and gets an electric shock. Eh, maybe it was a fluke. It pushes the lever again and gets another electric shock. It starts thinking &quot;Maybe I should stop pressing that lever.&quot; The pigeon continues wandering about the box doing anything and everything other than pushing the shock lever.&lt;/p&gt;
&lt;p&gt;The basic concept of operant conditioning is that an animal will repeat behaviors that give it reward, but avoid behaviors that give it punishment.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Behaviorism &lt;a href=&quot;http://commonsenseatheism.com/?p=13607#history&quot;&gt;died&lt;/a&gt; in the wake of cognitive psychology, but its approach to&amp;#xA0;motivation turned out to be very useful in the field of artificial intelligence, where it is called &lt;a href=&quot;http://www.scholarpedia.org/article/Reinforcement_learning&quot;&gt;reinforcement learning&lt;/a&gt;:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Reinforcement learning is learning what to do &amp;#x2014; how to map situations to actions &amp;#x2014; so as to maximize a numerical reward signal. The learner is not told which actions to take, as in most forms of machine learning, but instead must discover which actions yield the most reward by trying them. In the most interesting and challenging cases, actions may affect not only the immediate reward, but also the next situation and, through that, all subsequent rewards. These two characteristics &amp;#x2014; trial-and-error search and delayed reward &amp;#x2014; are the two most important distinguishing features of reinforcement learning.&lt;/p&gt;
&lt;p&gt;To obtain a lot of reward, a reinforcement learning agent must prefer actions that it has tried in the past and found to&amp;#xA0;be effective in producing reward. But to discover such actions it has to try actions that it has not selected before. The agent has to &lt;em&gt;exploit&lt;/em&gt; what it already knows in order to obtain reward, but it also has to &lt;em&gt;explore&lt;/em&gt; in order to make better action selections in the future. The dilemma is that neither exploitation nor exploration can be pursued exclusively without failing at the task. The agent must try a variety of actions and progressively favor those that appear to be best..&lt;sup&gt;12&lt;/sup&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;In addition to the agent and its environment, there are four major components of a reinforcement learning system:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;...a &lt;em&gt;policy&lt;/em&gt;, a &lt;em&gt;reward function&lt;/em&gt;, a &lt;em&gt;value function&lt;/em&gt;, and, optionally, a &lt;em&gt;model&lt;/em&gt; of the environment.&lt;/p&gt;
&lt;p&gt;A &lt;em&gt;policy&lt;/em&gt; defines the learning agent's way of behaving at a given time. Roughly speaking, a policy is a mapping from perceived states of the environment to actions to be taken when in those states...&lt;/p&gt;
&lt;p&gt;A &lt;em&gt;reward function&lt;/em&gt; defines the goal in a reinforcement learning problem. Roughly speaking, it maps perceived states (or state-action pairs) of the environment to a single number, a reward, indicating the intrinsic desirability of the state. A reinforcement-learning agent's sole objective is to maximize the total reward it receives in the long run. ...[A reward function may] be used as a basis for changing the policy. For example, if an action selected by the policy is followed by low reward, then the policy may be changed to select some other action in that situation in the future...&lt;/p&gt;
&lt;p&gt;Whereas a reward function indicates what is good in an immediate sense, a &lt;em&gt;value function&lt;/em&gt; specifies what is good in the long run. Roughly speaking, the value of a state is the total amount of reward an agent can expect to accumulate over the future starting from that state. Whereas rewards determine the immediate, intrinsic desirability of environmental states, values indicate the long-term desirability of states after taking into account the states that are likely to follow, and the rewards available in those states. For example, a&amp;#xA0;state might always yield a low immediate reward, but still have a high value because it is regularly followed by other states that yield high rewards. Or the reverse could be true...&lt;/p&gt;
&lt;p&gt;Rewards are in a sense primary, whereas values, as predictions of rewards, are secondary. Without rewards there could be no values, and the only purpose of estimating values is to achieve more reward. Nevertheless, it is values with which we are most concerned when making and evaluating decisions. Action choices are made on the basis of value judgments. We seek actions that bring about states of highest value, not highest reward, because these actions obtain for us the greatest amount of reward over the long run...&lt;/p&gt;
&lt;p&gt;...The fourth and final element of some reinforcement learning systems is a &lt;em&gt;model&lt;/em&gt; of the environment. This is something that mimics the behavior of the environment. For example, given a state and action, the model might predict the resultant next state and next reward. Models are used for planning, by which we mean any way of deciding on a course of action by considering possible future situations before they are actually experienced.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Want an example? Here is how a reinforcement learning agent would learn to play &lt;a href=&quot;http://en.wikipedia.org/wiki/Tic-tac-toe&quot;&gt;Tic-Tac-Toe&lt;/a&gt;:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;First we set up a table of numbers, one for each possible state of the game. Each number will be the latest estimate of the probability of our winning from that state. We treat this estimate as the state's current value, and the whole table is the learned value function. State A has higher value than state B, or is considered 'better' than state B, if the current estimate of the probability of our winning from A is higher than it is from B. Assuming we always play Xs, then for all states with three Xs in a row the probability of winning is 1, because we have already won. Similarly, for all states with three &amp;#xD8;s in a row... the correct probability is 0, as we cannot win from them. We set the initial values of all the other states, the &lt;em&gt;nonterminals&lt;/em&gt;, to 0.5, representing an informed guess that we have a 50% chance of winning.&lt;/p&gt;
&lt;p&gt;Now we play many games against the opponent. To select our moves we examine the states that would result from each of our possible moves (one for each blank space on the board) and look up their current values in the table. Most of the time we move &lt;em&gt;greedily&lt;/em&gt;, selecting the move that leads to the state with greatest value, that is, with the highest estimated probability of winning. Occasionally, however, we select randomly from one of the other moves instead; these are called &lt;em&gt;exploratory&lt;/em&gt;&amp;#xA0;moves because they cause us to experience states that we might otherwise never see.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;A sequence of Tic-Tac-Toe moves might look like this:&lt;sup&gt;13&lt;/sup&gt;&lt;/p&gt;
&lt;p align=&quot;center&quot;&gt;&lt;img src=&quot;http://commonsenseatheism.com/wp-content/uploads/2011/08/tic-tac-toe.png&quot; alt=&quot;&quot;&gt;&lt;/p&gt;
&lt;p&gt;Solid lines are the moves our reinforcement learning agent made, and dotted lines are moves it considered but did &lt;em&gt;not&lt;/em&gt;&amp;#xA0;make.&amp;#xA0;The second move was an exploratory move: it was taken even though another sibling move, that leading to e*, was ranked higher.&lt;/p&gt;
&lt;p&gt;While playing, the agent changes the values assigned to the states it finds itself in. To improve its estimates concerning the probability of winning from various states, it 'backs up' the value of state after each greedy move to the state &lt;em&gt;before&lt;/em&gt;&amp;#xA0;the move (as suggested by the arrows.) What this means is that the value of the earlier state is adjusted to be closer to the value of the later state.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;If we let s denote the state before the greedy move, and s' the state after, then the update to the estimated value of s, denoted V(s), can be written:&lt;/p&gt;
&lt;p&gt;V(s) &amp;lt;- V(s) + &amp;#x3B1;[V(s') - V(s)]&lt;/p&gt;
&lt;p&gt;where &amp;#x3B1; is a small positive fraction called the &lt;em&gt;step-size parameter&lt;/em&gt;, which influences the rate of learning. The update rule is an example of a &lt;em&gt;temporal difference&lt;/em&gt;&amp;#xA0;learning method, so called because its changes are based on a difference... between [value] estimates at two different times.&lt;/p&gt;
&lt;p&gt;...if the step-size parameter is reduced properly over time, this method converges, for any [unchanging] opponent, to the true probabilities of winning from each state give optimal play by the agent.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;And that's how a simple version of temporal difference (TD) reinforcement learning works.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4&gt;&lt;a name=&quot;ReinforcementLearning&quot;&gt;&lt;/a&gt;Reinforcement Learning and Decision Theory&lt;/h4&gt;
&lt;p&gt;You may have noticed a key advantage of reinforcement learning: an agent using it can be 'dumber' than a decision-theoretic agent. It can just start with guesses (&quot;What the hell; let's try 50%!&quot;) for the value of various states, and then it learns their &lt;em&gt;true&lt;/em&gt; values by running through &lt;em&gt;many, many&amp;#xA0;trials&lt;/em&gt;.&lt;/p&gt;
&lt;p&gt;But what if you don't &lt;em&gt;have&lt;/em&gt;&amp;#xA0;many trials to run through, and you need to make an important decision right now?&lt;/p&gt;
&lt;p&gt;Then you have to be smart. You need to have a good model of the world and use decision theory to choose the action with the highest expected utility.&lt;/p&gt;
&lt;p&gt;&lt;span&gt;This is precisely what rationality&amp;#xA0;&lt;/span&gt;&amp;#x2014;&lt;span&gt;&amp;#xA0;being &lt;em&gt;good&lt;/em&gt;&amp;#xA0;at building correct models of the world&amp;#xA0;&lt;/span&gt;&amp;#x2014;&lt;span&gt;&amp;#xA0;&lt;a href=&quot;/lw/34a/goals_for_which_less_wrong_does_and_doesnt_help/&quot;&gt;is especially good for&lt;/a&gt;:&lt;/span&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;For some tasks, the world provides rich, inexpensive empirical feedback. In these tasks you hardly need reasoning. &amp;#xA0;Just try the task many ways... and take care to notice what is and isn&amp;#x2019;t giving you results.&lt;/p&gt;
&lt;p&gt;Thus, if you want to learn to sculpt, [studying rationality] is a bad way to go about it. Better to find some clay and a hands-on sculpting course. The situation is similar for small talk, cooking, selling, programming, and many other useful skills.&lt;/p&gt;
&lt;p&gt;Unfortunately, most of us also have goals for which we can obtain no such ready success/failure data. For example, if you want to know whether cryonics is a good buy, you can&amp;#x2019;t just try buying it and not-buying it and see which works better. &amp;#xA0;If you miss your first bet, you&amp;#x2019;re out for good.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Reinforcement learning can be a good strategy if you have time to learn from many trials. If you've only got &lt;em&gt;one shot&lt;/em&gt; at a problem, you'd better build up a really accurate model of the world first and then try to maximize expected utility.&lt;/p&gt;
&lt;p&gt;Now, back to our story.&lt;/p&gt;
&lt;p&gt;It turns out that reinforcement learning seems to underlie many of our mental processes. (More on this later.)&lt;/p&gt;
&lt;p&gt;The lesson Yvain &lt;a href=&quot;/lw/6i5/behaviorism_beware_anthropomorphizing_humans/&quot;&gt;drew&lt;/a&gt;&amp;#xA0;from this discovery was:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Reinforcement learning is &lt;a href=&quot;/lw/l2/protein_reinforement_and_dna_onsequentialism/&quot;&gt;evolution writ small&lt;/a&gt;; behaviors propagate or die out based on their consequences to reinforcement in a mind, just as mutations propagate or die out based on their consequences to reproduction in an organism. In the behaviorist model, our mind is not an agent, but a flourishing ecosystem of behaviors both physical and mental, all scrabbling for supremacy and mutating into more effective versions of themselves.&lt;/p&gt;
&lt;p&gt;Just as evolving organisms are &lt;a href=&quot;/lw/l0/adaptationexecuters_not_fitnessmaximizers/&quot;&gt;adaptation-executors and not fitness-maximizers&lt;/a&gt;, so minds are behavior-executors and not utility-maximizers.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;But things are a bit more complicated than that, as we'll now see.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4&gt;&lt;a name=&quot;TheTurn&quot;&gt;&lt;/a&gt;The Turn to the Brain&lt;/h4&gt;
&lt;blockquote&gt;
&lt;p&gt;I hesitate to say that men will ever have the means&amp;#xA0;of measuring directly the feelings of the human&amp;#xA0;heart. It is from the quantitative effects of the&amp;#xA0;feelings that we must estimate their comparative&amp;#xA0;amounts.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p align=&quot;right&quot;&gt;William Jevons (1871)&lt;/p&gt;
&lt;p&gt;It turns out that Jevons was wrong. Modern neuroscience allows us to peer into the black box of the human value system and measure directly &quot;the feelings of the human heart.&quot;&lt;sup&gt;14&lt;/sup&gt;&lt;/p&gt;
&lt;p&gt;We'll begin with the experiments of &lt;a href=&quot;http://www.pdn.cam.ac.uk/staff/schultz/&quot;&gt;Wolfram Shultz&lt;/a&gt;. Schultz recorded the activity of single dopamine neurons in monkeys who sat in front of a water spout. At irregular intervals, a speaker played a tone and a drop of water dropped from the spout.&lt;sup&gt;15&lt;/sup&gt;&amp;#xA0;The monkeys' dopamine neurons normally fired at the baseline rate, but responded with a burst of activity when water was delivered. Over time, though, the neurons responded less and less to the water and more and more to the tone.&lt;/p&gt;
&lt;p&gt;But if Schultz delivered water without first giving the tone, then the dopamine neurons responded with a burst of activity again. And if he played the tone and &lt;em&gt;didn't&lt;/em&gt;&amp;#xA0;provide water, the neurons reduced their firing rates &lt;em&gt;below&lt;/em&gt;&amp;#xA0;the baseline. The neurons weren't responding to the water itself but to a difference between expected reward and actual reward &amp;#x2014; a reward prediction error (RPE).&lt;/p&gt;
&lt;p&gt;Two other researchers, &lt;a href=&quot;http://www.hnl.bcm.tmc.edu/faculty.html&quot;&gt;Read Montague&lt;/a&gt; and &lt;a href=&quot;http://www.gatsby.ucl.ac.uk/~dayan/&quot;&gt;Peter Dayan&lt;/a&gt;, noticed that these patterns of neuronal activity were exactly predicted by TD reinforcement learning theory from computer science.&lt;sup&gt;16&lt;/sup&gt;&amp;#xA0;In particular, the RPE observed in neurons appeared to play the same role in monkey learning as the difference between value estimates at two different times did in TD reinforcement learning theory.&lt;/p&gt;
&lt;p&gt;Since then, researchers have done many more single-neuron recording studies to test particular versions of TD reinforcement learning and revise the theory until it predicts more and more behavior while also predicting novel experimental discoveries.&lt;/p&gt;
&lt;p&gt;Caplin &amp;amp; Dean&lt;sup&gt;17&lt;/sup&gt; provided another way to test the hypothesis that dopamine neurons encoded RPE in a TD-class model. They showed that all existing RPE-models could be reduced to three axiomatic statements. If a system violated one of these axioms, it could not be an RPE system. Later, Caplin et al. (2010) tested the axioms on actual brain activity to see if they held up. They did. This is another reason why so many scientists working in this field believe the current 'dopamine hypothesis' &amp;#x2014; that dopamine neurons encode RPE in a TD-class reinforcement learning system in the brain.&lt;/p&gt;
&lt;p&gt;TD-class reinforcement learning works in computers by updating numbers that represent the values of states. How does reinforcement learning work when using nerve cells?&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4&gt;&lt;a name=&quot;HebbianLearning&quot;&gt;&lt;/a&gt;Hebbian Learning&lt;/h4&gt;
&lt;p&gt;By &lt;a href=&quot;http://en.wikipedia.org/wiki/Hebbian_theory&quot;&gt;Hebbian learning&lt;/a&gt;, of course. &quot;Cells that fire together, wire together.&quot;&lt;/p&gt;
&lt;p&gt;Imagine a neural pathway (in one of Pavlov's dogs) that connects the neural circuits that sense the ringing of a bell to the neural circuits for salivation. This is a weak connection at first, which is why the bell doesn't initially elicit salivation.&lt;/p&gt;
&lt;p&gt;Also imagine a third neuron that connects the salivation circuit to a circuit that detects food. This is a strong connection, and that's why food &lt;em&gt;does&lt;/em&gt;&amp;#xA0;elicit salivation right away:&lt;sup&gt;18&lt;/sup&gt;&lt;/p&gt;
&lt;p align=&quot;center&quot;&gt;&lt;img src=&quot;http://commonsenseatheism.com/wp-content/uploads/2011/08/Hebbian-circuits.png&quot; alt=&quot;&quot;&gt;&lt;/p&gt;
&lt;p&gt;Donald Hebb proposed:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;When an axon of cell A is near enough to excite cell B and repeatedly or persistently take part in firing it, a growth process of metabolic change takes place in one or both cells such that A's efficacy, as one of the cells firing B, is increased.&lt;sup&gt;19&lt;/sup&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;In short, whenever two connected cells are active at the same time, the synapses connecting them are strengthened.&lt;/p&gt;
&lt;p&gt;Consider Pavlov's experiment. At first, the Bell cell will fire whenever bells ring, but probably not when the salivation cells happen to be active. So, the connection between the Bell cell and the Salivation cell remains weak. But then, Pavlov intervenes and causes the Bell cell and the Salivation cell to fire at the same time by ringing the bell and presenting food at the same time (the Food detector cell already has a strong connection to the Salivation cell). Whenever the Bell cell and the Salivation cell happen to fire at the same time, the synapse between them is strengthened. Once the connection is strong enough, the Bell cell can cause the Salivation cell to fire on its own, just like the Food detector cell can.&lt;/p&gt;
&lt;p&gt;It was a fine theory, but it wasn't observed until Bliss &amp;amp; Lomo (1973) observed Hebb's mechanism at work in the rabbit hippocampus. Today, we know how some forms of Hebb's mechanism work at the molecular level.&lt;sup&gt;20&lt;/sup&gt;&lt;/p&gt;
&lt;p&gt;Later, Wickens (1993) proposed a similar mechanism called the &lt;em&gt;three-factor rule&lt;/em&gt;, according to which some synapses are strengthened whenever presynaptic and postsynaptic activity occurred &lt;em&gt;in the presence of dopamine&lt;/em&gt;. These same synapses might be weakened when activity occurred in the &lt;em&gt;absence&lt;/em&gt;&amp;#xA0;of dopamine. Later studies confirmed this hypothesis.&lt;sup&gt;21&lt;/sup&gt;&lt;/p&gt;
&lt;p&gt;Suppose a monkey receives an unexpected reward and encodes a large positive RPE. Glimcher explains:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;The TD model tells us that under these conditions we want to increment the value attributed to all actions or sensations that have just occurred. Under these conditions, we know that the dopamine neurons release dopamine throughout the frontocortical-basal ganglia loops, and do so in a highly homogenous manner. That means we can think of any neuron equipped with dopamine receptors as 'primed' for synaptic strengthening. When this happens, any segment of the frontocortical-basal ganglia loop that is already active will have its synapses strengthened.&lt;sup&gt;22&lt;/sup&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;We will return to the dopamine system later, but for now let us back up and pursue the neoclassical economic path into the brain.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4&gt;&lt;a name=&quot;ExpectedUtility&quot;&gt;&lt;/a&gt;Expected Utility in Neurons&lt;/h4&gt;
&lt;p&gt;Ever since Friedman (1953), economists have insisted that humans only behave &lt;em&gt;as if&lt;/em&gt;&amp;#xA0;they are utility maximizers, not that they &lt;em&gt;actually&lt;/em&gt;&amp;#xA0;compute expected utility and try to maximize it.&lt;/p&gt;
&lt;p&gt;It was a surprise, then, when neuroscientists stumbled upon the neurons that were encoding expected utility in their firing rates.&lt;/p&gt;
&lt;p&gt;Tanji &amp;amp; Evarts (1976) did their experiments with rhesus monkeys because they are our closest relative besides the apes, and this kind of work is usually forbidden on apes for ethical reasons (we need to implant a recording electrode in the brain).&lt;/p&gt;
&lt;p align=&quot;center&quot;&gt;&lt;img src=&quot;http://commonsenseatheism.com/wp-content/uploads/2011/08/rhesus-monkey.jpg&quot; alt=&quot;&quot;&gt;&lt;/p&gt;
&lt;p&gt;The monkeys were trained to know that a colored light on the screen meant they would soon be offered a reward (a drop of water) either for pushing or pulling, but not for both. This was the &amp;#x2018;ready&amp;#x2019; cue. A second later, researchers gave a &amp;#x2018;direction&amp;#x2019; cue that told the monkeys which action &amp;#x2014; push or pull &amp;#x2014; was going to be rewarded. The third cue was the 'go' signal: if the monkey made the previously indicated movement, it was rewarded.&lt;/p&gt;
&lt;p&gt;This is what they saw:&lt;/p&gt;
&lt;p align=&quot;center&quot;&gt;&lt;img src=&quot;http://commonsenseatheism.com/wp-content/uploads/2011/08/push-pull-smaller.png&quot; alt=&quot;&quot;&gt;&lt;/p&gt;
&lt;p&gt;At the &amp;#x2018;ready&amp;#x2019; cue, the neurons associated with a pushing motion became weakly active (but fired above the baseline rate), and so did the neurons associated with a pulling motion. When the &amp;#x2018;direction&amp;#x2019; cue was given, the neurons associated with the to-be-rewarded motion doubled their firing rate, and the neurons associated with the opposite motion fell back to the baseline rate. Then at the &amp;#x2018;go&amp;#x2019; cue, the neurons associated with the to-be-rewarded movement increased again rapidly, up past the threshhold required to produce movement, and the movement was produced shortly thereafter.&amp;#xA0;&lt;/p&gt;
&lt;p&gt;One tempting explanation of the data is that after the &amp;#x2018;ready&amp;#x2019; cue, the monkey&amp;#x2019;s brain 'decides' there&amp;#x2019;s a 50% chance that pulling will get the reward, and a 50% chance that pushing will get the reward. That&amp;#x2019;s why we see the neuron firing rates associated with those two actions each jump to slightly less than 50% of the movement threshold when the &amp;#x2018;ready&amp;#x2019; cue is given. But then, when the &amp;#x2018;direction&amp;#x2019; cue is given, those expectations shift to 100%/0% or 0%/100%, depending on which action is about to be rewarded according to the &amp;#x2018;direction&amp;#x2019; cue. That&amp;#x2019;s why activity in the circuit associated with the to-be-rewarded action doubles and the other one drops to baseline. And then the &amp;#x2018;go&amp;#x2019; cue is delivered and firing rates blast past the movement threshold, and movement is produced.&lt;/p&gt;
&lt;p&gt;Let's jump ahead to Basso &amp;amp; Wurtz (1997), who did a similar experiment except that they used voluntary eye movements (called &amp;#x2018;saccades&amp;#x2019;) instead of voluntary arm movements. And this time, they presented each monkey with one, two, four, or eight possible targets, instead of just two targets (push and pull) like Tanji &amp;amp; Evarts did.&lt;/p&gt;
&lt;p align=&quot;center&quot;&gt;&lt;img src=&quot;http://commonsenseatheism.com/wp-content/uploads/2011/08/monkey-saccade.png&quot; alt=&quot;&quot;&gt;&lt;/p&gt;
&lt;p&gt;What they found was that as more potential targets were presented, the magnitude of the preparatory activity associated with each target systematically decreased. And again, once the &amp;#x2018;direction&amp;#x2019; and &amp;#x2018;go&amp;#x2019; cues were presented, the activity associated with those other potential targets dropped rapidly and activity burst rapidly in neurons associated with the to-be-rewarded movement. It was as though the monkeys&amp;#x2019; brains were distributing their probability mass evenly across the potentially rewarded actions, and then once they knew which action should in fact be rewarded, they moved all their probability mass to that action and performed the action and got the reward.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4&gt;&lt;a name=&quot;RealTime&quot;&gt;&lt;/a&gt;Real-Time Expected Utility Updates&lt;/h4&gt;
&lt;p&gt;Other researchers showed monkeys a black screen with flickering white dots on it. In each frame of the video, the computer moved each dot in a random direction. The independent variable was a measure called 'coherence.' In a 100% leftward coherence condition, all dots moved to the left. In a 60% rightward condition, 60% of the dots move rightward while the rest moved randomly. And so on.&lt;/p&gt;
&lt;p&gt;In a typical experiment, the researchers would identify a neuron in a monkey's brain that increased its firing rate in response to rightward coherence of the dots, and decreased its firing rate in response to leftward coherent of the dots. Then they would present the monkey with a sequence (in random order) of every possible leftward and rightward coherence condition.&lt;/p&gt;
&lt;p&gt;A leftward coherence (of any magnitude) meant the monkey would be rewarded for leftward eye movement, and a rightward coherence meant the monkey would be rewarded for rightward eye movement. But, the monkey had to wait two seconds before being rewarded.&lt;/p&gt;
&lt;p&gt;In this experiment, the probabilities always started at 50% but then updated continuously. A 100% rightward coherence condition allowed the monkey to very quickly know which voluntary eye movement would be rewarded, but in a 5% rightward coherence condition the expected utility of the rightward target grew more slowly.&lt;/p&gt;
&lt;p&gt;The results? The greater the coherence of rightward motion of the dots, the faster the neurons associated with rightward eye movement increased their firing rate. (A higher coherence meant the monkey was able to update its probabilities more quickly.)&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4&gt;&lt;a name=&quot;ArgmaxAnd&quot;&gt;&lt;/a&gt;Argmax and Reservation Price&lt;/h4&gt;
&lt;p&gt;Many studies show that the brain controls movement by way of a 'winner take all' mechanism that is isomorphic to the &lt;a href=&quot;http://en.wikipedia.org/wiki/Arg_max&quot;&gt;argmax&lt;/a&gt; operation from economics.&lt;sup&gt;23&lt;/sup&gt; That is, there are many possibilities competing for your final choice, but just before your choice the single strongest signal remains after all the others are inhibited.&lt;/p&gt;
&lt;p&gt;This &lt;em&gt;choice mechanism&lt;/em&gt;&amp;#xA0;was investigated in more detail by &lt;a href=&quot;http://depts.washington.edu/pbiopage/people_fac_page.php?fac_ID=28&quot;&gt;Michael Shadlen&lt;/a&gt; and others.&lt;sup&gt;24&lt;/sup&gt;&amp;#xA0;Shadlen gave monkeys the same eye movement task as above, except that the monkeys could make their choice at any time instead of waiting for two seconds. He found that:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;When the direction of the dots is unambiguous, monkeys make their choices quickly.&lt;/li&gt;
&lt;li&gt;As the direction of the dots becomes more ambiguous, they take longer to make their choices.&lt;/li&gt;
&lt;li&gt;Throughout the experiment, the firing rates of neurons in the LIP (part of the 'final common path' for generating eye movement) grew toward a specific threshold level.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;The threshold level acts as a kind of criterion of choice. Once the criterion is met, action is taken. Or in economic terms, the monkeys seemed to set a &lt;em&gt;reservation price&lt;/em&gt; on making certain movements.&lt;sup&gt;25&lt;/sup&gt;&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4&gt;&lt;a name=&quot;RandomUtility&quot;&gt;&lt;/a&gt;Random Utility&lt;/h4&gt;
&lt;p&gt;When deciding between goods of different expected utilities, humans exhibit a &lt;em&gt;stochastic transfer function&lt;/em&gt;:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Consider a human subject choosing between two objects of highly different expected utilities, such as a first lottery with a 50% chance of winning $5 and a second lottery with a 25% chance of winning $5. We observe highly deterministic behavior under these conditions: basically all subjects always choose the 5o% chance of winning $5. But what happens when we increment the value of the 25% lottery? As the amount one stands to win from that lottery is incremented, individual subjects eventually switch their preference. Exactly when they make that switch depends on their idiosyncratic degree of risk aversion. What is most interesting about this behavior for these purposes, though, is that actual human subjects, when presented with this kind of choice repeatedly, are never completely deterministic. As the value of the 25% lottery increases, they begin to show probabilistic behavior &amp;#x2014; selecting the 25% lottery sometimes, but not always.&lt;sup&gt;26&lt;/sup&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Our behavior has an element of randomness in it. &lt;a href=&quot;http://elsa.berkeley.edu/~mcfadden/&quot;&gt;Daniel McFadden&lt;/a&gt; won a Nobel Prize in economics for capturing such behavior using a &lt;em&gt;random utility model&lt;/em&gt;.&lt;sup&gt;27&lt;/sup&gt; The way he did it is to suppose that when a chooser asks himself what a thing is worth, he doesn't get a fixed answer but a variable one. That is, there is actual variation &lt;em&gt;in his preferences&lt;/em&gt;. Thus, his expected utility for a particular lottery is drawn from a &lt;em&gt;distribution&lt;/em&gt; of possible utilities, usually one with a Gaussian variance.&lt;sup&gt;28&lt;/sup&gt;&lt;/p&gt;
&lt;p&gt;This behavior makes sense when we think about the human choice mechanism at the neuronal level, because neuron firing rates are&amp;#xA0;stochastic.&lt;sup&gt;29&lt;/sup&gt; When a neurobiologist says &quot;The neuron was firing at 200 Hz,&quot; what she means is that the mean firing rate of the neuron over a long time and stable conditions would have been close to 200 Hz. So the neurons that encode utility (wherever they are) will exhibit stochasticity, and thereby introduce some randomness into our choices. In this way, neurobiological data &lt;em&gt;constrains&lt;/em&gt;&amp;#xA0;our economic models of human behavior. An economic model without some randomness in it will have difficulty capturing human choices for as long as humans run on neurons.&lt;sup&gt;30&lt;/sup&gt;&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4&gt;&lt;a name=&quot;Discounting&quot;&gt;&lt;/a&gt;Discounting&lt;/h4&gt;
&lt;p&gt;Louie &amp;amp; Glimcher (2010) examined temporal discounting in the brain. The two monkeys in this study were repeatedly asked to choose between a small, immediately available reward and a larger reward available after a small delay. For example, on one day they were asked to choose between 0.13 millileters of juice right now, or else 0.2 millileters of juice available after a delay of 2, 4, 8, or 12 seconds. A monkey might be willing to wait 2, 4, or 8 seconds for the larger reward, but not 12 seconds.&lt;/p&gt;
&lt;p align=&quot;center&quot;&gt;&lt;img src=&quot;http://commonsenseatheism.com/wp-content/uploads/2011/08/monkey-discounting-smaller.png&quot; alt=&quot;&quot;&gt;&lt;/p&gt;
&lt;p&gt;After many, many measurements of this kind, Louie and Glimcher were able to describe the discounting function being used by each monkey. (One of them was more impatient than the other.)&lt;/p&gt;
&lt;p&gt;Moreover, the neurons in the relevant section of the brain fired at rates that reflected each monkey&amp;#x2019;s discounting function. If 0.2 millileters of juice was offered with no delay, the neurons were highly active. If the same reward was offered at a delay of 2 seconds, they were slightly less active. If the same reward was offered after 4 seconds, the neurons were less active still. And so on. As it turned out, the discounting function that captured their choices was identical to the discounting function that captured the firing rates of these neurons.&lt;/p&gt;
&lt;p&gt;This shouldn't be a surprise at this point, but just to confirm: Yes, we can observe discounting in the firing rates of neurons involved in the choice-making process.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4&gt;&lt;a name=&quot;RelativeAnd&quot;&gt;&lt;/a&gt;Relative and Absolute Utility&lt;/h4&gt;
&lt;p&gt;Dorris &amp;amp; Glimcher (2004) observed monkeys and their choice mechanism neurons while the monkeys engaged in repeated plays of the &lt;a href=&quot;http://www.maths.lse.ac.uk/Personal/stengel/TEXTE/insp.pdf&quot;&gt;inspection game&lt;/a&gt;. The study is too involved for me to explain here, but the results suggested that choice mechanism neurons encode &lt;em&gt;relative&lt;/em&gt;&amp;#xA0;expected utilities (relative to other actions under consideration) rather than &lt;em&gt;absolute&lt;/em&gt;&amp;#xA0;expected utilities.&lt;/p&gt;
&lt;p&gt;Tobler et al. (2005) suggested that the brain &lt;em&gt;only&lt;/em&gt;&amp;#xA0;encodes relative expected utilities. But there is reason to suspect this can't be right. If we stored only relative expected utilities, then we would routinely violate the axiom of transitivity (if you prefer A to B and B to C, you can't also prefer C to A). To see why this is the case, consider Glimcher's example (he says 'expected value' instead of 'utility'):&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;...consider a subject trained to choose between objects A and B, where A is $1,000,000 worth of goods and B is $100,000 worth of goods... A system that represented only the relative expected subjective value of A and B would represent SV(A) &amp;gt; SV(B). Next, consider training the same subject to choose between C and D, where C is $1,000 worth of goods and D is $100 worth of goods. Such a system would represent SV(C) &amp;gt; SV(D). What happens when we ask a chooser to select between B and C? For a chooser who represents only relative expected subjective value, the choice should be C: she should pick $1,000 worth of goods over $100,000 worth of goods because it has a higher learned relative expected subjective value. In order for our chooser to... construct transitive preferences across choice sets (and to obey the continuity axiom)... it is required that somewhere in the brain she represent the absolute subjective values of her choices.&lt;sup&gt;31&lt;/sup&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;And we mostly&amp;#xA0;&lt;em&gt;do&lt;/em&gt;&amp;#xA0;seem to obey the axiom of transitivity.&lt;/p&gt;
&lt;p&gt;So if the choice mechanism neurons &lt;em&gt;do&lt;/em&gt;&amp;#xA0;represent relative utilities, then some other neurons elsewhere must encode a more absolute form of utility. Other implications of this are explored in the next section.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4&gt;&lt;a name=&quot;Normalization&quot;&gt;&lt;/a&gt;Normalization&lt;/h4&gt;
&lt;p&gt;&lt;a href=&quot;http://www.cns.nyu.edu/~david/&quot;&gt;David Heeger&lt;/a&gt; showed&lt;sup&gt;32&lt;/sup&gt; that the firing rates of 'feature detector' neurons in the visual cortex captured a response to a feature in the visual field divided by the sum of the activation rates of nearby neurons sensitive to the same image. Thus, these neurons encode not only whether they 'see' the feature they are built to detect, but also how unique it is in the visual field.&lt;/p&gt;
&lt;p&gt;The effect of this is that neurons reacting to the &lt;em&gt;edge&lt;/em&gt;&amp;#xA0;of a visual object fire more actively than others do. Behold! Edge detection!&lt;/p&gt;
&lt;p&gt;It's also an efficient way to encode information about the world. Consider a world where orange dots are ubiquitous. For an animal in that world, it would be wasteful to fire action potentials to represent orange dots. Better to represent the &lt;em&gt;absence&lt;/em&gt;&amp;#xA0;of orange dots, or the transition from orange dots to something else. An optimally efficient encoding method would be sensitive not to the 'alphabet' of &lt;em&gt;all possible inputs&lt;/em&gt;, but to a smaller alphabet of the inputs that &lt;em&gt;actually&lt;/em&gt;&amp;#xA0;appear in the world. This insight was mathematically formalized by Schwartz &amp;amp; Simoncelli (2001).&lt;/p&gt;
&lt;p&gt;The efficiency of this&amp;#xA0;&lt;em&gt;normalization&lt;/em&gt;&amp;#xA0;technique may explain why we've discovered it at work in so many different places in the brain.&lt;sup&gt;33&lt;/sup&gt; And given that we've found it almost everywhere we've looked for it, it wouldn't be a surprise to see it show up in our choice-making circuits. Indeed, Simoncelli &amp;amp; Schwartz's normalization equation may be what our brains use to encode expected utilities that are relative to the other choices under consideration.&lt;/p&gt;
&lt;p&gt;One implication of their equation is that a chooser's errors become more frequent as the size of the choice set grows. Thus, behavioral errors on small choice sets should be rarer than might be predicted by most random utility models, but error rates will increase rapidly with choice set size (and beyond a certain choice set size, choices will appear random).&lt;/p&gt;
&lt;p&gt;Preliminary evidence that choice set size effects error rates has arrived from behavioral economics. For example, consider Iyengar &amp;amp; Lepper's (2000) study of supermarket shoppers. They set up a table showing either 6 or 24 flavors of jams, allowing shoppers to sample as many as they wanted. Customers who saw 24 flavors had a 3% chance of buying a jar, while those who saw only 6 flavors had a 30% chance!&lt;/p&gt;
&lt;p&gt;In another experiment, Iyengar &amp;amp; Lepper let subjects choose one of either 6 or 30 different chocolates. Those who chose from among only 6 options were more satisfied with their selection than those who had been presented with 30 different chocolates.&lt;/p&gt;
&lt;p&gt;These data fit our expectation that as the choice set grows, the frequency of errors in our behavior rises and the likelihood that an option will rise above the threshold for purchase drops. When Louie &amp;amp; Glimcher (2010) investigated this phenomena in monkey choice mechanism neurons, they found it at work there, too. But the process of choice-set editing is still poorly understood, and some recent studies have failed to replicate Iyengar &amp;amp; Lepper's results (Scheibehenne et al. 2010).&lt;/p&gt;
&lt;p&gt;Perhaps the most surprising implication of these findings is that because of neuronal stochasticity, and because errors increase as the choice set grows, we should expect &lt;em&gt;stochastic violations of the &lt;a href=&quot;#independence&quot;&gt;independence axiom&lt;/a&gt;&lt;/em&gt;, and that &lt;em&gt;when choosers face very large choice sets they will essentially ignore the independence axiom&lt;/em&gt;.&lt;/p&gt;
&lt;p&gt;This is a prediction about human behavior not made by earlier models from neoclassical economics, but it is suggested by looking at the neurons involved in human choice-making.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4&gt;&lt;a name=&quot;AreActions&quot;&gt;&lt;/a&gt;Are Actions Choices?&lt;/h4&gt;
&lt;p&gt;But all these data come from experiments where the choices are &lt;em&gt;actions&lt;/em&gt;, and from our knowledge of the brain's &quot;final common path&quot; for producing actions. How do actions map on to choices about lovers and smartphones?&lt;/p&gt;
&lt;p&gt;Studies by &lt;a href=&quot;http://faculty.washington.edu/ghorwitz/wordpress/&quot;&gt;Greg Horowitz&lt;/a&gt; have provided some relevant data, because monkeys had to choose options identified by color rather than by action.&lt;sup&gt;34&lt;/sup&gt;&amp;#xA0;For example in one trial, a 'red' option might offer one reward and a 'green' option might offer a different reward. On each trial, the red and green options would appear at random places on the computer screen, and the monkey could choose a reward with a voluntary eye movement. The key here is that rewards were chosen by color and not by a (particular) action.&lt;/p&gt;
&lt;p align=&quot;center&quot;&gt;&lt;img src=&quot;http://commonsenseatheism.com/wp-content/uploads/2011/08/monkey-decide-red-or-green.png&quot; alt=&quot;&quot;&gt;&lt;/p&gt;
&lt;p&gt;Horowitz found that the choice mechanism neurons showed the same pattern of activation under these conditions as was the case under action-based choice tasks.&lt;/p&gt;
&lt;p&gt;So, it looks like the valuation circuits can store the value of a colored target, and these valuations can be mapped to the choice mechanism. But we don't know much about how this works, yet.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4&gt;&lt;a name=&quot;ThePrimate&quot;&gt;&lt;/a&gt;The Primate Choice Mechanism: A Brief Review&lt;/h4&gt;
&lt;p&gt;Thus far, we have mostly discussed the primate brain's choice mechanism. To review:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;The choice circuit resides in the final common pathway for action.&lt;/li&gt;
&lt;li&gt;It takes as its input a signal that encodes stochastic expected utility, a concept aligned to the random utility term in economic models proposed by McFadden (2005) and Gul &amp;amp; Pesendorfer (2006).&lt;/li&gt;
&lt;li&gt;This input signal is represented by a &lt;em&gt;normalized&lt;/em&gt; firing rate (with Poisson variance, like all neurons).&lt;/li&gt;
&lt;li&gt;As the choice set size grows, so does the error rate.&lt;/li&gt;
&lt;li&gt;Final choice is implemented by an argmax function or a reservation price mechanism. (A single circuit can achieve both modes.&lt;sup&gt;35&lt;/sup&gt;)&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;But how are probability and utility calculated such that they can be fed into the expected utility representations of the choice mechanism? I won't discuss how the brain forms probabilistic beliefs in this article,&lt;sup&gt;36&lt;/sup&gt; so let us turn to the study of how utility is calculated in the brain: the question of &lt;em&gt;valuation&lt;/em&gt;.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4&gt;&lt;a name=&quot;MarginalUtility&quot;&gt;&lt;/a&gt;Marginal Utility and Reference Dependence&lt;/h4&gt;
&lt;p&gt;Consider the following story:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Imagine an animal exploring a novel environment from a nest on a day when both (1) its blood concentration is dilute (and thus its need for water is low) and (2) its blood sugar level is low (and thus its need for food is high). The animal travels west one kilometer from the nest and emerges from the undergrowth into an open clearing at the shores of a large lake. Not very thirsty,&amp;#xA0;the animal bends down to sample the water and finds it... unpalatable... the next day the same animal leaves its nest in the same metabolic state and travels one kilometer to the east, where it discovers a grove of trees that yield a dry but nutritious fruit, a grove of dried apricot trees. It samples the fruit and finds it sweet and highly palatable.&lt;/p&gt;
&lt;p&gt;What has the animal actually learned about the value of going west and the value of going east? It has had a weakly negative experience, in the psychological sense, when going west and a very positive experience when going east. Do these subjective properties of its experience influence what it has learned? Do the stored representations derived from these experiences encode the actual objective values of going west and east, or do they encode the subjective experiences? That is a critical question about what the animal has learned, because it determines what it does when it wakes up thirsty. When it wakes up thirsty it should, in a normative sense, go west towards the... lake, despite the fact that its previous visit west was a negative experience.&lt;sup&gt;37&lt;/sup&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Economists have known this problem for a long time, and solved it with an idea called&amp;#xA0;&lt;em&gt;marginal utility&lt;/em&gt;.&lt;/p&gt;
&lt;p&gt;In neoclassical economics, we view the animal as having two kinds of 'wealth': a sugar wealth and a water wealth (the total store of sugar and water in the animal's body at a given time). A piece of fruit or a sip of water is an &lt;em&gt;increment&lt;/em&gt;&amp;#xA0;in the animal's total sugar or water wealth. The utility of a piece of fruit or a sip of water, then, depends on its current levels of sugar and water wealth.&lt;/p&gt;
&lt;p&gt;On day one, the animal's need for water is low and its need for sugar is high. On that day, the marginal utility of a piece of fruit is greater than the marginal utility of a sip of water. But suppose during the next week the animal has a high blood sugar level. At that time, the marginal utility of a piece of fruit is low. Thus, the marginal utility of a consumable resource depends on wealth. The wealthier the chooser, the lower the marginal utility provided by a fixed amount of gain ('diminishing marginal utility').&lt;/p&gt;
&lt;p&gt;In neoclassical economics, the animal faced with the option of going east or west in the morning would first estimate how much the water and the fruit would change its objective wealth level, and then it would estimate how much those objective changes in wealth would change its utility. That is, it would use objective values to compute its marginal (subjective) utility. If it only had access to the subjective experiences in our story, it couldn't compute a new marginal utilities when it finds itself unexpectedly thirsty.&lt;/p&gt;
&lt;p&gt;The problem with this solution is that the brain does not appear to encode the objective values of stimuli, and humans &lt;em&gt;behaviorally&lt;/em&gt;&amp;#xA0;don't seem to respect the objective values of options either,&amp;#xA0;as discussed &lt;a href=&quot;/lw/6da/do_humans_want_things/&quot;&gt;here&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;In response to the behavioral evidence, Kahneman &amp;amp; Tversky (1979) developed a &lt;em&gt;reference dependent&lt;/em&gt;&amp;#xA0;utility function to describe human behavior: &lt;a href=&quot;/lw/6kf/prospect_theory_a_framework_for_understanding/&quot;&gt;prospect theory&lt;/a&gt;. Their suggestion was, basically:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Rather than computing marginal utilities against [objective] wealth as in [standard neoclassical economic models], &lt;em&gt;utilities&lt;/em&gt;&amp;#xA0;(not marginal utilities) could be computed directly as deviations from a baseline level of wealth, and then choices could be based on direct comparisons of these utilities rather than on comparisons of marginal utilities. Their idea was to begin with something like the chooser's &lt;em&gt;status quo&lt;/em&gt;, how much wealth he thinks he has. Each gamble is then represented as the chance of winning or losing utilities relative to that status-quo-like reference point.&lt;sup&gt;38&lt;/sup&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;This fits with the neurobiological fact that we encode signals from external stimuli relative to reference points, and don't have access to the objective values of stimuli.&lt;/p&gt;
&lt;p&gt;The advantage of the neoclassical economic model is that it keeps a chooser's choices consistent. The advantage of the reference-dependent approach is that it better fits human behavior and human neurobiology.&lt;/p&gt;
&lt;p&gt;Most neoclassical economists seem to ignore the problems for their theories that are presented by reference dependence in human behavior and human neurobiology, but two neoclassical economists at Berkeley, &lt;a href=&quot;http://elsa.berkeley.edu/~rabin/&quot;&gt;Matthew Rabin&lt;/a&gt; and &lt;a href=&quot;http://elsa.berkeley.edu/~botond/&quot;&gt;Botond Koszegi&lt;/a&gt;, have begun to take reference dependence seriously. As they put it:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;...while&amp;#xA0;an unexpected monetary windfall in the lab may be assessed as a gain, a salary of $5o,000 to an employee who expected $60,000 will not be assessed as a large gain relative to status-quo wealth, but rather as a loss relative to&amp;#xA0;expectations of wealth. And in nondurable consumption &amp;#x2014; where there is no object with which the person can be endowed &amp;#x2014; a status-quo-based theory cannot capture the role of reference dependence at all: it would predict, for instance, that a person who misses a concert she expected to attend would feel no differently than somebody who never expected to see the concert.&lt;sup&gt;39&lt;/sup&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Their reference-dependent model makes particular predictions:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;[Our theory] shows that a consumer's willingness to pay a given price for shoes depends on the probability with which she expected to buy them and the price she expected to pay. On the one hand, an increase in the likelihood of buying increases a consumer's sense of loss of shoes if she does not buy, creating an &quot;attachment effect&quot; that increases her willingness to pay. Hence, the greater the likelihood she thought prices would be low enough to induce purchase, the greater is her willingness to buy at higher prices. On the other hand, holding the probability of getting the shoes fixed, a decrease in the price a consumer expected to pay makes paying a higher price feel like more of a loss, creating a &quot;comparison effect&quot; that lowers her willingness to pay the high price. Hence, the lower the prices she expected among those prices that induce purchase, the lower is her willingness to buy at higher prices.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Thus, the cost of accepting the human fact of reference-dependence is that we have to admit that humans are irrational (in the sense of 'rationality' defined by &lt;a href=&quot;http://en.wikipedia.org/wiki/Revealed_preference&quot;&gt;the axioms of revealed preference&lt;/a&gt;):&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;The fact that a consumer will pay more for shoes she expected to buy than for shoes she did not expect to buy, or that an animal would prefer inferior fruit it expected to eat over superior fruit it did not expect to eat, is exactly the kind of irrational behavior that we might hope the pressures of evolution would preclude. What observations tell us, however, is that these behaviors do occur. The neuroscience of sensory encoding tells us that these behaviors are an inescapable product of the fundamental structure of our brains.&lt;sup&gt;40&lt;/sup&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;But really, shouldn't it have been obvious all along that humans are irrational? Perhaps it is, to everyone but neoclassical economists and Aristoteleans. (Okay, enough teasing...)&lt;/p&gt;
&lt;p&gt;One thing to keep in mind is that the brain encodes information about the external world in a reference-dependent way because that method makes a more efficient use of neurons. So evolution traded away some rationality for greater efficiency in the encoding mechanism.&amp;#xA0;&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4&gt;&lt;a name=&quot;ValuationIn&quot;&gt;&lt;/a&gt;Valuation in the Brain&lt;/h4&gt;
&lt;p&gt;Back to dopamine. &lt;a href=&quot;#TheTurn&quot;&gt;Earlier&lt;/a&gt;, we learned that the brain learns the values of their actions with a dopaminergic reward system that uses something like temporal difference (TD) reinforcement learning. This reward system updates the stored values for actions by generating a reward prediction error (RPE) from the difference between expected reward and experience reward, and propagating this learning throughout relevant structures of the brain using the neurotransmitter dopamine. In particular, some&amp;#xA0;synapses are strengthened whenever presynaptic and postsynaptic activity occur &lt;em&gt;in the presence of dopamine&lt;/em&gt;, as proposed by Wickens (1993).&lt;/p&gt;
&lt;p&gt;But we haven't yet discussed how utilities for actions are generated in the first place, or how they are stored (independent of the&amp;#xA0;&lt;em&gt;expected&lt;/em&gt;&amp;#xA0;utilities represented during the choice process). It feels like I generally want ice cream a little bit and hot sex a lot more. Where is that information stored?&lt;/p&gt;
&lt;p&gt;Dozens&lt;sup&gt;41&lt;/sup&gt; of fMRI studies show that two brain regions in particular are correlated with subjective value: the ventral striatum and the medial prefrontal cortex. Other studies suggest that at least five more brain regions probably also contribute to the valuation process: the orbitofrontal cortex, the dorsolateral prefrontal cortex, the amygdala, the insula, and the anterior cingulate cortex.&lt;/p&gt;
&lt;p&gt;There are many theories about how the human brain generates and stores utilities, but these theories are far more speculative and in their infancy than everything else I've presented in this tutorial, so I won't discuss them here. Instead, let us conclude with a summary of what neuroscientists know about the human brain's motivational system, and what some of the greatest open questions are.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;p&gt;&lt;span style=&quot;font-size: 14px; font-weight: bold;&quot;&gt;&lt;a name=&quot;SummaryAnd&quot;&gt;&lt;/a&gt;&lt;a name=&quot;SummaryAnd&quot;&gt;&lt;/a&gt;Summary and Research Directions&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;Here's what we've learned:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Utilities are real numbers ranging from 0 to 1,000 that take action potentials per second as their natural units. (By 'utility' here I don't mean what's usually meant by the term, I just mean 'utility' for the purpose of predicting choice by measuring the firing rates of certain populations of neurons &lt;em&gt;in the final common path of the choice circuit in the human brain&lt;/em&gt;.)&lt;/li&gt;
&lt;li&gt;Mean utilities are mean firing rates of specific populations of neurons in the final common path of human choice circuits.&lt;/li&gt;
&lt;li&gt;Mean utilities predict choice stochastically, similar to random utility models from economics.&lt;/li&gt;
&lt;li&gt;Utilities are encoded cardinally in firing rates relative to neuronal baseline firing rates. (This is opposed to post-Pareto, ordinal notions of utility.)&lt;/li&gt;
&lt;li&gt;The choice circuit takes as its input a firing rate that encodes relative (normalized) stochastic expected utility.&lt;/li&gt;
&lt;li&gt;As the choice set size grows, so does the error rate.&lt;/li&gt;
&lt;li&gt;Final choice is implemented by an argmax function or a reservation price mechanism.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;a href=&quot;http://www.cns.nyu.edu/~glimcher/&quot;&gt;Paul Glimcher&lt;/a&gt; lists&lt;sup&gt;42&lt;/sup&gt; the greatest open questions in the field as:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;Where is utility stored and how does it get to the choice mechanism?&lt;/li&gt;
&lt;li&gt;How does the brain decide when it's time to choose?&lt;/li&gt;
&lt;li&gt;What is the neural mechanism that allows us to substitute between two goods at a certain point?&lt;/li&gt;
&lt;li&gt;How are probabilistic beliefs represented in the brain?&lt;/li&gt;
&lt;li&gt;Utility functions are state-dependent, so how do state and utility function interact?&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;Later, we'll explore the implications of our findings for &lt;a href=&quot;http://wiki.lesswrong.com/wiki/No-Nonsense_Metaethics&quot;&gt;metaethics&lt;/a&gt;. As of August 2011, if you've read this then you probably know more about how human values &lt;em&gt;actually&lt;/em&gt;&amp;#xA0;work than almost every professional metaethicist on Earth. The general lesson here is that you can often out-pace &lt;a href=&quot;/lw/4zs/philosophy_a_diseased_discipline/&quot;&gt;most philosophers&lt;/a&gt; simply by reading what today's leading&amp;#xA0;&lt;em&gt;scientists&lt;/em&gt;&amp;#xA0;have to say about a given topic instead of reading what &lt;em&gt;philosophers&lt;/em&gt;&amp;#xA0;say about it.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4&gt;&lt;a name=&quot;Notes&quot;&gt;&lt;/a&gt;Notes&lt;/h4&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;1&lt;/sup&gt; They are: &lt;a href=&quot;/lw/4vr/less_wrong_rationality_and_mainstream_philosophy/&quot;&gt;Less Wrong Rationality and Mainstream Philosophy&lt;/a&gt;, &lt;a href=&quot;/lw/4zs/philosophy_a_diseased_discipline/&quot;&gt;Philosophy: A Diseased Discipline&lt;/a&gt;, &lt;a href=&quot;/lw/5i7/on_being_okay_with_the_truth/&quot;&gt;On Being Okay with the Truth&lt;/a&gt;,&amp;#xA0;&lt;a href=&quot;/lw/4yq/the_neuroscience_of_pleasure/&quot;&gt;The Neuroscience of Pleasure&lt;/a&gt;,&amp;#xA0;&lt;a href=&quot;/lw/4z7/the_neuroscience_of_desire/&quot;&gt;The Neuroscience of Desire&lt;/a&gt;, &lt;a href=&quot;/lw/531/how_you_make_judgments_the_elephant_and_its_rider/&quot;&gt;How You Make Judgments: The Elephant and its Rider&lt;/a&gt;, &lt;a href=&quot;/lw/5ee/being_wrong_about_your_own_subjective_experience/&quot;&gt;Being Wrong About Your Own Subjective Experience&lt;/a&gt;, &lt;a href=&quot;/lw/59v/intuition_and_unconscious_learning/&quot;&gt;Intuition and Unconscious Learning&lt;/a&gt;, &lt;a href=&quot;/lw/5sk/inferring_our_desires/&quot;&gt;Inferring Our Desires&lt;/a&gt;, &lt;a href=&quot;http://commonsenseatheism.com/?p=15072&quot;&gt;Wrong About Our Own Desires&lt;/a&gt;,&amp;#xA0;&lt;a href=&quot;/lw/6da/do_humans_want_things/&quot;&gt;Do Humans Want Things?&lt;/a&gt;, &lt;a href=&quot;/lw/65w/not_for_the_sake_of_pleasure_alone/&quot;&gt;Not for the Sake of Pleasure Alone&lt;/a&gt;, &lt;a href=&quot;/lw/6dz/not_for_the_sake_of_selfishness_alone/&quot;&gt;Not for the Sake of Selfishness Alone&lt;/a&gt;,&amp;#xA0;&lt;a href=&quot;/lw/5bw/your_evolved_intuitions/&quot;&gt;Your Evolved Intuitions&lt;/a&gt;, &lt;a href=&quot;/lw/4vs/when_intuitions_are_useful/&quot;&gt;When Intuitions Are Useful&lt;/a&gt;, &lt;a href=&quot;http://commonsenseatheism.com/?p=15213&quot;&gt;Cornell Realism&lt;/a&gt;, &lt;a href=&quot;http://commonsenseatheism.com/?p=15253&quot;&gt;Railton's Moral Reductionism (Part 1)&lt;/a&gt;, &lt;a href=&quot;http://commonsenseatheism.com/?p=15264&quot;&gt;Railton's Moral Reductionism (Part 2)&lt;/a&gt;, &lt;a href=&quot;http://commonsenseatheism.com/?p=15267&quot;&gt;Jackson's Moral Functionalism&lt;/a&gt;,&amp;#xA0;&lt;a href=&quot;http://commonsenseatheism.com/?p=15336&quot;&gt;Moral Reductionism and Moore's Open Question Argument&lt;/a&gt;, and &lt;a href=&quot;/lw/74f/are_deontological_moral_judgments_rationalizations/&quot;&gt;Are Deontological Moral Judgments Rationalizations?&lt;/a&gt;&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;2&lt;/sup&gt; &lt;a href=&quot;/lw/54p/heading_toward_nononsense_metaethics/&quot;&gt;Heading Toward: No-Nonsense Metaethics&lt;/a&gt;, &lt;a href=&quot;/lw/5eh/what_is_metaethics/&quot;&gt;What is Metaethics?&lt;/a&gt;, &lt;a href=&quot;/lw/5kn/conceptual_analysis_and_moral_theory/&quot;&gt;Conceptual Analysis and Moral Theory&lt;/a&gt;, and &lt;a href=&quot;/lw/5u2/pluralistic_moral_reductionism/&quot;&gt;Pluralistic Moral Reductionism&lt;/a&gt;.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;3&lt;/sup&gt; I tried something similar before, with &lt;a href=&quot;http://commonsenseatheism.com/?p=13607&quot;&gt;Cognitive Science in One Lesson&lt;/a&gt;.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;4&lt;/sup&gt;&amp;#xA0;Glimcher (2010) offers the best coverage of the topic in a single book. Tobler &amp;amp; Kobayashi (2009) offer the best coverage in a single article.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;5&lt;/sup&gt;&amp;#xA0;The quotes in this section are from Churchland (1981).&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;6&lt;/sup&gt; Allen &amp;amp; Ng (2004).&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;7&lt;/sup&gt;&amp;#xA0;This perspective goes back at least as far back as Arnauld (1662), who wrote:&lt;/small&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;small&gt;To judge what one must do to obtain a good or avoid an evil, it is necessary to consider not only the good and the evil in itself, but also the probability that it happens or does not happen: and to view geometrically the proportion that all these things have together.&lt;/small&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;8&lt;/sup&gt;&amp;#xA0;In addition to Caplin &amp;amp; Leahy (2001), see Kreps &amp;amp; Porteus'&amp;#xA0;(1978, 1979)&amp;#xA0;incroporation of the &quot;utility of knowing&quot;, Loomes &amp;amp; Sugden's (1982) incorporation of &quot;regret&quot;,&amp;#xA0;Gul &amp;amp; Pesendorfer's (2001) incorporation of &quot;the cost of self-control&quot;, and&amp;#xA0;Koszegi &amp;amp; Rabin's (2007, 2009) incorporation of the &quot;reference point&quot;.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;9&lt;/sup&gt; Friedman (1953).&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;10&lt;/sup&gt; See a review in Fox &amp;amp; Poldrack (2009).&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;11&lt;/sup&gt; For one difficulty with prospect theory, see Laury &amp;amp; Holt (2008).&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;12&lt;/sup&gt; Sutton &amp;amp; Barto (2008), p. 3. All quotes from this section are from the early pages of this book.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;13&lt;/sup&gt;&amp;#xA0;From Sutton &amp;amp; Barto (2008).&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;14&lt;/sup&gt;&amp;#xA0;Much of the rest of this post is basically a summary and paraphrase of Glimcher (2010).&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;15&lt;/sup&gt; Mirenowicz &amp;amp; Schultz (1994).&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;16&lt;/sup&gt;&amp;#xA0;Schultz et al. (1997).&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;17&lt;/sup&gt; Caplin &amp;amp; Dean (2007)&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;18&lt;/sup&gt; From Glimcher (2010).&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;19&lt;/sup&gt; Hebb (1949).&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;20&lt;/sup&gt; Malenka &amp;amp; Bear (2004).&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;21&lt;/sup&gt; Reynolds &amp;amp; Wickens (2002).&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;22&lt;/sup&gt; Glimcher (2010), p. 341.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;23&lt;/sup&gt;&amp;#xA0;Edelman &amp;amp; Keller (1996); Van Gisbergen et al. (1987).&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;24&lt;/sup&gt;&amp;#xA0;Gold and Shadlen (2007); Roitman and Shadlen (2002).&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;25&lt;/sup&gt; Simon (1957).&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;26&lt;/sup&gt;&amp;#xA0;Glimcher (2010), p. 215.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;27&lt;/sup&gt;&amp;#xA0;McFadden (2000). The behavior of gradually transitioning between two choices is described by Selten (1975).&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;28&lt;/sup&gt;&amp;#xA0;For a probably improved random utility model, see Gul &amp;amp; Pesendorfer (2006).&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;29&lt;/sup&gt; Dean (1983); Werner &amp;amp; Mountcastle (1963).&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;30&lt;/sup&gt;&amp;#xA0;Unless some other feature of the brain turns out to 'smooth out' the stochasticity of neurons involved in valuation and choice-making.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;31&lt;/sup&gt; Glimcher (2010).&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;32&lt;/sup&gt; Heeger (1992, 1993); Carandini &amp;amp; Heeger (1994); Simoncelli &amp;amp; Heeger (1998).&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;33&lt;/sup&gt; Carandini &amp;amp; Heeger (1994); Britten &amp;amp; Heuer (1999); Zoccolan et al. (2005); Louie &amp;amp; Glimcher (2010).&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;34&lt;/sup&gt; Horowitz &amp;amp; Newsome (2001a, 2001b, 2004).&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;35&lt;/sup&gt; Liu &amp;amp; Wang (2008).&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;36&lt;/sup&gt;&amp;#xA0;But, see Deneve (2009).&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;37&lt;/sup&gt; Glimcher (2010), p. 281.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;38&lt;/sup&gt; Glimcher (2010), p. 283.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;39&lt;/sup&gt;&amp;#xA0;This quote and the next quote are from Koszegi &amp;amp; Rabin (2006).&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;40&lt;/sup&gt; Glimcher (2010), p. 292.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;41&lt;/sup&gt; I won't list them all here. For an overview, see Glimcher (2010), ch. 14.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;42&lt;/sup&gt; Glimcher (2010), ch. 17. I've paraphrased his open questions. I also excluded his 6th question:&amp;#xA0;What Is the Neural Organ for Representing Money?&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;span style=&quot;font-size: 11px;&quot;&gt;&lt;br&gt;&lt;/span&gt;&lt;/small&gt;&lt;/p&gt;
&lt;h4&gt;&lt;a name=&quot;References&quot;&gt;&lt;/a&gt;References&lt;/h4&gt;
&lt;p&gt;&lt;small&gt;Allais (1953). &lt;a href=&quot;http://www.ericchiang.org/files/Allais_1953_Econometrica.pdf&quot;&gt;Le comportement de l'homme rationel devant le risque. Critique des postulates et axiomes de l'ecole americaine&lt;/a&gt;. &lt;em&gt;Econometrica, 21&lt;/em&gt;: 503-546.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Allen &amp;amp; Ng (2004). Economic behavior. In Spielberger (ed.), &lt;em&gt;Encyclopedia of Applied Psychology, Vol. 1&lt;/em&gt;&amp;#xA0;(pp. 661-666). Academic Press.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Arnauld (1662). &lt;em&gt;&lt;a href=&quot;http://www.amazon.com/Port-Royal-Logic-Pierre-Nicole/dp/1142071251/&quot;&gt;Port-Royal Logic&lt;/a&gt;&lt;/em&gt;.&amp;#xA0;&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Basso &amp;amp; Wurtz (1997). &lt;a href=&quot;http://neuro.cjb.net/content/18/18/7519.full.pdf&quot;&gt;Modulation of neuronal activity in superior colliculus by changes in target probability&lt;/a&gt;. &lt;em&gt;Journal of Neuroscience, 18&lt;/em&gt;: 7519-7534.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Britten &amp;amp; Heuer (1999). &lt;a href=&quot;http://neuro.cjb.net/content/19/12/5074.full.pdf&quot;&gt;Spatial summation in the receptive fields of MT neurons&lt;/a&gt;. &lt;em&gt;Journal of Neuroscience, 19&lt;/em&gt;: 5074-5084.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Caplin &amp;amp; Dean (2007). &lt;a href=&quot;http://cess.nyu.edu/caplin/wp-content/uploads/2010/02/Axiomatic-Neuroeconomics.pdf&quot;&gt;Axiomatic neuroeconomics&lt;/a&gt;.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Caplin, Dean, Glimcher, &amp;amp; Rutledge (2010). &lt;a href=&quot;http://www.yorkshire-exile.co.uk/Beliefs.pdf&quot;&gt;Measuring beliefs and rewards: a neuroeconomic approach&lt;/a&gt;. &lt;em&gt;Quarterly Journal of Economics, 125&lt;/em&gt;: 3.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Caplin &amp;amp; Leahy (2001). &lt;a href=&quot;http://pages.stern.nyu.edu/~dbackus/Exotic/1Other/CaplinLeahy%20antic%20QJE%2001.pdf&quot;&gt;Psychological expected utility theory and anticipatory feelings&lt;/a&gt;. &lt;em&gt;Quarterly Journal of Economics, 116&lt;/em&gt;: 55-79.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Carandini &amp;amp; Heeger (1994). &lt;a href=&quot;http://redwood.berkeley.edu/vs265/carandini-heeger.pdf&quot;&gt;Summation and devision by neurons in primate visual cortex&lt;/a&gt;. &lt;em&gt;Science, 264&lt;/em&gt;: 1333-1336.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Churchland (1981).&amp;#xA0;&lt;a href=&quot;http://philosophy.wisc.edu/Shapiro/Phil554/PAPERS/Churchland.pdf&quot;&gt;Eliminative materialism and the propositional attitudes&lt;/a&gt;. &lt;em&gt;The Journal of Philosophy, 78&lt;/em&gt;: 67-90.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Dean (1983). Adaptation-induced alteration of the relation between response amplitude and contrast in cat striate cortical neurons. &lt;em&gt;Vision Research, 23&lt;/em&gt;: 249-256.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Deneve (2009).&amp;#xA0;&lt;a href=&quot;http://commonsenseatheism.com/wp-content/uploads/2011/08/Deneve-Bayesian-decision-making-in-two-alternative-forced-choices.pdf&quot;&gt;Bayesian decision making&amp;#xA0;in two-alternative forced choices&lt;/a&gt;.&amp;#xA0;In Dreher &amp;amp; Tremblay (eds.), &lt;em&gt;Handbook of Reward and Decision Making&lt;/em&gt; (pp. 441-458). Academic Press.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Dorris &amp;amp; Glimcher (2004). &lt;a href=&quot;http://monkeybiz.stanford.edu/nbio220/DorrisMC-Glimcher.pdf&quot;&gt;Activity in posterior parietal cortex is correlated with the subjective desireability of an action&lt;/a&gt;. &lt;em&gt;Neuron, 44&lt;/em&gt;: 365-378.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Edelman &amp;amp; Keller (1996). Activity of visuomotor burst neurons in the superior colliculus accompanying express saccades. &lt;em&gt;Journal of Neurophysiology, 76&lt;/em&gt;: 908-926.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Fox &amp;amp; Poldrack (2009). &lt;a href=&quot;http://cnl.salk.edu/~terry/BGGN/CH011.pdf&quot;&gt;Prospect theory and the brain&lt;/a&gt;. In Glimcher, Camerer, Fehr, &amp;amp; Poldrack (eds.), &lt;em&gt;Neuroeconomics: Decision Making and the Brain&lt;/em&gt;&amp;#xA0;(pp. 145-173). Academic Press.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Friedman (1953). &lt;em&gt;&lt;a href=&quot;http://www.amazon.com/Essays-Positive-Economics-Phoenix-Books/dp/0226264033/&quot;&gt;Essays in Positive Economics&lt;/a&gt;&lt;/em&gt;. University of Chicago Press.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Glimcher (2010). &lt;em&gt;&lt;a href=&quot;http://www.amazon.com/Foundations-Neuroeconomic-Analysis-Paul-Glimcher/dp/0199744254/&quot;&gt;Foundations of Neuroeconomic Analysis&lt;/a&gt;&lt;/em&gt;. Oxford University Press.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Gold and Shadlen (2007). &lt;a href=&quot;http://homepages.inf.ed.ac.uk/pseries/CCN/gold_shadlen_review.pdf&quot;&gt;The neural basis of decision making&lt;/a&gt;. &lt;em&gt;Annual Review of Neuroscience, 30&lt;/em&gt;: 535-574.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Gul &amp;amp; Pesendorfer (2001). &lt;a href=&quot;http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.160.133&amp;amp;rep=rep1&amp;amp;type=pdf&quot;&gt;Temptation and self-control&lt;/a&gt;. &lt;em&gt;Econometrica, 69&lt;/em&gt;: 1403-1435.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Gul &amp;amp; Pesendorfer (2006). &lt;a href=&quot;http://www.princeton.edu/~pesendor/random.pdf&quot;&gt;Random expected utility&lt;/a&gt;. &lt;em&gt;Econometrica, 74&lt;/em&gt;: 121-146.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Hebb (1949). &lt;em&gt;&lt;a href=&quot;http://www.amazon.com/Organization-Behavior-Neuropsychological-Theory/dp/0805843000/&quot;&gt;The organization of behavior&lt;/a&gt;&lt;/em&gt;. Wiley &amp;amp; Sons.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Heeger (1992). &lt;a href=&quot;http://www.cns.nyu.edu/labs/heegerlab/content/publications/Heeger-VisNeurosci1992a.pdf&quot;&gt;Normalization of cell responses in cat striate cortex&lt;/a&gt;. &lt;em&gt;Visual Neuroscience, 9&lt;/em&gt;: 181-197.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Heeger (1993). &lt;a href=&quot;http://www.cns.nyu.edu/labs/heegerlab/content/publications/Heeger-JNeurophysiol1993.pdf&quot;&gt;Modeling simple-cell direction selectivity with normalized, half-squared linear operators&lt;/a&gt;. &lt;em&gt;Journal of Neurophysiology, 70&lt;/em&gt;: 1885-1898.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Horowitz &amp;amp; Newsome (2001a). &lt;a href=&quot;http://jn.physiology.org/content/86/5/2527.full.pdf&quot;&gt;Target selection for saccadic eye movements: direction selective visual responses in the superior colliculus induced by behavioral training&lt;/a&gt;. &lt;em&gt;Journal of Neurophysiology, 86&lt;/em&gt;: 2527-2542.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Horowitz &amp;amp; Newsome (2001b). &lt;a href=&quot;http://jn.physiology.org/content/86/5/2543.full.pdf&quot;&gt;Target selection for saccadic eye movements: prelude activity in the superior colliculus during a direction discrimination task&lt;/a&gt;. &lt;em&gt;Journal of Neurophysiology, 86&lt;/em&gt;: 2543-2558.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Iyengar &amp;amp; Lepper (2000). &lt;a href=&quot;http://www.columbia.edu/~ss957/articles/Choice_is_Demotivating.pdf&quot;&gt;When choice is demotivating: Can one desire too much of a good thing?&lt;/a&gt; &lt;em&gt;Journal of Personality and Social Psychology, 79&lt;/em&gt;: 995-1006.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Jevons (1871).&amp;#xA0;&lt;em&gt;&lt;a href=&quot;http://www.amazon.com/Theory-Political-Economy-William-Stanley/dp/1165162105/&quot;&gt;The Theory of Political&amp;#xA0;Economy&lt;/a&gt;&lt;/em&gt;. Macmillan and Co.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Kahneman &amp;amp; Tversky (1979). &lt;a href=&quot;http://paper.blog.bbiq.jp/Kahneman_and_Tversky_1979.pdf&quot;&gt;Prospect theory: An analysis of decision under risk&lt;/a&gt;. &lt;em&gt;Econometrica, 47&lt;/em&gt;: 263-291.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Koszegi &amp;amp; Rabin (2006). &lt;a href=&quot;http://webserver1.pugetsound.edu/facultypages/gmilam/courses/econ291/readings/01-KozegiRabin.pdf&quot;&gt;A model of reference-dependent preferences&lt;/a&gt;. &lt;em&gt;Quarterly Journal of Economics, 121&lt;/em&gt;: 1133-1165.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Koszegi &amp;amp; Rabin (2007). &lt;a href=&quot;http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.61.4476&amp;amp;rep=rep1&amp;amp;type=pdf&quot;&gt;Reference-dependent risk attitudes&lt;/a&gt;. &lt;em&gt;American Economic Review, 97&lt;/em&gt;: 1047-1073.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Koszegi &amp;amp; Rabin (2009). &lt;a href=&quot;http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.153.2065&amp;amp;rep=rep1&amp;amp;type=pdf&quot;&gt;Reference-dependent consumption plans&lt;/a&gt;. &lt;em&gt;American Economic Review, 99&lt;/em&gt;: 909-936.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Kreps &amp;amp; Porteus (1978). &lt;a href=&quot;http://teaching.ust.hk/~bee/papers/040918/1978-Kreps_Porteus-dynamic_choice_theory.pdf&quot;&gt;Temporal resolution of uncertainty and dynamic choice theory&lt;/a&gt;. &lt;em&gt;Econometrica, 46&lt;/em&gt;: 185-200.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Kreps &amp;amp; Porteus (1979). &lt;a href=&quot;http://hassler-j.iies.su.se/Courses/NewPrefs/Papers/TimeandRisk/KrepsPorteus%20DP%20Ec%20Jan%2079.pdf&quot;&gt;Dynamic choice theory and dynamic programming&lt;/a&gt;. &lt;em&gt;Econometrica, 47&lt;/em&gt;: 91-100.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Laury &amp;amp; Holt (2008). &lt;a href=&quot;http://www2.gsu.edu/~ecoskl/lotteryhbk.pdf&quot;&gt;Payoff scale effects and risk preference under real and hypothetical conditions&lt;/a&gt;. In Plott &amp;amp; Smith (eds.), &lt;em&gt;Handbook of Experimental Economic Results, Vol. 1&lt;/em&gt; (pp. 1047-1053). Elsevier Press.&amp;#xA0;&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Loewenstein (1987). &lt;a href=&quot;http://sds.hss.cmu.edu/media/pdfs/loewenstein/AnticipationValuDelayed.pdf&quot;&gt;Anticipation and the valuation of delayed consumption&lt;/a&gt;. &lt;em&gt;Economic Journal, 97&lt;/em&gt;: 666-684.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Liu &amp;amp; Wang (2008). &lt;a href=&quot;http://biophy.nju.edu.cn/papers/Liu_PLoS_253_08.pdf&quot;&gt;A common cortical circuit mechanism for perceptual categorical discrimination and veridical judgment&lt;/a&gt;. &lt;em&gt;PLOS Computational Biology, 4&lt;/em&gt;: 1-14.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Loomes &amp;amp; Sugden (1982). &lt;a href=&quot;http://teaching.ust.hk/~bee/papers/misc/Regret%20Theory%20An%20Alternative%20Theory%20of%20Rational%20Choice%20Under%20Uncertainty.pdf&quot;&gt;Regret theory: An alternative theory of rational choice under uncertainty&lt;/a&gt;. &lt;em&gt;Economic Journal, 92&lt;/em&gt;: 805-824.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Louie &amp;amp; Glimcher (2010). &lt;a href=&quot;http://www.jneurosci.org/content/30/16/5498.full.pdf&quot;&gt;Separating value from choice: delay discounting activity in the lateral intraparietal area&lt;/a&gt;. &lt;em&gt;Journal of Neuroscience, 30&lt;/em&gt;: 5498-5507.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Malenka &amp;amp; Bear (2004). LTP and LTD: an embarrassment of riches. &lt;em&gt;Neuron, 44&lt;/em&gt;: 5&amp;#x2013;21.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Mirenowicz &amp;amp; Schultz (1994). Importance of unpredictability for reward responses in primate dopamine neurons. &lt;em&gt;Journal of Neurophysiology, 72&lt;/em&gt;: 1024-1027.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Reynolds &amp;amp; Wickens (2002). &lt;a href=&quot;http://cnl.salk.edu/~terry/BGGN/reynolds.NN.02.pdf&quot;&gt;Dopamine-dependent plasticity of corticostriatal synapses&lt;/a&gt;. &lt;em&gt;Neural Networks, 15&lt;/em&gt;: 507-521.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Roitman and Shadlen (2002). &lt;a href=&quot;http://neuro.cjb.net/content/22/21/9475.full.pdf&quot;&gt;Response of neurons in the lateral intraparietal area during a combined visual discrimination reaction time task&lt;/a&gt;. &lt;em&gt;Nature Neuroscience, 22&lt;/em&gt;: 9475-9489.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Scheibehenne,&amp;#xA0;Greifeneder, &amp;amp; Todd (2010). Can there ever be too many options? A meta-analytic review of choice overload.&amp;#xA0;&lt;em&gt;Journal of Consumer Research, 37&lt;/em&gt;: 409-425.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Schultz, Dayan, &amp;amp; Montague (1997). &lt;a href=&quot;http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.124.5997&amp;amp;rep=rep1&amp;amp;type=pdf&quot;&gt;A neural substrate of prediction and reward&lt;/a&gt;. &lt;em&gt;Science, 275&lt;/em&gt;: 1593&amp;#x2013;1599.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Schwartz &amp;amp; Simoncelli (2001). &lt;a href=&quot;http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.29.3508&amp;amp;rep=rep1&amp;amp;type=pdf&quot;&gt;Natural signal statistics and sensory gain control&lt;/a&gt;. &lt;em&gt;Nature Neuroscience, 4&lt;/em&gt;: 819-825.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Selten (1975). Reexamination of perfectness concept for equilibrium points in extensive games. &lt;em&gt;International Journal of Game Theory, 4&lt;/em&gt;: 25-55.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Simoncelli &amp;amp; Heeger (1998). &lt;a href=&quot;http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.35.5397&amp;amp;rep=rep1&amp;amp;type=pdf&quot;&gt;A model of neuronal responses in visual area MT&lt;/a&gt;. &lt;em&gt;Vision Research, 38&lt;/em&gt;: 743-761.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Sutton &amp;amp; Barto (2008). &lt;em&gt;&lt;a href=&quot;http://www.amazon.com/Reinforcement-Learning-Introduction-Adaptive-Computation/dp/0262193981/&quot;&gt;Reinforcement Learning: An Introduction&lt;/a&gt;&lt;/em&gt;. MIT Press.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Tanji &amp;amp; Evarts (1976). &lt;a href=&quot;http://jacknife.med.yale.edu/spikeclub/TanjiAnticipatoryJNeurophys1976.pdf&quot;&gt;Anticipatory activity of motor cortex neurons in relation to direction of an intended movement&lt;/a&gt;. &lt;em&gt;Journal of Neurophysiology, 39&lt;/em&gt;: 1062-1068.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Tobler &amp;amp; Kobayashi (2009). &lt;a href=&quot;http://commonsenseatheism.com/wp-content/uploads/2011/08/Tobler-Kobayashi-Electrophysiological-correlates-of-reward-processing-in-dopamine-neurons.pdf&quot;&gt;Electrophysiological correlates of reward processing in dopamine neurons&lt;/a&gt;. In Dreher &amp;amp; Tremblay (eds.), &lt;em&gt;Handbook of Reward and Decision Making&lt;/em&gt;&amp;#xA0;(pp. 29-50). Academic Press.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Van Gisbergen, Opstal, &amp;amp; Tax (1987). Collicular ensemble coding of saccades based on vector summation. &lt;em&gt;Neuroscience, 21&lt;/em&gt;: 651.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Werner &amp;amp; Mountcastle (1963). The variability of central neural activity in a sensory system, and its implications for central reflection of sensory events. &lt;em&gt;Journal of Neurophysiology, 26&lt;/em&gt;: 958-977.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Wickens (1993). &lt;em&gt;&lt;a href=&quot;http://www.amazon.com/Theory-Striatum-J-Wickens/dp/0080422780/&quot;&gt;A Theory of the Striatum&lt;/a&gt;&lt;/em&gt;. Pergamon Press.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Zoccolan, Cox, &amp;amp; DiCarlo (2005). &lt;a href=&quot;http://www.jneurosci.org/content/25/36/8150.full.pdf&quot;&gt;Multiple object response normalization in monkey inferotemporal cortex&lt;/a&gt;. &lt;em&gt;Journal of Neuroscience, 25&lt;/em&gt;: 8150-8164.&lt;/small&gt;&lt;/p&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/lw/71x/a_crash_course_in_the_neuroscience_of_human/#comments"&gt;89 comments&lt;/a&gt;
</description>
</item>
<item>
<title>Do Humans Want Things?</title>
<link>http://lesswrong.com/lw/6da/do_humans_want_things/</link>
<guid isPermaLink="true">http://lesswrong.com/lw/6da/do_humans_want_things/</guid>
<pubDate>Thu, 04 Aug 2011 15:00:40 +1000</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/lukeprog"&gt;lukeprog&lt;/a&gt;
&amp;bull;
23 votes
&amp;bull;
&lt;a href="http://lesswrong.com/lw/6da/do_humans_want_things/#comments"&gt;52 comments&lt;/a&gt;
&lt;div&gt;&lt;p&gt;Summary: &lt;em&gt;Recent posts like &lt;a href=&quot;/lw/4z7/the_neuroscience_of_desire/&quot;&gt;The Neuroscience of Desire&lt;/a&gt; and &lt;a href=&quot;/lw/6oo/to_what_degree_do_we_have_goals/&quot;&gt;To what degree do we have goals?&lt;/a&gt; have explored the question of whether humans have desires (or 'goals'). If we don't&amp;#xA0;have desires, how can we tell an AI what kind of world we 'want'? Recent work in economics and neuroscience has clarified the nature of this problem.&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;p&gt;We begin, as is &lt;a href=&quot;/lw/6kf/prospect_theory_a_framework_for_understanding/&quot;&gt;so&lt;/a&gt; &lt;a href=&quot;/lw/ji/conjunction_fallacy/&quot;&gt;often&lt;/a&gt; &lt;a href=&quot;/lw/2ep/so_you_think_youre_a_bayesian_the_natural_mode_of/&quot;&gt;the&lt;/a&gt; &lt;a href=&quot;/lw/173/knowing_what_you_know/&quot;&gt;case&lt;/a&gt; on Less Wrong, with &lt;a href=&quot;http://en.wikipedia.org/wiki/Daniel_Kahneman&quot;&gt;Kahneman&lt;/a&gt; &amp;amp; &lt;a href=&quot;http://en.wikipedia.org/wiki/Amos_Tversky&quot;&gt;Tversky&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;In 1981, K&amp;amp;T found that human choice was not always guided by the&amp;#xA0;&lt;em&gt;objective value&lt;/em&gt;&amp;#xA0;of possible outcomes, but by the way those outcomes were 'framed'.&lt;sup&gt;1&lt;/sup&gt;&amp;#xA0;For example in one study, K&amp;amp;T told subjects the following story:&lt;/p&gt;
&lt;blockquote style=&quot;border-left-width: 2px; border-left-style: solid; border-left-color: #336699; padding-left: 4px; margin-top: 5px; margin-bottom: 5px; margin-left: 5px; margin-right: 15px;&quot;&gt;
&lt;p&gt;Imagine that the U.S. is preparing for the outbreak of an unusual Asian disease, which is expected to kill 600 people. Two alternative programs to combat the disease have been proposed.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Half the participants were given the following choice:&lt;/p&gt;
&lt;blockquote style=&quot;border-left-width: 2px; border-left-style: solid; border-left-color: #336699; padding-left: 4px; margin-top: 5px; margin-bottom: 5px; margin-left: 5px; margin-right: 15px;&quot;&gt;
&lt;p&gt;If program A is adopted, 200 people will be saved. If Program B is adopted, there is a 1/3 probability that 600 people will be saved and a 2/3 probability that no people will be saved.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;The second half of participants were given a different choice:&lt;/p&gt;
&lt;blockquote style=&quot;border-left-width: 2px; border-left-style: solid; border-left-color: #336699; padding-left: 4px; margin-top: 5px; margin-bottom: 5px; margin-left: 5px; margin-right: 15px;&quot;&gt;
&lt;p&gt;If Program C is adopted 400 people will die. If Program D is adopted there is a 1/3 probability that nobody will die, and a 2/3 probability that 600 people will die.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Each of these choice sets is identical, except that one is framed with language about people being saved, and the other is framed with language about people dying.&lt;/p&gt;
&lt;p&gt;In the first group, 72% of subjects chose Program A. In the second group, only 22% of people chose the numerically identical option: Program C.&lt;/p&gt;
&lt;p&gt;K&amp;amp;T explained the difference by noting that in option A we consider the happy thought of &lt;em&gt;saving&lt;/em&gt; 200 people, but in option C we confront the dreadful thought of 400 &lt;em&gt;deaths&lt;/em&gt;. Our choice seems to depend not only on the objective properties of the options before us, but also on the &lt;em&gt;reference point&lt;/em&gt;&amp;#xA0;used to frame the options.&lt;/p&gt;
&lt;p&gt;But if this is how human desire works, we are left with a worrying problem about how to translate human desires into the goals of an AI. Surely we don't want an AI to realize one state of affairs over another based merely on how the options are &lt;em&gt;framed!&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;Before we begin to solve this problem, though, let's look at a similar result from neurobiology.&lt;/p&gt;
&lt;p&gt;&lt;a id=&quot;more&quot;&gt;&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4&gt;Reference-Dependence in Neurobiology&lt;/h4&gt;
&lt;p&gt;A different kind of reference-dependence has been discovered in the way that neurons encode value.&lt;/p&gt;
&lt;p&gt;Imagine sitting in a windowless room with Mark, who is wearing blue jeans and a green t-shirt. Your perception of Mark results from about 10&lt;sup&gt;17&lt;/sup&gt; photons/second with a mean wavelength of 450 nanometers coming from every square centimeter of Mark's blue jeans, and about 10&lt;sup&gt;17&lt;/sup&gt; photons/second with a mean wavelength of 550 nanometers coming from every square centimeter of his green shirt.&lt;/p&gt;
&lt;p&gt;Now, you and Mark step outside, and are briefly blinded by the sun. A minute later you sit on a park bench. Mark looks the same as before: blue jeans, green shirt. But now, in the bright sun, your identical subjective perceptual experience of Mark results from about 10&lt;sup&gt;23&lt;/sup&gt; 450-nm photons/second/cm&lt;sup&gt;2&lt;/sup&gt; coming from his blue jeans, and about 10&lt;sup&gt;23&lt;/sup&gt; 550-nm photons/second/cm&lt;sup&gt;2&lt;/sup&gt; coming off his green shirt.&lt;/p&gt;
&lt;p&gt;A &lt;em&gt;six-order-of-magnitude&lt;/em&gt;&amp;#xA0;shift in the objective reality of the stimulus has resulted in &lt;em&gt;no&lt;/em&gt; change in your subjective experience of Mark.&lt;sup&gt;2&lt;/sup&gt;&lt;/p&gt;
&lt;p&gt;How did this happen?&lt;/p&gt;
&lt;p&gt;What changed was the &lt;em&gt;illuminant&lt;/em&gt;,&amp;#xA0;the sun. But for Earth-bound mammals, changes in an object millions of miles away are not very important. What matters for our survival and reproduction is information about the objects immediately around us. So our brains subtract away the changing effects of the sun as we move in and out of direct sunlight.&lt;/p&gt;
&lt;p&gt;This 'illuminant subtraction' process occurs during the &lt;em&gt;first step&lt;/em&gt; of &lt;a href=&quot;/lw/6cv/entangled_with_reality_the_shoelace_example/&quot;&gt;visual processing&lt;/a&gt;, during &lt;a href=&quot;http://en.wikipedia.org/wiki/Transduction_(biophysics)&quot;&gt;transduction&lt;/a&gt;. The rods and cones of the retina compute an average of local light intensity, which is used as a reference point.&lt;sup&gt;3&lt;/sup&gt; &lt;em&gt;Changes&lt;/em&gt; of light intensity from this reference point are what the rods and cones communicate to the rest of the nervous system.&lt;/p&gt;
&lt;p&gt;Thus: &lt;em&gt;information about the objective intensity of incoming light is irretrievably lost at the transducer&lt;/em&gt;. Light intensity is stored in the brain only in a reference-dependent way.&lt;/p&gt;
&lt;p&gt;The same is true of our other senses. Sound intensity can differ between a quiet room and a rock concert by as much as 10 order of magnitude,&lt;sup&gt;4&lt;/sup&gt; and our ears respond by shifting the reference point and encoding sound intensity relative to that reference point.&lt;sup&gt;5&lt;/sup&gt; A rose may smell sweet in my bedroom, but its scent will be hidden in a field of roses.&lt;sup&gt;6&lt;/sup&gt;&amp;#xA0;The &lt;a href=&quot;http://en.wikipedia.org/wiki/Somatosensory_system&quot;&gt;somatosensory system&lt;/a&gt; appears to operate with the same principle. You feel your clothes when you first put them on, but the nerve endings in your skin stop reporting their existence except where your clothes are shifting across your skin or their pressure on your skin is changing.&lt;sup&gt;7&lt;/sup&gt; And the same is true for taste. How salty something tastes, for example, depends on the amount of sodium in your blood and in surrounding tissue in your mouth.&lt;sup&gt;8&lt;/sup&gt;&lt;/p&gt;
&lt;p&gt;I &lt;a href=&quot;/lw/4z7/the_neuroscience_of_desire/&quot;&gt;wrote before&lt;/a&gt; about how neurons encode value. But now it seems that, as neuroscientist Paul Glimcher puts it:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;All sensory encoding is reference dependent: nowhere in the nervous system are the objective values of consumable rewards encoded.&lt;sup&gt;9&lt;/sup&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Thus we smack headlong into another constraint for our theories about human values and their&amp;#xA0;&lt;a href=&quot;http://singinst.org/upload/CEV.html&quot;&gt;extrapolation&lt;/a&gt;. Human brains can't (directly) encode value for the objective intensities of stimuli because that information is lost at the transducer.&lt;/p&gt;
&lt;p&gt;It's beginning to seem that our folk theories about humans 'wanting' things in the world were naive.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4&gt;Do Humans Want Things?&lt;/h4&gt;
&lt;p&gt;It has traditionally been thought that humans desire (or value) states of affairs:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;A desire for tea is a desire for a certain state of affairs one has in mind: that one drink some tea. A desire for a new pair of skates is likewise a desire for another state of affairs: that one own a new pair of skates. And so on.&lt;sup&gt;10&lt;/sup&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Intuitively, when we think about what we want, it seems that we want certain states of affairs to obtain. We want to be &lt;a href=&quot;http://en.wikipedia.org/wiki/Nootropic&quot;&gt;smarter&lt;/a&gt;. We want there to be world peace. We want to live forever while having &lt;a href=&quot;/lw/y0/31_laws_of_fun/&quot;&gt;fun&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;But as far as we can tell, our behavior is often not determined by our wanting&amp;#xA0;a particular state of affairs, but by how our options are framed.&lt;/p&gt;
&lt;p&gt;Moreover,&amp;#xA0;neurons in the&amp;#xA0;parietal and orbitofrontal&amp;#xA0;corticies&amp;#xA0;&lt;a href=&quot;/lw/4z7/the_neuroscience_of_desire/&quot;&gt;encode value&lt;/a&gt;&amp;#xA0;in a reference-dependent way &amp;#x2014; that is, they do not encode value for objective states of affairs.&lt;sup&gt;11&lt;/sup&gt;&amp;#xA0;So in what sense do humans 'want' objective states of affairs?&lt;/p&gt;
&lt;p&gt;(Compare: In what sense does &lt;a href=&quot;/lw/6ha/the_blueminimizing_robot/&quot;&gt;the blue-minimizing robot&lt;/a&gt; 'want' anything?)&lt;/p&gt;
&lt;p&gt;In a later post, I'll explain in greater detail how brains do (and don't) encode value for states of affairs. In the meantime, you might want to try to figure out on your own in what sense the brain might want things.&lt;/p&gt;
&lt;p align=&quot;center&quot;&gt;&lt;img src=&quot;http://commonsenseatheism.com/wp-content/uploads/2011/08/chasing-the-carrot.gif&quot; alt=&quot;chasing the carrot&quot; height=&quot;301&quot; align=&quot;center&quot; width=&quot;297&quot;&gt;&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h4&gt;Notes&lt;/h4&gt;
&lt;p&gt;&lt;span style=&quot;font-size: xx-small;&quot;&gt;&lt;span style=&quot;font-size: 11px;&quot;&gt;&lt;sup&gt;1&lt;/sup&gt;&amp;#xA0;Tversky &amp;amp; Kahneman (1981).&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;2&lt;/sup&gt; This example, and the outline of this post, is taken from Glimcher (2010), ch. 12.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;3&lt;/sup&gt; Burns &amp;amp; Baylor (2001).&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;4&lt;/sup&gt; Bacchus (2006); Robinson &amp;amp; McAlpine (2009).&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;5&lt;/sup&gt; Squire et al. (2008), ch. 26.&amp;#xA0;&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;6&lt;/sup&gt; Mountcastle (2005); Squire et al. (2008), pp. 565-567.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;7&lt;/sup&gt; Squire et al. (2008), ch. 25.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;8&lt;/sup&gt; Squire et al. (2008), pp. 555-556.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;9&lt;/sup&gt; Glimcher (2010), p. 278.&amp;#xA0;&lt;/small&gt;&lt;small&gt;Moreover, objective properties of the real world are not even linearly related to our subjective experience. The intensity of our perception of the world grows as a power law, the exact rate of which depends on the kind of stimulus (Stevens 1951, 1970, 1975). For example, we've found that:&lt;/small&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;small&gt;Perceived warmth of a patch of skin = (temp. of that skin)&lt;sup&gt;0.7&lt;/sup&gt;&lt;/small&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;&lt;small&gt;And, another example:&lt;/small&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;small&gt;Perceived intensity of an electrical shock = (electrical current)&lt;sup&gt;3.5&lt;/sup&gt;&lt;/small&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;10&lt;/sup&gt;&amp;#xA0;Schroeder (2009).&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;sup&gt;11&lt;/sup&gt;&amp;#xA0;It's less certain how values are encoded in the medial prefrontal cortex and in the temporal cortex, but Paul Glimcher predicts (in personal communication with me from June 2011) that this will also be a largely reference-dependent process.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;&lt;br&gt;&lt;/small&gt;&lt;/p&gt;
&lt;h4&gt;References&lt;/h4&gt;
&lt;p&gt;&lt;small&gt;Baccus (2006). From a whisper to a roar: Adaptation to the mean and variance of naturalistic sounds. &lt;em&gt;Neuron, 51&lt;/em&gt;: 682-684.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Burns &amp;amp; Baylor (2001). &lt;a href=&quot;http://commonsenseatheism.com/wp-content/uploads/2011/06/Burns-Baylor-Activation-deactivation-and-adaptation-in-vertebrate-photoreceptor-cells.pdf&quot;&gt;Activation, deactivation, and adaptation in vertebrate photoreceptor cells&lt;/a&gt;. &lt;em&gt;Annual Review of Neuroscience, 24&lt;/em&gt;: 779-805.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Glimcher (2010). &lt;em&gt;&lt;a href=&quot;http://api.viglink.com/api/click?format=go&amp;amp;key=9f37ca02a1e3cbd4f3d0a3618a39fbca&amp;amp;loc=http%3A%2F%2Flesswrong.com%2Flw%2F4z7%2Fthe_neuroscience_of_desire%2F&amp;amp;v=1&amp;amp;libid=1308982026386&amp;amp;out=http%3A%2F%2Fwww.amazon.com%2Fdp%2F0199744254%2F&amp;amp;ref=http%3A%2F%2Fwww.google.com%2Fsearch%3Fsourceid%3Dchrome%26ie%3DUTF-8%26q%3Dneuroscience%2Bof%2Bdesire&amp;amp;title=The%20Neuroscience%20of%20Desire%20-%20Less%20Wrong&amp;amp;txt=Foundations%20of%20Neuroeconomic%20Analaysis&quot;&gt;Foundations of Neuroeconomic Analaysis&lt;/a&gt;&lt;/em&gt;. Oxford University Press.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Mountcastle (2005). &lt;em&gt;&lt;a href=&quot;http://www.amazon.com/Sensory-Hand-Mechanisms-Somatic-Sensation/dp/0674019741&quot;&gt;The Sensory Hand: Neural Mechanisms of Somatic Sensation&lt;/a&gt;&lt;/em&gt;. Harvard University Press.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Robinson &amp;amp; McAlpine (2009). Gain control mechanisms in the auditory pathway. &lt;em&gt;Current Opinion in Neurobiology, 19&lt;/em&gt;: 402-407.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Schroeder (2009). &lt;a href=&quot;http://plato.stanford.edu/entries/desire/&quot;&gt;Desire&lt;/a&gt;. &lt;em&gt;Stanford Encyclopedia of Philosoph&lt;/em&gt;y.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Squire, Berg, Bloom, du Lac, &amp;amp; Ghosh, eds. (2008). &lt;em&gt;&lt;a href=&quot;http://api.viglink.com/api/click?format=go&amp;amp;key=9f37ca02a1e3cbd4f3d0a3618a39fbca&amp;amp;loc=http%3A%2F%2Flesswrong.com%2Flw%2F6cv%2Fentangled_with_reality_the_shoelace_example%2F&amp;amp;v=1&amp;amp;libid=1308985302093&amp;amp;out=http%3A%2F%2Fwww.amazon.com%2FNeuroscience-Textbook-Set-Fundamental-Squire%2Fdp%2F0123740193%2F&amp;amp;ref=http%3A%2F%2Flesswrong.com%2F&amp;amp;title=Entangled%20with%20Reality%3A%20The%20Shoelace%20Example%20-%20Less%20Wrong&amp;amp;txt=Fundamental%20Neuroscience%2C%20Third%20Edition&quot;&gt;Fundamental Neuroscience, Third Edition&lt;/a&gt;&lt;/em&gt;. Academic Press.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Stevens (1951). &lt;em&gt;Handbook of Experimental Psychology, 1st edition&lt;/em&gt;. John Wiley &amp;amp; Sons.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Stevens (1970). Neural events and the psychophysical law. &lt;em&gt;Science, 170&lt;/em&gt;: 1043-1050.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Stevens (1975). &lt;em&gt;&lt;a href=&quot;http://www.amazon.com/Psychophysics-Introduction-Perceptual-Neural-Prospects/dp/0887386431/&quot;&gt;Psychophysics: Introduction to its Perceptual, Neural and Social Prospects&lt;/a&gt;&lt;/em&gt;. Wiley.&lt;/small&gt;&lt;/p&gt;
&lt;p&gt;&lt;small&gt;Tversky &amp;amp; Kahneman (1981). &lt;a href=&quot;http://www.brainvitge.org/papers/tverski_kahneman.pdf&quot;&gt;The framing of decisions and the psychology of choice&lt;/a&gt;. &lt;em&gt;Science, 211&lt;/em&gt;: 453-458.&lt;/small&gt;&lt;/p&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/lw/6da/do_humans_want_things/#comments"&gt;52 comments&lt;/a&gt;
</description>
</item>
<item>
<title>Assuming Nails</title>
<link>http://lesswrong.com/lw/2ai/assuming_nails/</link>
<guid isPermaLink="true">http://lesswrong.com/lw/2ai/assuming_nails/</guid>
<pubDate>Tue, 06 Jul 2010 08:26:00 +1000</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/Psychohistorian"&gt;Psychohistorian&lt;/a&gt;
&amp;bull;
6 votes
&amp;bull;
&lt;a href="http://lesswrong.com/lw/2ai/assuming_nails/#comments"&gt;28 comments&lt;/a&gt;
&lt;div&gt;&lt;p style=&quot;text-align: justify; background: white;&quot;&gt;&lt;span style=&quot;font-family: Arial; color: black;&quot;&gt;Tangential followup to &lt;a href=&quot;/lw/2cv/defeating_ugh_fields_in_practice/&quot;&gt;&lt;span style=&quot;color: purple;&quot;&gt;Defeating Ugh Fields in Practice&lt;/span&gt;&lt;/a&gt;.&lt;br&gt;Somewhat related to &lt;a href=&quot;/lw/19m/privileging_the_hypothesis/&quot;&gt;&lt;span style=&quot;color: #6a8a6b;&quot;&gt;Privileging the Hypothesis&lt;/span&gt;&lt;/a&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;p style=&quot;text-align: justify; background: white;&quot;&gt;&lt;span style=&quot;font-family: Arial; color: black;&quot;&gt;&lt;strong&gt;Edited to add:&lt;/strong&gt;&lt;br&gt;I'm surprised by negative/neutral reviews.&amp;#xA0;This means that either I'm simply wrong about what counts as interesting, or I haven't expressed my point very well. Based on commenter response, I think the problem is the latter. In the next week or so, expect a much more concise version of this post that expresses my point about epistemology without the detour through a criticism of economics.&lt;br&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p style=&quot;text-align: justify; background: white;&quot;&gt;&lt;span style=&quot;font-family: Arial; color: black;&quot;&gt;At the beginning of my &lt;a href=&quot;/lw/2cv/defeating_ugh_fields_in_practice/&quot;&gt;&lt;span style=&quot;color: #800080;&quot;&gt;last post&lt;/span&gt;&lt;/a&gt;, I was rather uncharitable to neoclassical economics:&lt;/span&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p style=&quot;text-align: justify; background: white;&quot;&gt;&lt;span style=&quot;font-family: Arial; color: black;&quot;&gt;If I had to choose a single piece of evidence off of which to argue that the rationality assumption of neoclassical economics is totally, irretrievably incorrect, it's&amp;#xA0;&lt;a href=&quot;http://www.nytimes.com/2010/06/14/health/14meds.html&quot;&gt;&lt;span style=&quot;color: #3d3d3e;&quot;&gt;this article about financial incentives and medication compliance.&lt;/span&gt;&lt;/a&gt;... [to maintain that this theory is correct]&amp;#xA0;is to crush reality into a theory that cannot hold it.&amp;#xA0;&lt;/span&gt;&lt;span style=&quot;font-family: Arial;&quot;&gt;&amp;#xA0;&amp;#xA0; &lt;/span&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p style=&quot;text-align: justify; background: white;&quot;&gt;&lt;span style=&quot;font-family: Arial; color: black;&quot;&gt;Some mistook this to mean that I believe neoclassical economists honestly, explicitly believe that all people are always totally rational. But, to quote Rick Moranis, &quot;It's not what you think. It's far, far worse.&quot; The problem is that they often take the complex framework of neoclassical economics and believe that a valid deduction &lt;em&gt;within&lt;/em&gt;&amp;#xA0;this framework is a valid deduction about the &lt;em&gt;real world. &lt;/em&gt;However, deductions within any given framework are entirely uninformative unless&amp;#xA0;the framework corresponds to reality. But, because such deductions are internally valid, we often give them far more weight than they are due. Testing the fit of a theoretical framework to reality is &lt;em&gt;hard, &lt;/em&gt;but a valid deduction within a framework feels so very satisfying. But even if you have a fantastically engineered hammer, you cannot go around assuming everything you want&amp;#xA0;to use it on&amp;#xA0;is a nail. It is all too common for experts to assume that their framework applies cleanly to the real world simply because it works so well in its own world.&lt;/span&gt;&lt;/p&gt;
&lt;p style=&quot;text-align: justify; background: white;&quot;&gt;&lt;span style=&quot;font-family: Arial; color: black;&quot;&gt;If this concept doesn't make perfect sense, that's what the rest of this post is about: spelling out exactly how we go wrong when we misuse the essentially circular models of many sciences, and how this matters. We will begin with the one discipline in which this problem does not occur.&amp;#xA0;The one discipline which appears immune to this type of problem is mathematics,&amp;#xA0;the paragon of &quot;pure&quot; academic disciplines. This is principally because mathematics appears to have perfect conformity with reality, with no&amp;#xA0;research or experimentation needed to ensure said conformity.&amp;#xA0;The entire system of mathematics exists, in a sense, in its own world. You could sit in windowless room (perhaps one with a supercomputer) and, theoretically, derive every major theorem of mathematics, given the proper axioms. The answer to the most difficult unsolved problems in mathematics was determined the moment the terms and operators within them were defined - once you say a &quot;circle&quot; is &quot;a convex polygon with every point equidistant from a center,&quot; you have already determined every single digit of pi. The problem is finding out exactly how this model works - making calculations and deductions within this model. In the case of mathematics, for whatever reason, the model conforms&amp;#xA0;&lt;/span&gt;&lt;span style=&quot;font-family: Arial;&quot;&gt;perfectly to the real world, so any valid mathematical deduction is a valid deduction in the real world.&lt;/span&gt;&lt;/p&gt;
&lt;p style=&quot;text-align: justify; background: white;&quot;&gt;&lt;span style=&quot;font-family: Arial; color: black;&quot;&gt;This is not the case in any true science, which by necessity&amp;#xA0;must rely on experiment and observation. Every science operates off of some simplified model of the world, at least with our current state of knowledge. This creates two avenues of progress: discoveries within the model, which allow one to make predictions about the world, and refinements of the model, which make such predictions more accurate. If we have an internally consistent framework, t&lt;/span&gt;&lt;span style=&quot;font-family: Arial;&quot;&gt;heoretical manipulation within our model will &lt;em&gt;never &lt;/em&gt;show us our error, because our model is circular and functions outside the real world. It would be like trying to predict a stock market crash by analyzing the rules of Monopoly, except that it doesn't &lt;em&gt;feel &lt;/em&gt;absurd. There's nothing wrong with the model&amp;#xA0;&lt;em&gt;qua&lt;/em&gt;&amp;#xA0;the model, the problem is with the model&amp;#xA0;&lt;em&gt;qua&lt;/em&gt;&amp;#xA0;reality, and we have to look at&amp;#xA0;&lt;em&gt;both&amp;#xA0;&lt;/em&gt;of them to figure that out.&lt;/span&gt;&lt;/p&gt;
&lt;p style=&quot;text-align: justify; background: white;&quot;&gt;&lt;span style=&quot;font-family: Arial; color: black;&quot;&gt;Economics is one of the fields that most suffers from this problem. Our mathematician in his windowless room could generate models of international exchange rates without ever having seen currency, once we gave him the appropriate definitions and assumptions. However, when we try using these models to forecast the future, life gets complicated. No amount of experimenting within our original model will fix this without looking at the real world. At best, we come up with some equations that appear to conform to what we observe, but we run the risk that the correspondence is incidental or that there were some (temporarily) constant variables we left out that will suddenly cease to be constant and break the whole model. It is all too easy to forget that the tremendous rigor and certainty we feel when we solve the equations of our model does not translate into the real world.&amp;#xA0; Getting the &quot;right&quot; answer within the model is not the same thing as getting the real answer.&lt;/span&gt;&lt;/p&gt;
&lt;p style=&quot;text-align: justify; background: white;&quot;&gt;&lt;span style=&quot;font-family: Arial; color: black;&quot;&gt;As an obvious practical example, an individual with a serious excess of free time could develop a model of economics which assumes that agents are&amp;#xA0;&lt;a href=&quot;http://wiki.lesswrong.com/wiki/Paperclip_maximizer&quot;&gt;&lt;span style=&quot;color: #6a8a6b;&quot;&gt;rational paper-clip maximizers &lt;/span&gt;&lt;/a&gt;- that agents are rational and their ultimate concern is maximizing the number of existing paper-clips. Given even more free time and a certain amount of genius, you could even model the behaviour of &lt;em&gt;irrational&lt;/em&gt;&amp;#xA0;paper-clip&amp;#xA0;maximizers, so long as you had a definition of irrational. But however refined these models are, they models will remain entirely useless unless you actually have some paper-clip maximizers whose behaviour you want to predict. And even then, you would need to evaluate your predictions after they succeed or fail. Developing a great hammer is relatively useless if the thing you need to make must be put together with screws.&amp;#xA0;&lt;/span&gt;&lt;/p&gt;
&lt;p style=&quot;text-align: justify; background: white;&quot;&gt;&lt;span style=&quot;font-family: Arial; color: black;&quot;&gt;There is an obvious difference in the magnitude of this problem between the sciences, and it seems to be based on the difficulty of experimenting within them. In harder sciences where experiments are fairly straightforwards, like physics and chemistry, it is not terribly difficult to make models that conform well with reality. The bleeding edge of, say, physics, tends to like in areas that are either extremely hard to observe, like the subatomic, or extremely computation-intensive. In softer sciences, experiments are very difficult, and our models rely much more on powerful assumptions, social values, and armchair reasoning.&lt;/span&gt;&lt;/p&gt;
&lt;p style=&quot;text-align: justify; background: white;&quot;&gt;&lt;span style=&quot;font-family: Arial; color: black;&quot;&gt;As humans, we are both bound and compelled to use the tools we have at our disposal. The problem here is one of uncertainty. We know that most of our assumptions in economics are empirically off, but we don't know &lt;em&gt;how&lt;/em&gt; wrong or how much that matters when we make predictions. But the model nevertheless seeps into the very core of our model of reality itself. We cannot feel this disconnect when we try to make predictions; a well-designed model feels so complete that there is no feeling of error when we try to apply it. This is likely&amp;#xA0;because we are applying it correctly, but it just doesn't apply to reality. This leads people to have high degrees of certainty and yet frequently be wrong. It would not surprise me if the failure of many experts to appreciate the model-reality gap is responsible for a large proportion of incorrect predictions.&lt;/span&gt;&lt;/p&gt;
&lt;p style=&quot;text-align: justify; background: white;&quot;&gt;&lt;span style=&quot;font-family: Arial; color: black;&quot;&gt;This, unfortunately, is not the end of the problem. It gets much worse when you add a normative element into your model, when you get to call some things, &quot;efficient&quot; or &quot;healthful,&quot; or &quot;normal,&quot; or &quot;insane.&quot; There is also a serious&amp;#xA0;question as&amp;#xA0;to whether this false&amp;#xA0;certainty is preferable to the vague unfalsifiability of even softer social sciences.&amp;#xA0;But I shall save these subjects for future posts. &lt;/span&gt;&lt;/p&gt;
&lt;p style=&quot;margin: 0in 0in 0pt;&quot; class=&quot;MsoNormal&quot;&gt;&lt;span style=&quot;font-family: Arial;&quot;&gt;&amp;#xA0;&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/lw/2ai/assuming_nails/#comments"&gt;28 comments&lt;/a&gt;
</description>
</item>
<item>
<title>Cryonics Wants To Be Big</title>
<link>http://lesswrong.com/lw/2f5/cryonics_wants_to_be_big/</link>
<guid isPermaLink="true">http://lesswrong.com/lw/2f5/cryonics_wants_to_be_big/</guid>
<pubDate>Mon, 05 Jul 2010 17:50:29 +1000</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/lsparrish"&gt;lsparrish&lt;/a&gt;
&amp;bull;
26 votes
&amp;bull;
&lt;a href="http://lesswrong.com/lw/2f5/cryonics_wants_to_be_big/#comments"&gt;160 comments&lt;/a&gt;
&lt;div&gt;&lt;p&gt;Cryonics scales very well. People who argue from the perspective that cryonics is costly are probably not aware of this fact. Even assuming you needed to come up with the &lt;strong&gt;lump sum&lt;/strong&gt; all at once rather than steadily pay into life insurance, the fact is that most people would be able to afford it if most people wanted it. There are some basic physical reasons why this is the case.&lt;/p&gt;
&lt;p&gt;So long as you keep the shape constant, for any given container the surface area is based on a &lt;a href=&quot;http://en.wikipedia.org/wiki/Square-cube_law&quot;&gt;square law&lt;/a&gt; while the volume is calculated as a cube law. For example with a simple cube shaped object, one side squared times 6 is the surface area; one side cubed is the volume. Spheres, domes, and cylinders are just more efficient variants on this theme. For any constant shape, if volume is multiplied by 1000, surface area only goes up by 100 times.&lt;/p&gt;
&lt;p&gt;Surface area is where heat gains entry. Thus if you have a huge container holding cryogenic goods (humans in this case) it costs less per unit volume (human) than is the case with a smaller container that is equally well insulated. A way to understand why this works is to realize that you only have to insulate and cool the outside edge -- the inside does not collect any new heat. In short, by multiplying by a thousand patients, you can have a tenth of the thermal transfer to overcome per patient with no change in r-value.&lt;/p&gt;
&lt;p&gt;But you aren't limited to using equal thickness of insulation. You can use thicker insulation, but get a much smaller proportional effect on total surface area when you use bigger container volumes. Imagine the difference between a marble sized freezer and a house-sized freezer. What happens when you add an extra foot of insulation to the surface of each? Surface area is impacted much as diameter is -- i.e. more significantly in the case of the smaller freezer than the larger one. The outer edge of the insulation is where it begins collecting heat. With a truly gigantic freezer, you could add an entire meter (or more) of insulation without it having a significant proportional impact on surface area, compared to how much surface area it already has. (This is one reason cheaper materials can be used to construct large tanks -- they can be applied in thicker layers.)&lt;/p&gt;
&lt;p&gt;Another factor to take into account is that liquid nitrogen, the super-cheap coolant used by cryonics facilities around the world, is vastly cheaper (more than a factor of 10) when purchased in huge quantities of several tons. The scaling factors for storage tanks and high-capacity tanker trucks are a big part of the reason for this. CI has used bulk purchasing as a mechanism for getting their prices &lt;a href=&quot;http://cryonics.org/ci.html&quot;&gt;down&lt;/a&gt; &lt;a href=&quot;http://cryonics.org/cryostats.html&quot;&gt;to&lt;/a&gt; $100 per patient per year for their newer tanks. They are actually storing 3,000 gallons of the stuff and using it slowly over time, which implies there is a boiloff rate associated with the 3,000 gallon tank in addition to the tanks.&lt;/p&gt;
&lt;p&gt;The conclusion I get from this is that there is a very strong self-interested case (as well as the altruistic case) to be made for the promotion of &lt;strong&gt;megascale&lt;/strong&gt; cryonics towards the mainstream, as opposed to small independently run units for a few of us die-hard futurists. People who say they won't sign up for cost reasons may actually (if they are sincere) be reachable at a later date. To deal with such people's objections and make sure they remain reachable, it might be smart to get them to agree with some particular hypothetical price point at which they would feel it is justified. In large enough quantities, it is conceivable that &lt;em&gt;indefinite&lt;/em&gt; storage costs would be as low as $50 per person, or 50 cents per year.&lt;/p&gt;
&lt;p&gt;That is much cheaper than saving a life any other way. Of course there's still the risk that it might not work. However, given a sufficient chance of it working it could still be morally superior to other life saving strategies that cost more money. It also has inherent ecological advantages over other forms of life-saving in that it temporarily reduces the active population, giving the environment a chance to recover and green tech more time to take hold so that they can be supported sustainably and comfortably. And we might consider the advent of life-health extension in the future to be a reason to think&amp;#xA0; it a qualitatively &lt;em&gt;better&lt;/em&gt; form of life-saving.&lt;/p&gt;
&lt;p&gt;Note: This article only looks directly at cooling energy costs; construction and ongoing maintenance do not necessarily scale as dramatically. The same goes for stabilization (which I view as a separate though indispensable enterprise). Both of these do have obvious scaling factors however. Other issues to consider are defense and reliability. Given the large storage mass involved, preventing temperature fluctuations without being at the exact boiling temperature of LN2 is feasible; it could be both highly failsafe and use the ideal cryonics temperature of -135C rather than the -196C that LN2 boiloff as a temperature regulation mechanism requires. Feel free to raise further issues in the comments.&lt;/p&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/lw/2f5/cryonics_wants_to_be_big/#comments"&gt;160 comments&lt;/a&gt;
</description>
</item>
<item>
<title>What Cost for Irrationality?</title>
<link>http://lesswrong.com/lw/2et/what_cost_for_irrationality/</link>
<guid isPermaLink="true">http://lesswrong.com/lw/2et/what_cost_for_irrationality/</guid>
<pubDate>Fri, 02 Jul 2010 04:25:06 +1000</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/Kaj_Sotala"&gt;Kaj_Sotala&lt;/a&gt;
&amp;bull;
54 votes
&amp;bull;
&lt;a href="http://lesswrong.com/lw/2et/what_cost_for_irrationality/#comments"&gt;105 comments&lt;/a&gt;
&lt;div&gt;&lt;p&gt;&lt;em&gt;This is the first part in a &lt;a href=&quot;/tag/whatintelligencetestsmiss/&quot;&gt;mini-sequence&lt;/a&gt; presenting content from Keith E. Stanovich's excellent book &lt;a href=&quot;http://www.amazon.com/What-Intelligence-Tests-Miss-Psychology/dp/030012385X&quot;&gt;What Intelligence Tests Miss: The psychology of rational thought&lt;/a&gt;. It will culminate in a review of the book itself.&lt;/em&gt;&lt;br&gt;&lt;br&gt;People who care a lot about &lt;a href=&quot;/lw/31/what_do_we_mean_by_rationality/&quot;&gt;rationality&lt;/a&gt; may frequently be asked why they do so. There are various answers, but I think that &lt;a href=&quot;/lw/go/why_truth_and/&quot;&gt;many of ones discussed here&lt;/a&gt; won't be very persuasive to people who don't already have an interest in the issue. But in real life, most people don't try to stay healthy because of various &lt;a href=&quot;http://wiki.lesswrong.com/wiki/Near/far_thinking&quot;&gt;far-mode&lt;/a&gt; arguments for the virtue of health: instead, they try to stay healthy in order to avoid various forms of illness. In the same spirit, I present you with a list of real-world events that have been caused by failures of rationality, so that you might better &lt;a href=&quot;/lw/vk/back_up_and_ask_whether_not_why/&quot;&gt;persuade others&lt;/a&gt; of this being important.&lt;/p&gt;
&lt;p&gt;What happens if you, or the people around you, are not rational? Well, in order from least serious to worst, you may...&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Have a worse quality of living&lt;/strong&gt;. &lt;em&gt;&lt;a href=&quot;http://wiki.lesswrong.com/wiki/Status_quo_bias&quot;&gt;Status Quo bias&lt;/a&gt; &lt;/em&gt;is a general human tendency to prefer the default state, regardless of whether the default is actually good or not. In the 1980's, Pacific Gas and Electric conducted a survey of their customers. Because the company was serving a lot of people in a variety of regions, some of their customers suffered from more outages than others. Pacific Gas asked customers with unreliable service whether they'd be willing to pay extra for more reliable service, and customers with reliable service whether they'd be willing to accept a less reliable service in exchange for a discount. The customers were presented with increases and decreases of various percentages, and asked which ones they'd be willing to accept. The percentages were same for both groups, only with the other having increases instead of decreases. Even though both groups had the same income, customers of both groups overwhelmingly wanted to stay with their status quo. Yet the service difference between the groups was large: the unreliable service group suffered 15 outages per year of 4 hours' average duration and the reliable service group suffered 3 outages per year of 2 hours' average duration! (Though &lt;a href=&quot;/lw/2et/what_cost_for_irrationality/283a&quot;&gt;note&lt;/a&gt; &lt;a href=&quot;/lw/2et/what_cost_for_irrationality/2833&quot;&gt;caveats&lt;/a&gt;.)&lt;/p&gt;
&lt;p&gt;A study by Philips Electronics found that one half of their products had nothing wrong in them, but the consumers couldn't figure out how to use the devices. This can be partially explained by egocentric bias on behalf of the engineers. Cognitive scientist Chip Heath notes that he has &quot;a DVD remote control with 52 buttons on it, and every one of them is there because some engineer along the line knew how to use that button and believed I would want to use it, too. People who design products are experts... and they can't imagine what it's like to be as ignorant as the rest of us.&quot;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Suffer financial harm.&lt;/strong&gt; John Allen Paulos is a professor of mathematics at Temple University. Yet he fell prey to serious irrationality which began when he purchased WorldCom stock at $47 per share in early 2000. As bad news about the industry began mounting, WorldCom's stock price started falling - and as it did so, Paulos kept buying, regardless of accumulating evidence that he should be selling. Later on, he admitted that his &quot;purchases were not completely rational&quot; and that &quot;I bought shares even though I knew better&quot;. He was still buying - partially on borrowed money - when the stock price was $5. When it momentarily rose to $7, he finally decided to sell. Unfortunately, he didn't get off from work until the market closed, and on the next market day the stock had lost a third of its value. Paulos finally sold everything, at a huge loss.&lt;a id=&quot;more&quot;&gt;&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Stock market losses due to irrationality are not atypical. From the beginning of 1998 to the end of 2001, the Firsthand Technology Value mutual fund had an average gain of 16 percent per year. Yet the average investor who invested in the fund &lt;em&gt;lost&lt;/em&gt; 31.6 percent of her money over the same period. Investors actually lost a total of $1.9 &lt;em&gt;billion&lt;/em&gt; by investing in a fund which was producing 16 percent of a profit per year. That happened because the fund was very volatile, causing people to invest and cash out at exactly the wrong times. When it gained, it gained a lot, and when it lost, it lost a lot. When people saw that it had been making losses, they sold, and when they saw it had been making gains, they bought. In other words, they bought when high and sold when low - exactly the opposite of what you're supposed to do if you want to make a profit. Reporting on a study of 700 mutual funds during 1998-2001, finanical reporter Jason Zweig noted that &quot;to a remarkable degree, investors underperformed their funds' reported returns - sometimes by as much as 75 percentage points per year.&quot;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Be manipulated and robbed of personal autonomy.&lt;/strong&gt; Subjects were asked to divide 100 usable livers to 200 children awaiting a transplant. With two groups of children, group A with 100 children and group B with 100 children, the overwhelming response was to allocate 50 livers to each, which seems reasonable. But when group A had 100 children, each with an 80 percent chance of surviving when transplanted, and group B had 100 children, each with a 20 percent chance of surviving when transplanted, people still chose the equal allocation method even if this caused the unnecessary deaths of 30 children. Well, that's just a question of values and not rationality, right? Turns out that if the patients were ranked from 1 to 200 in terms of prognosis, people were relatively comfortable with distributing organs to the top 100 patients. It was only when the question was framed as &quot;group A versus group B&quot; that people suddenly felt they didn't want to abandon group B entirely. Of course, these are exactly the same dilemma. One could almost say that the person who got to choose which framing to use was &lt;em&gt;getting to decide on behalf of the people being asked the question.&lt;br&gt;&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;Two groups of subjects were given information about eliminating affirmative action and adopting a race-neutral policy at several universities. One group was told that under race-neutral conditions, the probability of a black student being admitted would decline from 42 percent to 13 percent and the probability of a white student being admitted would rise from 25 percent to 27 percent. The other group was told that under race-neutral admissions, the number of black students being admitted would decrease by 725 and the number of white students would increase by 725. These two framings were both saying the same thing, but you can probably guess the outcome: support for affirmative action was much higher in the percentage group.&lt;/p&gt;
&lt;p&gt;In a hypothetical country, a family with no children and an income of $35,000 pays $4,000 in tax, while a family with no children and an income of $100,000 pays $26,000 in tax. Now suppose that there's a $500 tax reduction for having a child for a family with an income of $35,000. Should the family with an income of $100,000 be given a larger reduction because of their higher income? Here, most people would say no. But suppose that instead, the baseline is that a family of two children with an income of $35,000 pays $3,000 in tax and that a family of two children with an income of $100,000 pays $25,000 in tax. We propose to make the families with no children pay more tax - that is, have a &quot;childless penalty&quot;. Say that the family with the income of $100,000 and one child has their taxes set at $26,000 and the same family with no children has their taxes set at $27,000 - there's a childless penalty of $1,000 per child. Should the poorer family which makes $35,000 and has no children also pay the same $2,000 childless penalty as the richer family? Here, most people would also say no - they'd want the &quot;bonus&quot; for children to be equal for low- and high-income families, but they do not want the &quot;penalty&quot; for lacking children to be the high for same and low income.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;End up falsely accused or imprisoned.&lt;/strong&gt; In 2003, an attorney was released from prison in England when her conviction of murdering her two infants was overturned. Five months later, another person was released from prison when her charge of having murdered her children was also overturned. In both cases, the evidence presented against them had been ambiguous. What had convinced the jury was that in both cases, a pediatrician had testified that the odds of two children in the same family dying of infant death syndrome was 73 million to 1. Unfortunately, he had arrived to this figure by squaring the odds of a single death. Squaring the odds of a single event to arrive at the odds for it happening twice only works if the two events are independent. But that assumption is likely to be false in the case of multiple deaths in the same family, where numerous environmental and genetic factors may have affected both deaths.&lt;/p&gt;
&lt;p&gt;In the late 1980s and early 1990s, many parents were excited and overjoyed to hear of a technique coming out of Australia that enabled previously totally non-verbal autistic children to communicate. It was uncritically promoted in highly visible media such as &lt;em&gt;60 Minutes, Parade &lt;/em&gt;magazine and the &lt;em&gt;Washington Post&lt;/em&gt;. The claim was that autistic individuals and other children with developmental disabilities who'd previously been nonverbal had typed highly literate messages on a keyboard when their hands and arms had been supported over by the typewriter by a sympathetic &quot;facilitator&quot;. As Stanovich describes: &quot;Throughout the early 1990s, behavioral science researchers the world over watched in horrified anticipation, almost as if observing cars crash in slow motion, while a predictable tragedy unfolded before their eyes.&quot; The hopes of countless parents were dashed when it was shown that the &quot;facilitators&quot; had been - consciously or unconsciously - directing the children's hands on the right keys. It &lt;em&gt;should&lt;/em&gt; have been obvious that spreading such news before the technique had been properly scientifically examined was dangerously irresponsible - and it gets worse. During some &quot;faciliation&quot; sessions, children &quot;reported&quot; having been sexually abused by their parents, and were removed from their homes as a result. (Though they were eventually returned.)&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;End up dead.&lt;/strong&gt; After 9/11, people became afraid of flying and started doing so less. Instead, they began driving more. Unfortunately, car travel has a much higher chance of death than air travel. Researchers have estimated that over 300 more people died in the last months of 2001 because they drove instead of flying. Another group calculated that for flying to be as dangerous as driving, there would have to be an incident on the scale of 9/11 once a month!&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Have your society collapse.&lt;/strong&gt; Possibly even more horrifying is the tale of Albania, which had previously been a communist dictatorship but had made considerable financial progress from 1992 to 1997. In 1997, however, &lt;em&gt;one half of the adult population&lt;/em&gt; had fallen victim to &lt;a href=&quot;http://en.wikipedia.org/wiki/Ponzi_scheme&quot;&gt;Ponzi schemes&lt;/a&gt;. In a Ponzi scheme, the investment itself isn't actually making any money, but rather early investors are paid off with the money from late investors, and eventually the system has to collapse when no new investors can be recruited. But when schemes offering a 30 percent &lt;em&gt;monthly&lt;/em&gt; return began to become popular in Albania, competitors offering a 50-60 or even a 100 percent monthly return soon showed up, and people couldn't resist the temptation. Eventually both the government and economy of Albania collapsed. Stanovich describes:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;People took out mortgages on their homes in order to participate. Others sold their homes. Many put their entire life savings into the schemes. At their height, an amount equal to 50 percent of the country's GDP was invested in Ponzi schemes. Before the schemes collapsed, they actually began to compete with wage income and distort the economy. For example, one business owner saw his workforce quickly slip from 130 employees to 70 because people began to hink they could invest in the Ponzi schemes instead of actually working for their income.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;The estimated death toll was &lt;a href=&quot;http://en.wikipedia.org/wiki/1997_rebellion_in_Albania&quot;&gt;between 1,700 and 2,000&lt;/a&gt;.&lt;/p&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/lw/2et/what_cost_for_irrationality/#comments"&gt;105 comments&lt;/a&gt;
</description>
</item>
<item>
<title>Defeating Ugh Fields In Practice</title>
<link>http://lesswrong.com/lw/2cv/defeating_ugh_fields_in_practice/</link>
<guid isPermaLink="true">http://lesswrong.com/lw/2cv/defeating_ugh_fields_in_practice/</guid>
<pubDate>Sun, 20 Jun 2010 05:37:44 +1000</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/Psychohistorian"&gt;Psychohistorian&lt;/a&gt;
&amp;bull;
60 votes
&amp;bull;
&lt;a href="http://lesswrong.com/lw/2cv/defeating_ugh_fields_in_practice/#comments"&gt;92 comments&lt;/a&gt;
&lt;div&gt;&lt;p&gt;Unsurprisingly related to:&amp;#xA0;&lt;a href=&quot;/lw/21b/ugh_fields/&quot;&gt;Ugh fields&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;If I had to choose a single piece of evidence off of which to argue that the rationality assumption of neoclassical economics is totally, irretrievably incorrect, it's &lt;a href=&quot;http://www.nytimes.com/2010/06/14/health/14meds.html&quot;&gt;this article about financial incentives and medication compliance.&lt;/a&gt;&amp;#xA0;In short, offering people small cash incentives vastly improves their adherence to life-saving medical regimens. That's right. For a significant number of people, a small chance at winning $10-100 can be the difference between whether or not they stick to a regimen that has a very good chance of saving their life. This technique has even shown promise in getting drug addicts and psychiatric patients to adhere to their regimens, for as little as a $20 gift certificate. This problem, in the aggregate, is estimated to cost about 5% of total health care spending -$100 billion - and that may not properly account for the utility lost by those who are harmed beyond repair. To claim that people are making a reasoned decision between the payoffs of taking and not-taking their medication, and that they be persuaded to change their behaviour by a payoff of about $900 a year (or less), is to crush reality into a theory that cannot hold it. This is doubly true when you consider that some of these people were fairly affluent.&amp;#xA0;&lt;/p&gt;
&lt;p&gt;A likely explanation of this detrimental irrationality is something close to an&amp;#xA0;&lt;a style=&quot;color: #8a8a8b; text-decoration: inherit;&quot; href=&quot;/lw/21b/ugh_fields/&quot;&gt;Ugh field&lt;/a&gt;. It must be miserable having a life-threatening illness. Being reminded of it by taking a pill every single day (or more frequently) is not pleasant. Then there's the question of whether you already took the pill. Because if you take it twice in one day, you'll end up in the hospital. And&amp;#xA0;Heaven forfend your treatment involves needles.&amp;#xA0;Thus, people avoid taking their medicine because the process becomes so unpleasant, even though they know they really should be taking it.&lt;/p&gt;
&lt;p&gt;As this&amp;#xA0;experiment&amp;#xA0;shows, this serious problem has a simple and elegant solution: make taking their medicine fun. As one person in the article describes it, using a low-reward lottery made taking his meds &quot;like a game;&quot; he couldn't wait to check the dispenser to see if he'd won (and take his meds again). Instead of thinking about how they have some terrible condition, they get excited thinking about how they could be winning money. The Ugh field has been demolished, with the once-feared procedure now associated with a tried-and-true intermittent reward system. It also wouldn't surprise me the least if people who are unlikely to adhere to a medical regimen are the kind of people who really enjoy playing the lottery.&lt;a id=&quot;more&quot;&gt;&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;This also explains why rewarding success may be more useful than punishing failure in the long run: if a kid does his homework because otherwise he doesn't get dessert, it's labor. If he gets some reward for getting it done, it becomes a positive. The problem is that if she knows what the reward is, she may anchor on already having the reward, turning it back into negative reinforcement - if you promise your kid a trip to Disneyland if they get above a 3.5, and they get a 3.3, they feel like they actually&amp;#xA0;&lt;em&gt;lost&lt;/em&gt;&amp;#xA0;something. The use of a gambling mechanism may be key for this. If your reward is a &lt;em&gt;chance&lt;/em&gt;&amp;#xA0;at a real reward, you don't anchor as already having the reward, but the reward still excites you.&lt;/p&gt;
&lt;p&gt;I believe that the fact that such a significant problem can be overcome with such a trivial solution has tremendous implications, the enumeration of all of which would make for a very unwieldy post. A particularly noteworthy issue is the difficulty of applying such a technique to one's own actions, a problem which I believe has a fairly large number of workable solutions. That's what comments, and, potentially, follow-up posts are for.&amp;#xA0;&lt;/p&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/lw/2cv/defeating_ugh_fields_in_practice/#comments"&gt;92 comments&lt;/a&gt;
</description>
</item>
<item>
<title>Blue- and Yellow-Tinted Choices</title>
<link>http://lesswrong.com/lw/290/blue_and_yellowtinted_choices/</link>
<guid isPermaLink="true">http://lesswrong.com/lw/290/blue_and_yellowtinted_choices/</guid>
<pubDate>Fri, 14 May 2010 08:35:42 +1000</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/Yvain"&gt;Yvain&lt;/a&gt;
&amp;bull;
43 votes
&amp;bull;
&lt;a href="http://lesswrong.com/lw/290/blue_and_yellowtinted_choices/#comments"&gt;54 comments&lt;/a&gt;
&lt;div&gt;&lt;blockquote&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;&lt;sub&gt;&lt;em&gt;A man comes to the rabbi and complains about his life: &quot;I have almost no money, my wife is a shrew, and we live in a small apartment with seven unruly kids. It's messy, it's noisy, it's smelly, and I don't want to live.&quot;&lt;br&gt;The rabbi says, &quot;Buy a goat.&quot;&lt;br&gt;&quot;What? I just told you there's hardly room for nine people, and it's messy as it is!&quot;&lt;br&gt;&quot;Look, you came for advice, so I'm giving you advice. Buy a goat and come back in a month.&quot;&lt;br&gt;In a month the man comes back and he is even more depressed: &quot;It's gotten worse! The filthy goat breaks everything, and it stinks and makes more noise than my wife and seven kids! What should I do?&quot;&lt;br&gt;The rabbi says, &quot;Sell the goat.&quot;&lt;br&gt;A few days later the man returns to the rabbi, beaming with happiness: &quot;Life is wonderful! We enjoy every minute of it now that there's no goat - only the nine of us. The kids are well-behaved, the wife is agreeable - and we even have some money!&quot;&lt;/em&gt;&lt;/sub&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;&amp;#xA0;&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;&lt;em&gt;-- traditional Jewish joke&lt;/em&gt;&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;&amp;#xA0;&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;&lt;strong&gt;Related to:&lt;/strong&gt; &lt;a href=&quot;/lw/j7/anchoring_and_adjustment/&quot;&gt;Anchoring and Adjustment&lt;/a&gt;&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;&amp;#xA0;&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;Biases are &amp;#x201C;cognitive illusions&amp;#x201D; that work on the same principle as optical illusions, and a knowledge of the latter can be profitably applied to the former. Take, for example, these two cubes (source: &lt;a href=&quot;http://www.lottolab.org/articles/illusionsoflight.asp&quot;&gt;Lotto Lab&lt;/a&gt;, via Boing Boing):&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;&amp;#xA0;&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in; padding-left: 120px;&quot;&gt;&lt;img src=&quot;http://www.raikoth.net/Stuff/color-cube-illusion.jpg&quot; alt=&quot;Colored cube illusion&quot; height=&quot;235&quot; width=&quot;452&quot;&gt;&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;&amp;#xA0;&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;The &amp;#x201C;blue&amp;#x201D; tiles on the top face of the left cube are the same color as the &amp;#x201C;yellow&amp;#x201D; tiles on the top face of the right cube; if you're skeptical you can prove it with the eyedropper tool in Photoshop (in which both shades come out a rather ugly gray).&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;&amp;#xA0;&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;The illusion works because visual perception is relative. Outdoor light on a sunny day can be ten thousand times greater than a fluorescently lit indoor room. As one psychology book put it: for a student reading this book outside, the black print will be objectively lighter than the white space will be for a student reading the book inside. Nevertheless, both students will perceive the white space as subjectively white and the black space as subjectively black, because the visual system returns to consciousness information about relative rather than absolute lightness. In the two cubes, the visual system takes the yellow or blue tint as a given and outputs to consciousness the colors of each pixel compared to that background.&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;&amp;#xA0;&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;So this optical illusion occurs when the brain judges quantities relative to their surroundings rather than based on some objective standard. What's the corresponding cognitive illusion?&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;&lt;a id=&quot;more&quot;&gt;&lt;/a&gt;&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;&amp;#xA0;&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;In &lt;a href=&quot;http://www.amazon.com/Predictably-Irrational-Revised-Expanded-Decisions/dp/0061353248/ref=sr_1_2?ie=UTF8&amp;amp;s=books&amp;amp;qid=1273789887&amp;amp;sr=8-2&quot;&gt;&lt;em&gt;Predictably Irrational&lt;/em&gt;&lt;/a&gt; (relatively recommended, even though the latter chapters sort of fail to live up to the ones mentioned here) Dan Ariely asks his students to evaluate (appropriately) three subscription plans to the Economist:&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;&amp;#xA0;&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in; padding-left: 180px;&quot;&gt;&lt;img src=&quot;http://www.raikoth.net/Stuff/econ_sub1.gif&quot; alt=&quot;Economist subscription table&quot; height=&quot;307&quot; width=&quot;350&quot;&gt;&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;&amp;#xA0;&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;Ariely asked his subjects which plan they'd buy if they needed an Economist subscription. 84% wanted the combo plan, 16% wanted the web only plan, and no one wanted the print only plan. After all, the print plan cost exactly the same as the print + web plan, but the print + web plan was obviously better. Which raises the question: why even include a print-only plan? Isn't it something of a waste of space?&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;&amp;#xA0;&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;Actually, including the print-only plan turns out to be a very good business move for the Economist. Ariely removed the print-only plan from the choices. Now the options looked like this.&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;&amp;#xA0;&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in; padding-left: 180px;&quot;&gt;&lt;img src=&quot;http://www.raikoth.net/Stuff/econ_sub2.gif&quot; alt=&quot;Economist subscription table, one option removed&quot;&gt;&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;&amp;#xA0;&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;There shouldn't be any difference. After all, he'd only removed the plan no one chose, the plan no sane person would choose.&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;&amp;#xA0;&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;This time, 68% of students chose the web only plan and 32% the combo plan. That's a 52% shift in preferences between the exact same options.&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;&amp;#xA0;&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;The rational way to make the decision is to compare the value of a print subscription to the Economist (as measured by the opportunity cost of that money) to the difference in cost between the web and combo subscriptions. But this would return the same answer in both of the above cases, so the students weren't doing it that way.&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;&amp;#xA0;&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;What it looks like the students were doing was perceiving relative value in the same way the eye perceives relative color. The ugly gray of the cube appeared blue when it was next to something yellow, and yellow when it was next to something blue. In the same way, the $125 cost of the combo subscription looks like good value next to a worse deal, and bad value next to a better deal.&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;&amp;#xA0;&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;When the $125 combo subscription was placed next to a $125 plan with fewer features (print only instead of print plus web) it looked like a very good deal &amp;#x2013; the equivalent of placing an ugly gray square next to something yellow to make it look blue. Take away the yellow, or the artificially bad deal, and it doesn't look nearly as attractive.&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;&amp;#xA0;&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;This is getting deep into Dark Arts territory, and according to Predictably Irrational, the opportunity to use these powers for evil has not gone unexploited. Retailers will deliberately include in their selection a super deluxe luxury model much fancier and more expensive than they expect anyone to ever want. The theory is that consumers are balancing a natural hedonism that tells them to get the best model possible against a commitment to financial prudence. So most consumers, however much they like television, will have enough good sense to avoid buying a $2000 TV. But if the retailer carries a $4000 super-TV, the $2000 TV suddenly doesn't look quite so bad.&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;&amp;#xA0;&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;The obvious next question is &amp;#x201C;How do I use this knowledge to trick hot girls or guys into going out with me?&amp;#x201D; Dan Ariely decided to run some experiments on his undergraduate class. He took photographs of sixty students, then asked other students to rate their attractiveness. Next, he grouped the photos into pairs of equally attractive students. And next, he went to Photoshop and made a slightly less attractive version of each student: a blemish here, an asymmetry there.&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;&amp;#xA0;&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;Finally, he went around campus, finding students and showing them three photographs and asking which person the student would like to go on a date with. Two of the photographs were from one pair of photos ranked equally attractive. The third was a version of one of the two, altered to make it less attractive. So, for example, he might have two people, Alice and Brenda, who had been ranked equally attractive, plus a Photoshopped ugly version of Brenda.&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;&amp;#xA0;&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;The students overwhelmingly (75%) chose the person with the ugly double (Brenda in the example above), even though the two non-Photoshopped faces were equally attractive. Ariely then went so far as to recommend in his book that for best effect, you should go to bars and clubs with a wingman who is similar to you but less attractive. Going with a random ugly person would accomplish nothing, but going with someone similar to but less attractive than you would put you into a reference class and then bump you up to the top of the reference class, just like in the previous face experiment.&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;&amp;#xA0;&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;Ariely puts these studies in a separate chapter from his studies on &lt;a href=&quot;/lw/j7/anchoring_and_adjustment/&quot;&gt;anchoring and adjustment&lt;/a&gt; (which are also very good) but it all seems like the same process to me: being more interested in the difference between two values than in the absolute magnitude of them. All that makes anchoring and adjustment so interesting is that the two values have nothing in common with one another.&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;&amp;#xA0;&lt;/p&gt;
&lt;p style=&quot;margin-bottom: 0in;&quot;&gt;This process also has applications to happiness set points, status seeking, morality, dieting, larger-scale purchasing behavior, and akrasia which deserve a separate post&lt;/p&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/lw/290/blue_and_yellowtinted_choices/#comments"&gt;54 comments&lt;/a&gt;
</description>
</item>
</channel>
</rss>