<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:media="http://search.yahoo.com/mrss/">
<channel>
<title>
Articles Tagged ‘newcomb’ - Less Wrong Discussion
</title> <link>http://lesswrong.com/r/discussion/</link>
<description></description>
<item>
<title>Simulating Problems</title>
<link>http://lesswrong.com/r/discussion/lw/gie/simulating_problems/</link>
<guid isPermaLink="true">http://lesswrong.com/r/discussion/lw/gie/simulating_problems/</guid>
<pubDate>Thu, 31 Jan 2013 00:14:28 +1100</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/Andreas_Giger"&gt;Andreas_Giger&lt;/a&gt;
&amp;bull;
1 votes
&amp;bull;
&lt;a href="http://lesswrong.com/r/discussion/lw/gie/simulating_problems/#comments"&gt;41 comments&lt;/a&gt;
&lt;div&gt;&lt;p&gt;Apologies for the rather mathematical nature of this post, but it seems to have some implications for topics relevant to LW. Prior to posting I looked for literature on this but was unable to find any; pointers would be appreciated.&lt;/p&gt;
&lt;p&gt;In short, my question is: How can we prove that any simulation of a problem really simulates the problem?&lt;/p&gt;
&lt;p&gt;I want to demonstrate that this is not as obvious as it may seem by using the example of Newcomb's Problem. The issue here is of course Omega's omniscience. If we construct a simulation with the rules (payoffs) of Newcomb, an Omega that is always right, and an interface for the agent to interact with the simulation, will that be enough?&lt;/p&gt;
&lt;p&gt;Let's say we simulate Omega's prediction by a coin toss and repeat the simulation (without payoffs) until the coin toss matches the agent's decision. This seems to adhere to all specifications of Newcomb and is (if the coin toss is hidden) in fact indistinguishable from it from the agent's perspective. However, if the agent knows how the simulation works, a CDT agent will one-box, while it is assumed that the same agent would two-box in 'real' Newcomb. Not telling the agent how the simulation works is never a solution, so this simulation appears to not actually simulate Newcomb.&lt;/p&gt;
&lt;p&gt;Pointing out differences is of course far easier than proving that none exist. Assuming there's a problem we have no idea which decisions agents would make, and we want to build a real-world simulation to find out exactly that. How can we prove that this simulation really simulates the problem?&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;p&gt;(Edit: Apparently it wasn't apparent that this is about problems in terms of game theory and decision theory. Newcomb, Prisoner's Dilemma, Iterated Prisoner's Dilemma, Monty Hall, Sleeping Beauty, Two Envelopes, that sort of stuff. Should be clear now.)&lt;/p&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/r/discussion/lw/gie/simulating_problems/#comments"&gt;41 comments&lt;/a&gt;
</description>
</item>
<item>
<title>A solvable Newcomb-like problem - part 3 of 3</title>
<link>http://lesswrong.com/r/discussion/lw/fr7/a_solvable_newcomblike_problem_part_3_of_3/</link>
<guid isPermaLink="true">http://lesswrong.com/r/discussion/lw/fr7/a_solvable_newcomblike_problem_part_3_of_3/</guid>
<pubDate>Fri, 07 Dec 2012 00:06:24 +1100</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/Douglas_Reay"&gt;Douglas_Reay&lt;/a&gt;
&amp;bull;
3 votes
&amp;bull;
&lt;a href="http://lesswrong.com/r/discussion/lw/fr7/a_solvable_newcomblike_problem_part_3_of_3/#comments"&gt;3 comments&lt;/a&gt;
&lt;div&gt;&lt;p&gt;This is the third part of a three post sequence on a problem that is similar to &lt;a href=&quot;http://wiki.lesswrong.com/wiki/Newcomb%27s_problem&quot;&gt;Newcomb's problem&lt;/a&gt; but is posed in terms of probabilities and limited knowledge.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&amp;#xA0; Part 1 - stating the problem&lt;br&gt;&amp;#xA0;&amp;#xA0; Part 2 - some mathematics&lt;br&gt;&amp;#xA0;&amp;#xA0; Part 3 - towards a solution&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;hr&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;p&gt;In many situations we can say &quot;For practical purposes a probability of 0.9999999999999999999 is close enough to 1 that for the sake of simplicity I shall treat it as being 1, without that simplification altering my choices.&quot;&lt;/p&gt;
&lt;p&gt;However, there are some situations where the distinction does significantly alter that character of a situation so, when one is studying a new situation and one is not sure yet which of those two categories the situations falls into, the cautious approach is to re-frame the probability as being (1 - &amp;#x3B4;) where &amp;#x3B4; is small (eg 10 to the power of -12), and then examine the characteristics of the behaviour as &amp;#x3B4; tends towards 0.&lt;/p&gt;
&lt;p&gt;LessWrong wiki describes Omega as a super-powerful AI analogous to Laplace's demon, who knows the precise location and momentum of every atom in the universe, limited only by the laws of physics (so, if time travel isn't possible and some of our current thoughts on Quantum Mechanics are correct, then Omega's knowledge of the future is probabilistic, being limited by uncertainty).&lt;/p&gt;
&lt;p&gt;For the purposes of Newcomb's problem, and the rationality of Fred's decisions, it doesn't matter how close to that level of power Omega actually is.&amp;#xA0;&amp;#xA0; What matters, in terms of rationality, is the evidence available to Fred about how close Omega is to having to that level of power; or, more precisely, the evidence available to Fred relevant to Fred making predictions about Omega's performance in this particular game.&lt;/p&gt;
&lt;p&gt;Since this is a key factor in Fred's decision, we ought to be cautious.&amp;#xA0; Rather than specify when setting up the problem that Fred knows with a certainty of 1 that Omega does have that power, it is better to specify a concrete level of evidence that would lead Fred to assign a probability of (1 - &amp;#x3B4;) to Omega having that power, then examine the effect upon which option to the box problem it is rational for Fred to pick, as &amp;#x3B4; tends towards 0.&lt;/p&gt;
&lt;p&gt;The Newcomb-like problem stated in part 1 of this sequence contains an Omega that it is rational for Fred to assign a less than unity probability of being able to perfectly predict Fred's choices.&amp;#xA0; By using bets as analogies to the sort of evidence Fred might have available to him, we create an explicit variable that we can then manipulate to alter the precise probability Fred assigns to Omega's abilities.&lt;/p&gt;
&lt;p&gt;The other nice feature of the Newcomb-like problem given in part 1, is that it is explicitly solvable using the mathematics given in part 2.&amp;#xA0; By making randomness an external feature (the device Fred brings with him) rather than purely a feature of Fred's internal mind, we can acknowledge the question of Omega being able to predict quantum events, capture it as a variable, and take it into account when setting out the payoff matrix for the problem.&lt;/p&gt;
&lt;p&gt;This means that, instead of Fred having to think &quot;When I walked into this room I was determined to pick one-box.&amp;#xA0; As far as anyone knew or could predict, including myself, I intended to pick one-box.&amp;#xA0; However nothing I do now can change Omegas decision - the money is already in the box.&amp;#xA0; So I've nothing to lose by changing my mind.&quot;; Fred can now allocate a specific probability to whether Omega could predict Fred's chance of changing his mind in such circumstances, and Fred can take that into account in his strategy by making his chance of changing strategy explicit and external - basing it upon a random number device.&lt;/p&gt;
&lt;p&gt;Or, to put it another way, we are modelling a rational human who has a specific finite chance of talking himself into over riding a pre-committed strategy, as being made up from two components: a component that will infallibly stick to a pre-committed strategy plus a component with a known chance of change; we then treat the combined rational human as being someone infallibly committed to a meta-strategy that includes a chance of change - a mixed equilibrium, from Omega's point of view.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;p&gt;Ok, time to look at the numbers and draw a pretty diagram...&lt;/p&gt;
&lt;p&gt;Fred is Player A, and he has two meta options:&lt;br&gt;&amp;#xA0; A1 - play it safe.&amp;#xA0; 100% chance of one-box and 0% chance of two-box&lt;br&gt;&amp;#xA0; A2 - take a risk.&amp;#xA0;&amp;#xA0; Mentally pre-commit to using the device to produce 99% chance of one-box and 1% chance of two-box.&lt;/p&gt;
&lt;p&gt;Omega is Player B, and he has two meta options:&lt;br&gt;&amp;#xA0; B1 - reward risk.&amp;#xA0; Not attempt to distinguish between the mental state of Fred taking 1% risk and Fred playing it safe.&lt;br&gt;&amp;#xA0; B2 - punish risk.&amp;#xA0; Attempt to distinguish and, if Omega guesses Fred is taking risk rather than playing safe, punish it.&lt;/p&gt;
&lt;p&gt;We'll start out by assuming that if Omega does attempt to distinguish, then Omega will have a 1 in 10,000 false positive rate (thinking Fred is going to use the device, when actually Fred intends to play it safe) and a 1 in 10,000 false negative rate (thinking Fred is going to play it safe, when actually Fred intends to use the device).&lt;/p&gt;
&lt;h3&gt;A1 vs B1&lt;/h3&gt;
&lt;p&gt;Fred gains $1,000,000&lt;br&gt;Omega loses $1,000,000 to Fred but gains $1,000,000,000 from Alpha, for a net gain of $999,000,000&lt;/p&gt;
&lt;h3&gt;A2 vs B1&lt;/h3&gt;
&lt;p&gt;99% of the time, Fred gains $1,000,000 and Omega net gains $999,000,000&lt;br&gt;1% of the time, Fred gains $1,001,000 and Omega net loses $10,001,001,000&lt;/p&gt;
&lt;p&gt;Combining those gives an average of:&lt;br&gt;Fred gains: $1,000,010&lt;br&gt;Omega gains: $979,008,999&lt;/p&gt;
&lt;h3&gt;A1 vs B2&lt;/h3&gt;
&lt;p&gt;99.99% of the time, Omega correctly discerns that Fred is playing safe&lt;br&gt;Fred gains $1,000,000&lt;br&gt;Omega gains $999,000,000&lt;/p&gt;
&lt;p&gt;0.01% of the time, Omega falsely believes that Fred is taking a risk, and punishes that by putting $0 in Box A&lt;br&gt;Fred gains $0&lt;br&gt;Omega loses $10,000,000,000&lt;/p&gt;
&lt;p&gt;Combining those gives an average of:&lt;br&gt;Fred gains: $999,900&lt;br&gt;Omega gains: $997,900,100&lt;/p&gt;
&lt;h3&gt;A2 vs B2&lt;/h3&gt;
&lt;p&gt;In 100 trials out of 1,000,000 trials Omega incorrectly thinks Fred will play it safe, when actually Fred takes the risk of using the device.&amp;#xA0; Of these:&lt;/p&gt;
&lt;p style=&quot;padding-left: 30px;&quot;&gt;In 1 trial out of 1,000,000 trials: Omega incorrectly thinks Fred will play it safe, when actually Fred takes the risk of using the device and, in this case, the device picks two-box&lt;br&gt;==&amp;gt; Fred gains $1,001,000&lt;br&gt;==&amp;gt; Omega loses $10,001,001,000&lt;/p&gt;
&lt;p style=&quot;padding-left: 30px;&quot;&gt;In 99 trials out of 1,000,000 trials: Omega incorrectly thinks Fred will play it safe, when actually Fred takes the risk of using the device and, in this case, the device picks one-box&lt;br&gt;==&amp;gt; Fred gains $1,000,000&lt;br&gt;==&amp;gt; Omega gains $999,000,000&lt;/p&gt;
&lt;p&gt;In 999,900 trials out of 1,000,000 trials Omega correctly thinks that Fred is going to take the risk of using the device.&amp;#xA0; Of those:&lt;/p&gt;
&lt;p style=&quot;padding-left: 30px;&quot;&gt;In 9,999 trials out of 1,000,000 trials: Omega correctly thinks that Fred is going to take the risk of using the device and, in this case, the device picks two-box&lt;br&gt;==&amp;gt; Fred gains $1,000&lt;br&gt;==&amp;gt; Omega gains $999,999,000&lt;/p&gt;
&lt;p style=&quot;padding-left: 30px;&quot;&gt;In 989,901 trials out of 1,000,000 trials: Omega correctly thinks that Fred is going to take the risk of using the device and, in this case, the device picks one-box&lt;br&gt;==&amp;gt; Fred gains $0&lt;br&gt;==&amp;gt; Omega loses $10,000,000,000&lt;/p&gt;
&lt;p&gt;Combining those gives an average of:&lt;br&gt;Fred gains $110&lt;br&gt;Omega loses $9,888,922,110&lt;/p&gt;
&lt;p&gt;&lt;img src=&quot;http://www.chiark.greenend.org.uk/~douglasr/images/payoff_matrix8.jpg&quot; alt=&quot;&quot; height=&quot;600&quot; width=&quot;600&quot;&gt;&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;p&gt;If Fred's best estimate, beforehand, of the chances of Omega choosing meta strategy B1 is P, then&lt;/p&gt;
&lt;p&gt;Fred's expected gain from choosing meta strategy A1 would be: $1,000,000 P + $999,999 (1-P)&lt;br&gt;Fred's expected gain from choosing meta strategy A2 would be: $1,000,010 P + $110 (1-P)&lt;/p&gt;
&lt;p&gt;We can ask ourselves at what value of P these two strategies would return the same expected gain:&lt;/p&gt;
&lt;p&gt;$1,000,000 P + $999,999 (1-P)&amp;#xA0;&amp;#xA0; =&amp;#xA0;&amp;#xA0;&amp;#xA0; $1,000,010 P + $110 (1-P)&lt;br&gt;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0; $999,889 (1-P)&amp;#xA0;&amp;#xA0; =&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0; $10 P&lt;br&gt;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0; $999,889&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0; =&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0; $999,899 P&lt;br&gt;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0; P&amp;#xA0;&amp;#xA0;&amp;#xA0; = 0.99999&lt;br&gt;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0; (1-P)&amp;#xA0; = 0.00001&lt;/p&gt;
&lt;p&gt;In other words, Fred thinking that Omega has even a 1 in 100,000 chance of being vindictive should be enough to deter Fred from taking the risky strategy.&lt;/p&gt;
&lt;p&gt;But how does that look from Omega's point of view?&amp;#xA0;&amp;#xA0; If Omega thinks that Fred's chance of picking meta strategy A1 is Q, then what is the cost to Omega of picking B2 1 in 100,000 times?&lt;/p&gt;
&lt;p&gt;Omega's expected gain from choosing meta strategy B1 would be: $999,000,000 Q + $979,008,999 (1-Q)&lt;br&gt;Omega's expected gain from choosing meta strategy B2 would be: $997,900,100 Q - $9,888,922,110 (1-Q)&lt;/p&gt;
&lt;p&gt;0.99999 { $999,000,000 Q + $979,008,999 (1-Q)&amp;#xA0; } + 0.00001 { $997,900,100 Q - $9,888,922,110 (1-Q) }&lt;br&gt;= (1 - 0.00001) { $979,008,999 + $19,991,001 Q } + 0.00001 { - $9,888,922,110&amp;#xA0; + $10,886,822,210 Q&amp;#xA0; }&lt;br&gt;= $979,008,999 + $19,991,001 Q + 0.00001 { - $9,888,922,110&amp;#xA0; + $10,886,822,210 Q - $979,008,999 - $19,991,001 Q }&lt;br&gt;= $979,008,999 + $19,991,001 Q + 0.00001 { $9,907,813,211 + $10,866,831,209 Q }&lt;br&gt;= ( $979,008,999 + $99,078.13211) + ( $19,991,001 + $108,668.31209 ) Q&lt;br&gt;= $979,108,077 + $20,099,669 Q&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;p&gt;Perhaps a meta strategy of 1% chance of two-boxing is not Fred's optimal meta strategy.&amp;#xA0; Perhaps, at that level compared to Omega's ability to discern, it is still worth Omega investing in being vindictive occasionally, in order to deter Fred from taking risk.&amp;#xA0;&amp;#xA0; But, given sufficient data about previous games, Fred can make a guess at Omega's ability to discern.&amp;#xA0; And, likewise Omega, by including in the record of past games occasions when Omega has falsely accused a human player of taking risk, can signal to future players where Omega's boundaries are.&amp;#xA0;&amp;#xA0; We can plot graphs of these to find the point at which Fred's meta strategy and Omega's meta strategy are in equilibrium - where if Fred took any larger chances, it would start becoming worth Omega's while to punish risk sufficiently often that it would no longer be in Fred's interests to take the risk.&amp;#xA0;&amp;#xA0; Precisely where that point is will depend on the numbers we picked in Part 1 of this sequence.&amp;#xA0; By exploring the space created by using each variable number as a dimension, we can divide it into regions characterised by which strategies dominate within that region.&lt;/p&gt;
&lt;p&gt;Extrapolating that as &amp;#x3B4; tends towards 0 should then carry us closer to a convincing solution to Newcomb's Problem.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;hr&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;p&gt;&amp;#xA0; Back to &lt;a href=&quot;/r/discussion/lw/fpd/a_solvable_newcomblike_problem_part_1_of_3/&quot;&gt;Part 1 - stating the problem&lt;/a&gt;&lt;br&gt;&amp;#xA0; Back to &lt;a href=&quot;/r/discussion/lw/fr6/a_solvable_newcomblike_problem_part_2_of_3/&quot;&gt;Part 2 - some mathematics&lt;/a&gt;&lt;br&gt;&amp;#xA0; This is&amp;#xA0;&amp;#xA0; Part 3 - towards a solution&lt;/p&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/r/discussion/lw/fr7/a_solvable_newcomblike_problem_part_3_of_3/#comments"&gt;3 comments&lt;/a&gt;
</description>
</item>
<item>
<title>A solvable Newcomb-like problem - part 2 of 3</title>
<link>http://lesswrong.com/r/discussion/lw/fr6/a_solvable_newcomblike_problem_part_2_of_3/</link>
<guid isPermaLink="true">http://lesswrong.com/r/discussion/lw/fr6/a_solvable_newcomblike_problem_part_2_of_3/</guid>
<pubDate>Tue, 04 Dec 2012 03:49:38 +1100</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/Douglas_Reay"&gt;Douglas_Reay&lt;/a&gt;
&amp;bull;
0 votes
&amp;bull;
&lt;a href="http://lesswrong.com/r/discussion/lw/fr6/a_solvable_newcomblike_problem_part_2_of_3/#comments"&gt;1 comment&lt;/a&gt;
&lt;div&gt;&lt;p&gt;This is the second part of a three post sequence on a problem that is similar to &lt;a href=&quot;http://wiki.lesswrong.com/wiki/Newcomb%27s_problem&quot;&gt;Newcomb's problem&lt;/a&gt; but is posed in terms of probabilities and limited knowledge.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&amp;#xA0; Part 1 - stating the problem&lt;br&gt;&amp;#xA0;&amp;#xA0; Part 2 - some mathematics&lt;br&gt;&amp;#xA0;&amp;#xA0; Part 3 - towards a solution&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;hr&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;p&gt;In game theory, a payoff matrix is a way of presenting the results of two players simultaneously picking options.&lt;/p&gt;
&lt;p&gt;For example, in the Prisoner's Dilemma, Player A gets to choose between option A1 (Cooperate) and option A2 (Defect) while, at the same time Player B gets to choose between option B1 (Cooperate) and option B2 (Defect).&amp;#xA0;&amp;#xA0; Since years spent in prison are a negative outcome, we'll write them as negative numbers:&lt;/p&gt;
&lt;p&gt;&lt;img src=&quot;http://www.chiark.greenend.org.uk/~douglasr/images/prisoners_dilemma_payoff_matrix2.jpg&quot; alt=&quot;payoff&quot; height=&quot;250&quot; width=&quot;250&quot;&gt;&lt;/p&gt;
&lt;p&gt;So, if you look at the bottom right hand corner, at the intersection of Player A defecting (A2) and Player B defecting (B2) we see that both players end up spending 4 years in prison.&amp;#xA0;&amp;#xA0; Whereas, looking at the bottom left we see that if A defects and B cooperates, then Player A ends up spending 0 years in prison and Player B ends up spending 5 years in prison.&lt;/p&gt;
&lt;p&gt;Another familiar example we can present in this form is the game &lt;a href=&quot;http://www.netlaputa.ne.jp/~tokyo3/e/janken_e.html&quot;&gt;Rock-Paper-Scissors&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;We could write it as a zero sum game, with a win being worth 1, a tie being worth 0 and a loss being worth -1:&lt;/p&gt;
&lt;p&gt;But it doesn't change the mathematics if we give both players 2 points each round just for playing, so that a win becomes worth 3 points, a tie becomes worth 2 points and a loss becomes worth 1 point.&amp;#xA0; (Think of it as two players in a game show being rewarded by the host, rather than the players making a direct bet with each other.)&lt;/p&gt;
&lt;p&gt;&lt;img src=&quot;http://www.chiark.greenend.org.uk/~douglasr/images/payoff_matrix4.jpg&quot; alt=&quot;&quot; height=&quot;375&quot; width=&quot;375&quot;&gt;&lt;/p&gt;
&lt;p&gt;If you are Player A, and you are playing against a Player B who always chooses option B1 (Rock), then your strategy is clear.&amp;#xA0; You choose option A2 (Paper) each time.&amp;#xA0; Over 10 rounds, you'd expect to end up with $30 compared to B's $10.&lt;/p&gt;
&lt;p&gt;Let's imagine a slightly more sophisticated Player B, who always picks Rock in the first round, and then for all other rounds picks whatever would beat Player A's choice the previous round.&amp;#xA0;&amp;#xA0; This strategy would do well against someone who always picked the same option each round, but it is deterministic and, if we guess it correctly in advance, we can design a strategy that beats it every time.&amp;#xA0; (In this case, picking Paper-Rock-Scissors then repeating back to Paper). &amp;#xA0; In fact whatever strategy B comes up with, if that strategy is deterministic and we guess it in advance, then we end up with $30 and B ends up with $10.&lt;/p&gt;
&lt;p&gt;What if B has a deterministic strategy that B picked in advance and doesn't change, but we don't know at the start of the first round what it is?&amp;#xA0;&amp;#xA0; In theory B might have picked any of the 3-to-the-power-of-10 deterministic strategies that are indistinguishable from each other over a 10 round duel but, in practice, humans tend to favour some strategies over others so, if you know humans and the game of Rock-Paper-Scissors better than Player B does, you have a better than even chance of guessing his pattern and coming out ahead in the later rounds of the duel.&lt;/p&gt;
&lt;p&gt;But there's a danger to that.&amp;#xA0; What if you have overestimated your comparative knowledge level and Player B uses your overconfidence to lure you into thinking you've cracked B's pattern, while really B is laying a trap, increasing the predictability of Player A's moves so Player B can then take advantage of that to work out which moves will trump them?&amp;#xA0; This works better in a game like poker, where the stakes are not the same each round, but it is still possible in Rock-Paper-Scissors, and you can imagine variants of the game where the host varies payoff matrix by increasing the lose-tie-win rewards from 1,2,3 in the first round, to 2,4,6 in the second round, 3,6,9 in the third round, and so on.&lt;/p&gt;
&lt;p&gt;This is why the safest strategy is to not to have a deterministic strategy but, instead, use a source of random bits to each round pick option 1 with a probability of 33%, option 2 with a probability of 33% or option 3 with a probability of 33% (modulo rounding).&amp;#xA0; You might not get to take advantage of any predictability that becomes apparent in your opponents strategy, but neither can you be fooled into becoming predictable yourself.&lt;/p&gt;
&lt;p&gt;On a side note, this still applies even when there is only one round, because unaided humans are not as good at coming up with random bits as they think they are.&amp;#xA0; Someone who has observed many first time players will notice that first time players more often than not choose as their Rock as their 'random' first move, rather than Paper or Scissors.&amp;#xA0; If such a person were confident that they were playing a first time player, they might therefore pick Paper as their first move more frequently than not.&amp;#xA0; Things soon get very Sicilian (in the sense of &lt;a href=&quot;https://www.youtube.com/watch?v=E2y40U2LvKY&quot;&gt;the duel between Westley and Vizzini in the film The Princess Bride&lt;/a&gt;) after that, because a yet more sophisticated player who guessed their opponent would try this, could then pick Scissors.&amp;#xA0; And so ad infinitum, with ever more implausible levels of discernment being required to react on the next level up.&lt;/p&gt;
&lt;p&gt;We can imagine a tournament set up between 100 players taken randomly from the expertise distribution of game players, each player submitting a python program that always plays the same first move, and for each of the remaining 9 rounds produces a move determined solely by the the moves so far in that duel.&amp;#xA0; The tournament organiser would then run every player's program once against the programs of each of the other 99 players, so on average each player would collect 99x10x2 = $1,980&lt;/p&gt;
&lt;p&gt;We could make things more complex by allowing the programs to use, as an input, how much money their opponent has won so far during the tournament; or iterate over running the tournament several times, to give each player an 'expertise' rating which the program in the following tournament could then use.&amp;#xA0; We could allow the tournament host to subtract from each player a sum of money depending upon the size of program that player submitted (and how much memory or cpu it used).&amp;#xA0;&amp;#xA0; We could give each player a limited ration of random bits, so when facing a player with a higher expertise rating they might splurge and make their move on all 10 rounds completely random, and when facing a player with a lower expertise they might conserve their supply by trying to 'out think' them.&lt;/p&gt;
&lt;p&gt;There are various directions we could take this, but the one I want to look at here is what happens when you make the payoff matrix asymmetric.&amp;#xA0; What happens if you make the game unfair, so not only does one player have more at stake than the other player, but the options are not even either, for example:&lt;/p&gt;
&lt;p&gt;&lt;img src=&quot;http://www.chiark.greenend.org.uk/~douglasr/images/payoff_matrix5.jpg&quot; alt=&quot;&quot;&gt;&lt;/p&gt;
&lt;p&gt;You still have the circular Rock-Paper-Scissors dynamic where:&lt;br&gt;&amp;#xA0;&amp;#xA0; If B chose B3, then A wants most to have chosen A1&lt;br&gt;&amp;#xA0;&amp;#xA0; If A chose A1, then B wants most to have chosen B2&lt;br&gt;&amp;#xA0;&amp;#xA0; If B chose B2, then A wants most to have chosen A3&lt;br&gt;&amp;#xA0;&amp;#xA0; If A chose A3, then B wants most to have chosen B1&lt;br&gt;&amp;#xA0;&amp;#xA0; If B chose B1, then A wants most to have chosen A2&lt;br&gt;&amp;#xA0;&amp;#xA0; If A chose A2, then B wants most to have chosen B3&lt;/p&gt;
&lt;p&gt;so everything wins against at least one other option, and loses against at least one other option.&amp;#xA0;&amp;#xA0; However Player B is clearly now in a better position, because B wins ties, and B's wins (a 9, an 8 and a 7) tend to be larger than A's wins (a 9, a 6 and a 6).&lt;/p&gt;
&lt;p&gt;What should Player A do?&amp;#xA0; Is the optimal safe strategy still to pick each option with an equal weighting?&lt;/p&gt;
&lt;p&gt;Well, it turns out the answer is: no, an equal weighting isn't the optimal response.&amp;#xA0;&amp;#xA0; Neither is just picking the same 'best' option each time.&amp;#xA0; Instead what do you is pick your 'best' option a bit more frequently than an equal weighting would suggest, but not so much that the opponent can steal away that gain by reliably choosing the specific option that trumps yours.&amp;#xA0;&amp;#xA0; Rather than duplicate material already well presented on the web, I will point you at two lecture courses on game theory that explain how to calculate the exact probability to assign to each option:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&quot;&lt;a href=&quot;http://oyc.yale.edu/economics/econ-159#sessions&quot;&gt;ECON 159: Game Theory&lt;/a&gt;&quot;, from Open Yale Courses&lt;/li&gt;
&lt;li&gt;&quot;&lt;a href=&quot;http://www.youtube.com/course?list=ECKI1h_nAkaQoDzI4xDIXzx6U2ergFmedo&quot;&gt;Game Theory 101: The Complete Series&lt;/a&gt;&quot;, by William Spaniel&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;You do this by using the indifference theorem to arrive at a set of linear equations, which you can then solve to arrive at a mixed equilibrium where neither player increases their expected utility by altering the probability weightings they assign to their options.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&quot;http://www.eprisner.de/MAT109/Mixedb.html#3.2&quot;&gt;Example of calculating the general case for a 3x3 payoff matrix&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;http://www.eprisner.de/MAT109/AnalysisVNMPOKER.html#2.1&quot;&gt;More complex example, drawn from poker&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;http://people.hofstra.edu/stefan_waner/RealWorld/Summary4.html#min&quot;&gt;Summary&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;h2&gt;&lt;span style=&quot;color: #ff6600;&quot;&gt;The TL;DR; points to take away&lt;/span&gt;&lt;br&gt;&lt;/h2&gt;
&lt;p&gt;If you are competing in what is effectively a simultaneous option choice game, with a being who you suspect may have an equal or higher expertise to you at the game, you can nullify their advantage by picking a strategy that, each round chooses randomly (using a weighting) between the available options.&lt;/p&gt;
&lt;p&gt;Depending upon the details of the payoff matrix, there may be one option that it makes sense for you to pick most of the time but, unless that option is strictly better than all your other choices no matter what option your opponent picks, there is still utility to gain from occasionally picking the other options in order to keep your opponent on their toes.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;hr&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;p&gt;&amp;#xA0; Back to &lt;a href=&quot;/r/discussion/lw/fpd/a_solvable_newcomblike_problem_part_1_of_3/&quot;&gt;Part 1 - stating the problem&lt;/a&gt;&lt;br&gt;&amp;#xA0; This is&amp;#xA0; Part 2 - some mathematics&lt;br&gt;&amp;#xA0; Next to &lt;a href=&quot;/r/discussion/lw/fr7/a_solvable_newcomblike_problem_part_3_of_3/&quot;&gt;Part 3 - towards a solution&lt;/a&gt;&lt;/p&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/r/discussion/lw/fr6/a_solvable_newcomblike_problem_part_2_of_3/#comments"&gt;1 comment&lt;/a&gt;
</description>
</item>
<item>
<title>A solvable Newcomb-like problem - part 1 of 3</title>
<link>http://lesswrong.com/r/discussion/lw/fpd/a_solvable_newcomblike_problem_part_1_of_3/</link>
<guid isPermaLink="true">http://lesswrong.com/r/discussion/lw/fpd/a_solvable_newcomblike_problem_part_1_of_3/</guid>
<pubDate>Mon, 03 Dec 2012 20:26:46 +1100</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/Douglas_Reay"&gt;Douglas_Reay&lt;/a&gt;
&amp;bull;
1 votes
&amp;bull;
&lt;a href="http://lesswrong.com/r/discussion/lw/fpd/a_solvable_newcomblike_problem_part_1_of_3/#comments"&gt;10 comments&lt;/a&gt;
&lt;div&gt;&lt;p&gt;This is the first part of a three post sequence on a problem that is similar to &lt;a href=&quot;http://wiki.lesswrong.com/wiki/Newcomb%27s_problem&quot;&gt;Newcomb's problem&lt;/a&gt; but is posed in terms of probabilities and limited knowledge.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&amp;#xA0; Part 1 - stating the problem&lt;br&gt;&amp;#xA0;&amp;#xA0; Part 2 - some mathematics&lt;br&gt;&amp;#xA0;&amp;#xA0; Part 3 - towards a solution&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;hr&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;p&gt;Omega is an AI, living in a society of AIs, who wishes to enhance his reputation in that society for being successfully able to predict human actions.&amp;#xA0; Given some exchange rate between money and reputation, you could think of that as a bet between him and another AI, let's call it Alpha.&amp;#xA0; And since there is also a human involved, for the sake of clarity, to avoid using &quot;you&quot; all the time, I'm going to sometimes refer to the human using the name &quot;Fred&quot;.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;p&gt;Omega tells Fred:&lt;/p&gt;
&lt;p&gt;I'd like you to pick between two options, and I'm going to try to predict which option you're going to pick.&lt;br&gt;&amp;#xA0;&amp;#xA0;&amp;#xA0; Option &quot;one box&quot; is to open only box A, and take any money inside it&lt;br&gt;&amp;#xA0;&amp;#xA0;&amp;#xA0; Option &quot;two box&quot; is to open both box A and box B, and take any money inside them&lt;/p&gt;
&lt;p&gt;but, before you pick your option, declare it, then open the box or boxes, there are three things you need to know.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;p&gt;Firstly, you need to know the terms of my bet with Alpha.&lt;/p&gt;
&lt;p&gt;If Fred picks option &quot;one box&quot; then:&lt;br&gt;&amp;#xA0;&amp;#xA0; If box A contains $1,000,000 and box B contains $1,000 then Alpha pays Omega $1,000,000,000&lt;br&gt;&amp;#xA0;&amp;#xA0; If box A contains $0&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0; and box B contains $1,000 then Omega pays Alpha $10,000,000,000&lt;br&gt;&amp;#xA0;&amp;#xA0; If anything else, then both Alpha and Omega pay Fred $1,000,000,000,000&lt;/p&gt;
&lt;p&gt;If Fred picks option &quot;two box&quot; then:&lt;br&gt;&amp;#xA0;&amp;#xA0; If box A contains $1,000,000 and box B contains $1,000 then Omega pays Alpha $10,000,000,000&lt;br&gt;&amp;#xA0;&amp;#xA0; If box A contains $0&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0; and box B contains $1,000 then Alpha pays Omega $1,000,000,000&lt;br&gt;&amp;#xA0;&amp;#xA0; If anything else, then both Alpha and Omega pay Fred $1,000,000,000,000&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;p&gt;Secondly, you should know that I've already placed all the money in the boxes that I'm going to, and I can't change the contents of the boxes between now and when you do the opening, because Alpha is monitoring everything.&amp;#xA0; I've already made my prediction, using a model I've constructed of your likely reactions based upon your past actions.&lt;/p&gt;
&lt;p&gt;You can use any method you like to choose between the two options, short of contacting another AI, but be warned that if my model predicted that you'll use a method which introduces too large a random element (such as tossing a coin) then, while I may lose my bet with Alpha, I'll certainly have made sure you won't win the $1,000,000.&amp;#xA0; Similarly, if my model predicted that you'd make an outside bet with another human (let's call him George) to alter the value of winning $1,001,000 from me I'd have also taken that into account.&amp;#xA0; (I say &quot;human&quot; by the way, because my bet with Alpha is about my ability to predict humans so if you contact another AI, such as trying to lay a side bet with Alpha to skim some of his winnings, that invalidates not only my game with you, but also my bet with Alpha and there are no winning to skim.)&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;p&gt;And, third and finally, you need to know my track record in previous similar situations.&lt;/p&gt;
&lt;p&gt;I've played this game 3,924 times over the past 100 years (ie since the game started), with humans picked at random from the full variety of the population.&amp;#xA0;&amp;#xA0; The outcomes were:&lt;br&gt;&amp;#xA0;&amp;#xA0; 3000 times players picked option &quot;one box&quot; and walked away with $1,000,000&lt;br&gt;&amp;#xA0;&amp;#xA0; 900&amp;#xA0; times players picked option &quot;two box&quot; and walked away with $1,000&lt;br&gt;&amp;#xA0;&amp;#xA0; 24 times players flipped a coin and or were otherwise too random.&amp;#xA0; Of those players:&lt;br&gt;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0; 12 players picked option &quot;one box&quot; and walked away with $0&lt;br&gt;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0; 12 players picked option &quot;two box&quot; and walked away with $1,000&lt;/p&gt;
&lt;p&gt;Never has anyone ever ended up walking away with $1,001,000 by picking option &quot;two box&quot;.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;p&gt;Omega stops talking.&amp;#xA0;&amp;#xA0; You are standing in a room containing two boxes, labelled &quot;A&quot; and &quot;B&quot;, which are both currently closed.&amp;#xA0; Everything Omega said matches what you expected him to say, as the conditions of the game are always the same and are well known - you've talked with other human players (who confirmed it is legit) and listened to their advice.&amp;#xA0;&amp;#xA0; You've not contacted any AIs, though you have read the published statement from Alpha that also confirms the terms of the bet and details of the monitoring.&amp;#xA0; You've not made any bets with other humans, even though your dad did offer to bet you a bottle of whiskey that you'd be one of them too smart alecky fools who walked away with only $1,000.&amp;#xA0; You responded by pre-committing to keep any winnings you make between you and your banker, and to never let him know.&lt;/p&gt;
&lt;p&gt;The only relevant physical object you've brought along is a radioactive decay based random number generator, that Omega would have been unable to predict the result of in advance, just in case you decide to use it as a factor in your choice.&amp;#xA0; It isn't a coin, giving only a 50% chance of &quot;one box&quot; and a 50% chance of &quot;two box&quot;.&amp;#xA0;&amp;#xA0; You can set arbitrary odds (tell it to generate a random integer between 0 and any positive integer you give it, up to 10 to the power of 100).&amp;#xA0;&amp;#xA0; Omega said in his spiel the phrase &quot;too large a random element&quot; but didn't specify where that boundary was.&lt;/p&gt;
&lt;p&gt;What do you do?&amp;#xA0;&amp;#xA0; Or, given that such a situation doesn't exist yet, and we're talking about a Fred in a possible future, what advice would you give to Fred on how to choose, were he to ever end up in such a situation?&lt;/p&gt;
&lt;p&gt;Pick &quot;one box&quot;?&amp;#xA0;&amp;#xA0; Pick &quot;two box&quot;?&amp;#xA0;&amp;#xA0; Or pick randomly between those two choices and, if so, at what odds?&lt;/p&gt;
&lt;p&gt;And why?&lt;/p&gt;
&lt;hr&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0; Part 1 - stating the problem&lt;br&gt;next &amp;#xA0; &lt;a href=&quot;/r/discussion/lw/fr6/a_solvable_newcomblike_problem_part_2_of_3/&quot;&gt;Part 2 - some mathematics&lt;/a&gt;&lt;br&gt;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0;&amp;#xA0; &lt;a href=&quot;/r/discussion/lw/fr7/a_solvable_newcomblike_problem_part_3_of_3/&quot;&gt;Part 3 - towards a solution&lt;/a&gt;&lt;/p&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/r/discussion/lw/fpd/a_solvable_newcomblike_problem_part_1_of_3/#comments"&gt;10 comments&lt;/a&gt;
</description>
</item>
<item>
<title>Is Omega Impossible? Can we even ask?</title>
<link>http://lesswrong.com/r/discussion/lw/f3x/is_omega_impossible_can_we_even_ask/</link>
<guid isPermaLink="true">http://lesswrong.com/r/discussion/lw/f3x/is_omega_impossible_can_we_even_ask/</guid>
<pubDate>Thu, 25 Oct 2012 01:47:26 +1100</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/mwengler"&gt;mwengler&lt;/a&gt;
&amp;bull;
-8 votes
&amp;bull;
&lt;a href="http://lesswrong.com/r/discussion/lw/f3x/is_omega_impossible_can_we_even_ask/#comments"&gt;51 comments&lt;/a&gt;
&lt;div&gt;&lt;p&gt;EDIT: I see by the karma bombing we can't even ask. &amp;#xA0;Why even call this part of the site &quot;discussion?&quot; &amp;#xA0;&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;p&gt;Some of the classic questions about an omnipotent god include&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;Can god make a square circle?&lt;/li&gt;
&lt;li&gt;Can god create an immovable object? &amp;#xA0;And then move it?&lt;/li&gt;
&lt;/ol&gt;
&lt;div&gt;Saints and philosophers wrestled with these issues back before there was television. &amp;#xA0;My recollection is that people who liked the idea of an omnipotent god would answer &quot;omnipotence does not include the power to do nonsense&quot; where they would generally include contradictions as nonsense. &amp;#xA0;So omnipotence can't square a circle, can't make 2=3, can't make an atom which is simultaneously lead and gold. &amp;#xA0;&lt;/div&gt;
&lt;div&gt;&lt;br&gt;&lt;/div&gt;
&lt;div&gt;But where do the contradictions end and the merely difficult to&amp;#xA0;conceive&amp;#xA0;begin? &amp;#xA0;Can omnipotence make the ratio of the diameter to the circumference of a circle = 3, or 22/7? &amp;#xA0;Can omnipotence make sqrt(2)=1.4 or 2+2=5? &amp;#xA0;While these are not directly self-contradictory statements, they can be used with a variety of simple truths to quickly derive self-contradictory statements. &amp;#xA0;Can we then conclude that &quot;2+2=5&quot; is essentially a contradiction because it is close to a contradiction? &amp;#xA0;Where do we draw the line? &amp;#xA0;&lt;/div&gt;
&lt;div&gt;&lt;br&gt;&lt;/div&gt;
&lt;div&gt;What if were set some problem where we are told to assume that&amp;#xA0;&lt;/div&gt;
&lt;div&gt;&lt;ol&gt;
&lt;li&gt;2+2 = 5&lt;/li&gt;
&lt;li&gt;1+1 = 2&lt;/li&gt;
&lt;li&gt;1+1+1+1+1 = 5&lt;/li&gt;
&lt;/ol&gt;
&lt;div&gt;In solving this set problem, we can quickly derive that 1=0, and use that to prove effectively anything we want to prove. &amp;#xA0;Perhaps not formally, but we have violated the &quot;law of the excluded middle,&quot; that either a statement is true or its negation is. &amp;#xA0;Once you violate that, you can prove ANYTHING using simple laws of inference, because you have propositions that are true and false. &amp;#xA0;&lt;/div&gt;
&lt;/div&gt;
&lt;div&gt;&lt;br&gt;&lt;/div&gt;
&lt;div&gt;What if we set a problem where we are told to assume&lt;/div&gt;
&lt;div&gt;&lt;ol&gt;
&lt;li&gt;Omega is an infallible intelligence that does not lie&lt;/li&gt;
&lt;li&gt;Omega tells you 2+2=5&lt;/li&gt;
&lt;/ol&gt;
&lt;div&gt;Well, we are going to have the same problem as above, we will be able to prove anything.&lt;/div&gt;
&lt;/div&gt;
&lt;div&gt;&lt;br&gt;&lt;/div&gt;
&lt;div&gt;&lt;strong&gt;Newcomb's Problem&lt;/strong&gt;&lt;/div&gt;
&lt;div&gt;&lt;br&gt;&lt;/div&gt;
&lt;div&gt;In Newcomb's box problem, we are told to assume that&lt;/div&gt;
&lt;div&gt;&lt;ol&gt;
&lt;li&gt;Omega is an infallible intelligence&lt;/li&gt;
&lt;li&gt;Omega has predicted correctly whether we will one box or two box. &amp;#xA0;&lt;/li&gt;
&lt;/ol&gt;
&lt;div&gt;From these assumptions we wind up with all sorts of problems of causality and/or free will and/or determinism. &amp;#xA0;&lt;/div&gt;
&lt;/div&gt;
&lt;div&gt;&lt;br&gt;&lt;/div&gt;
&lt;div&gt;What if these statements are not consistent? &amp;#xA0;What if these statements are tantamount to assuming 0=1, or are within a few steps of assuming 0=1? &amp;#xA0;Or something just as contradictory, but harder to identify? &amp;#xA0;&lt;/div&gt;
&lt;div&gt;&lt;br&gt;&lt;/div&gt;
&lt;div&gt;Personally, I can think of LOTS of reasons to doubt that Newcomb's problem is even theoretically possible to set. &amp;#xA0;Beyond that, I can think that the empirical barrier to believing Omega exists in reality would be gigantic, millions of humans have watched magic shows performed by non-superior intelligences where cards we have signed have turned up in a previously sealed envelope or wallet or audience member's pocket. &amp;#xA0;We recognize that these are tricks, that they are not what they appear. &amp;#xA0;&lt;/div&gt;
&lt;div&gt;&lt;br&gt;&lt;/div&gt;
&lt;div&gt;To question Omega is not playing by the mathematician's or philosopher's rules. &amp;#xA0;But when we play by the rules, do we blithely assume 2+2=5 and then wrap ourselves around the logical axle trying to program a friendly AI to one-box? &amp;#xA0;Why is questioning Omega's possibility of existence, or possibility of proof of existence out-of-bounds? &amp;#xA0;&lt;/div&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/r/discussion/lw/f3x/is_omega_impossible_can_we_even_ask/#comments"&gt;51 comments&lt;/a&gt;
</description>
</item>
<item>
<title>Omega lies</title>
<link>http://lesswrong.com/r/discussion/lw/f3u/omega_lies/</link>
<guid isPermaLink="true">http://lesswrong.com/r/discussion/lw/f3u/omega_lies/</guid>
<pubDate>Wed, 24 Oct 2012 21:46:16 +1100</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/Stuart_Armstrong"&gt;Stuart_Armstrong&lt;/a&gt;
&amp;bull;
7 votes
&amp;bull;
&lt;a href="http://lesswrong.com/r/discussion/lw/f3u/omega_lies/#comments"&gt;41 comments&lt;/a&gt;
&lt;div&gt;&lt;p&gt;Just&amp;#xA0;developing&amp;#xA0;my second idea at the end of my last &lt;a href=&quot;/r/discussion/lw/f37/naive_tdt_bayes_nets_and_counterfactual_mugging/&quot;&gt;post&lt;/a&gt;. It seems to me that in the &lt;a href=&quot;http://wiki.lesswrong.com/wiki/Newcomb's_problem&quot;&gt;Newcomb problem&lt;/a&gt; and in the &lt;a href=&quot;http://wiki.lesswrong.com/wiki/Counterfactual_mugging&quot;&gt;counterfactual mugging&lt;/a&gt;, the completely trustworthy Omega lies to a greater or lesser extent.&lt;/p&gt;
&lt;p&gt;This is immediately obvious in scenarios where Omega simulates you in order to predict your reaction. In the Newcomb problem, the simulated you is told &quot;I have already made my decision...&quot;, which is not true at that point, and in the counterfactual mugging, whenever the coin comes up heads, the simulated you is told &quot;the coin came up tails&quot;. And the arguments only go through because these lies are accepted by the simulated you as being true.&lt;/p&gt;
&lt;p&gt;If Omega doesn't simulate you, but uses other methods to gauge your reactions, he isn't lying to you per se. But he is estimating your reaction in the hypothetical situation where you were fed untrue information that you believed to be true. And that you believed to be true, specifically because the source is Omega, and Omega is trustworthy.&lt;/p&gt;
&lt;p&gt;Doesn't really change much to the arguments here, but it's a thought worth bearing in mind.&lt;/p&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/r/discussion/lw/f3u/omega_lies/#comments"&gt;41 comments&lt;/a&gt;
</description>
</item>
<item>
<title>Naive TDT, Bayes nets, and counterfactual mugging</title>
<link>http://lesswrong.com/r/discussion/lw/f37/naive_tdt_bayes_nets_and_counterfactual_mugging/</link>
<guid isPermaLink="true">http://lesswrong.com/r/discussion/lw/f37/naive_tdt_bayes_nets_and_counterfactual_mugging/</guid>
<pubDate>Wed, 24 Oct 2012 02:58:03 +1100</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/Stuart_Armstrong"&gt;Stuart_Armstrong&lt;/a&gt;
&amp;bull;
15 votes
&amp;bull;
&lt;a href="http://lesswrong.com/r/discussion/lw/f37/naive_tdt_bayes_nets_and_counterfactual_mugging/#comments"&gt;39 comments&lt;/a&gt;
&lt;div&gt;&lt;p&gt;I set out to understand precisely why naive TDT (possibly) fails the &lt;a href=&quot;http://wiki.lesswrong.com/wiki/Counterfactual_mugging&quot;&gt;counterfactual mugging&lt;/a&gt; problem. While doing this I ended up drawing a lot of &lt;a href=&quot;http://en.wikipedia.org/wiki/Bayes_net&quot;&gt;Bayes nets&lt;/a&gt;, and seemed to gain some insight; I'll pass these on, in the hopes that they'll be useful. All errors are, of course, my own.&lt;/p&gt;
&lt;h2&gt;The grand old man of decision theory: the Newcomb problem&lt;/h2&gt;
&lt;p&gt;First let's look at the problem that inspired all this research: the &lt;a href=&quot;http://wiki.lesswrong.com/wiki/Newcomb's_problem&quot;&gt;Newcomb problem&lt;/a&gt;. In this problem, a supremely-insightful-and-entirely-honest superbeing called Omega presents two boxes to you, and tells you that you can either choose box A only (&quot;1-box&quot;), or take box A and box B (&quot;2-box&quot;). Box B will always contain $1K (one thousand dollars). Omega has predicted what your decision will be, though, and if you decided to 1-box, he's put $1M (&lt;a href=&quot;http://www.youtube.com/watch?v=l91ISfcuzDw&quot;&gt;one million dollars&lt;/a&gt;) in box A; otherwise he's put nothing in it. The problem can be cast as a Bayes net with the following nodes:&lt;/p&gt;
&lt;p&gt;&lt;img src=&quot;http://images.lesswrong.com/t3_f37_0.png?v=6d4a401f747b45d9d67aba8865a7161f&quot; style=&quot;width: 100%; height: auto;&quot; alt=&quot;&quot;&gt;&lt;/p&gt;
&lt;p&gt;&lt;a id=&quot;more&quot;&gt;&lt;/a&gt;Your decision algorithm (or your your decision process) is the node that&amp;#xA0;determines&amp;#xA0;what you're going to decide. This leads to &quot;Your decision&quot; (1-box or 2-box) and &amp;#x3A9;&amp;#xA0;(puts $1M or zero in box A). These lead to the &quot;Money&quot; node, where you can end up with $1M+1K, $1M, $1K or $0 depending on the outputs of the other nodes. Note that the way the network is set up, you can never have $1M+1K or $0 (since &quot;&amp;#x3A9;&quot;&amp;#xA0;and &quot;Your decision&quot; are not independent). But it is the implied &quot;possibility&quot; of getting those two amounts that causes causal decision theory to 2-box in the Newcomb problem.&lt;/p&gt;
&lt;p&gt;In TDT, as I understand it, &lt;span style=&quot;text-decoration: line-through;&quot;&gt;you sever your decision algorithm node from the history of the universe&lt;/span&gt; (note this is incorrect, as explained &lt;a href=&quot;/lw/f37/naive_tdt_bayes_nets_and_counterfactual_mugging/7oli&quot;&gt;here&lt;/a&gt;. In fact you condition on the start of your program, and &lt;em&gt;screen out&lt;/em&gt; the history of the universe), and then pick the action that maximises our utility.&lt;/p&gt;
&lt;p&gt;But note that the graph is needlessly complicated: &quot;Your decision&quot; and &quot;&amp;#x3A9;&quot; are both superfluous nodes, that simply pass on their inputs to their outputs. Ignoring the &quot;History of the Universe&quot;, we can reduce the net to a more compact (but less illuminating) form:&lt;/p&gt;
&lt;p style=&quot;text-align:center&quot;&gt;&lt;img src=&quot;http://images.lesswrong.com/t3_f37_2.png?v=a3928a1b01c5a07613fcae2cb77e7c17&quot; alt=&quot;&quot; height=&quot;450&quot; width=&quot;221&quot;&gt;&lt;/p&gt;
&lt;p&gt;Here 1-box leads to $1M and 2-box leads to $1K. In this simplified version, the decision is obvious - maybe too obvious. The decision was&amp;#xA0;entirely&amp;#xA0;determined by the choice of how to lay out the Bayes net, and a causal decision theorist would disagree that the original &quot;screened out&quot; Bayes net was a valid encoding of the Newcomb problem.&lt;/p&gt;
&lt;h2&gt;The counterfactual mugging&lt;/h2&gt;
&lt;p&gt;In the counterfactual mugging, Omega is back, this time explaining that he tossed a coin. If the coin came up tails, he would have asked you to give him $1K, giving nothing in return. If the coin came up heads, he would have given you $1M - but only if when you would have given him the $1K in the tails world. That last fact he would have known by predicting your decision. Now Omega approaches you, telling you the coin was tails - what should you do? Here is a Bayes net with this information:&lt;/p&gt;
&lt;p&gt;&lt;img src=&quot;http://images.lesswrong.com/t3_f37_3.png?v=1b2cce92c09f3f212d9c3219186c00ad&quot; style=&quot;width: 100%; height: auto;&quot; alt=&quot;&quot;&gt;&lt;/p&gt;
&lt;p&gt;I've removed the &quot;History of the Universe&quot; node, as we are screening it off anyway. Here &quot;Simulated decision&quot; and &quot;Your decision&quot; will output the same decision on the same input.&amp;#xA0;&amp;#x3A9; will behave the way he said, based on your simulated decision given tails. &quot;Coin&quot; will output heads or tails with 50% probability, and &quot;Tails&quot; simply outputs tails, for use in&amp;#xA0;&amp;#x3A9;'s prediction.&lt;/p&gt;
&lt;p&gt;Again, this graph is very elaborate, codifying all the problem's intricacies. But most of the nodes are superfluous for our decision, and the graph can be reduced to:&lt;/p&gt;
&lt;p&gt;&lt;img src=&quot;http://images.lesswrong.com/t3_f37_4.png&quot; style=&quot;width: 100%; height: auto;&quot; alt=&quot;&quot;&gt;&lt;/p&gt;
&lt;p&gt;&quot;Coin&quot; outputs &quot;heads&quot; or &quot;tails&quot; and &quot;Your decision algorithm&quot; outputs &quot;Give $1K on tails&quot; or &quot;Don't give $1K on tails&quot;. Money is $1M if it receives &quot;heads&quot; and &quot;Give $1K on tails&quot;, -$1K if it receives &quot;tails&quot; and &quot;Give $1K on tails&quot;, and zero if receives &quot;Don't give $1K on tails&quot; (independent of the coin results).&lt;/p&gt;
&lt;p&gt;If our utility does not go down &lt;a href=&quot;/lw/e45/risk_aversion_does_not_explain_peoples_betting/&quot;&gt;too sharply in money&lt;/a&gt;,&amp;#xA0;we should choose &quot;Give $1K on tails&quot;, as a 50-50 bet on willing $1M and losing $1K is better than getting nothing with certainty. So precommitting to giving Omega $1K when he asks, leads to the better outcome.&lt;/p&gt;
&lt;p&gt;But now imagine that we are in the situation above: Omega has come to us and explained that yes, the coin has come up tails. The Bayes net now becomes:&lt;/p&gt;
&lt;p&gt;&lt;img src=&quot;http://images.lesswrong.com/t3_f37_5.png?v=0d4bac5ca4b909a4f21bc18e9fe48804&quot; style=&quot;width: 100%; height: auto;&quot; alt=&quot;&quot;&gt;&lt;/p&gt;
&lt;p&gt;In this case, the course is clear: &quot;Give $1K on tails&quot; does nothing but lose us $1K. So we should decide not to - and nowhere in this causal graph can we see any problem with that course of action.&lt;/p&gt;
&lt;p&gt;So it seems that naive TDT has an inconsistency problem. And these graphs don't seem to fully encode the actual problem properly (ie that the action &quot;Give $1K on tails&quot; corresponds to situations where we truly believe that tails came up).&lt;/p&gt;
&lt;h2&gt;Thoughts on the problem&lt;/h2&gt;
&lt;p&gt;Some thoughts that&amp;#xA0;occurred&amp;#xA0;when formalising this problem:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;The problem really is with updating on information, vindicating the instincts behind &lt;a href=&quot;/lw/15m/towards_a_new_decision_theory/&quot;&gt;updateless decision theory&lt;/a&gt;. The way you would have to behave, conditional on seeing new information, is different from how you want to behave, after seeing that new information.&lt;/li&gt;
&lt;li&gt;Naive TDT reaches different conclusions depending on whether Omega simulates you or predicts you. If you are unsure whether you are being simulated or not (but still care about the wealth of the non-simulated version), then TDT acts differently on updates. Being told &quot;tails&quot; doesn't actually confirm that the coin was tails: you might be the simulated version, being tested by Omega. Note that in this scenario, the simulated you is being lied to by the simulated Omega (the &quot;real&quot; coin need not have been tails), which might put the problem in a different perspective.&lt;/li&gt;
&lt;li&gt;The tools of TDT (Bayes nets cut at certain connection) feel inadequate. It's tricky to even express the paradox properly in this language, and even more tricky to know what to do about it. A possible problem seems to be that we don't have a way of expressing our own knowledge about the model, within the model - hence &quot;tails&quot; ends up being a fact about the universe, no a fact about our knowledge at the time. Maybe we need to make our map explicit &lt;a href=&quot;/lw/erp/skill_the_map_is_not_the_territory/&quot;&gt;in the territory&lt;/a&gt;, and get Bayes nets that go something like these:&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;p&gt;&lt;img src=&quot;http://images.lesswrong.com/t3_f37_6.png?v=8ce3f677639edfacca7fa9cfc57ea83d&quot; style=&quot;width: 100%; height: auto;&quot; alt=&quot;&quot;&gt;&lt;/p&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/r/discussion/lw/f37/naive_tdt_bayes_nets_and_counterfactual_mugging/#comments"&gt;39 comments&lt;/a&gt;
</description>
</item>
<item>
<title>Can anyone explain to me why CDT two-boxes?</title>
<link>http://lesswrong.com/r/discussion/lw/de6/can_anyone_explain_to_me_why_cdt_twoboxes/</link>
<guid isPermaLink="true">http://lesswrong.com/r/discussion/lw/de6/can_anyone_explain_to_me_why_cdt_twoboxes/</guid>
<pubDate>Mon, 02 Jul 2012 16:06:36 +1000</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/Andreas_Giger"&gt;Andreas_Giger&lt;/a&gt;
&amp;bull;
-12 votes
&amp;bull;
&lt;a href="http://lesswrong.com/r/discussion/lw/de6/can_anyone_explain_to_me_why_cdt_twoboxes/#comments"&gt;136 comments&lt;/a&gt;
&lt;div&gt;&lt;p&gt;I have read lots of LW posts on this topic, and everyone seems to take this for granted without giving a proper explanation. So if anyone could explain this to me, I would appreciate that.&lt;/p&gt;
&lt;p&gt;This is a simple question that is in need of a simple answer. Please don't link to pages and pages of theorycrafting.&amp;#xA0;Thank you.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;p&gt;Edit: Since posting this, I have come to the conclusion that CDT doesn't actually play Newcomb. Here's a disagreement with that statement:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;If you write up a CDT algorithm and then put it into a Newcomb's problem simulator, it will do something. It's playing the game; maybe not well, but it's playing.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;And here's my response:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;The thing is, an actual Newcomb simulator can't possibly exist because Omega doesn't exist. There are tons of workarounds, like using coin tosses as a substitution for Omega and ignoring the results whenever the coin was wrong, but that is something fundamentally different from Newcomb.&lt;/p&gt;
&lt;p&gt;You can only simulate Newcomb in theory, and it is perfectly possible to just not play a theoretical game, if you reject the theory it is based on. In theoretical Newcomb, CDT doesn't care about the rule of Omega being right, so CDT does not play Newcomb.&lt;/p&gt;
&lt;p&gt;If you're trying to simulate Newcomb in reality by substituting Omega with someone who has only empirically been proven right, you substitute Newcomb with a problem that consists of little more than simple calculation of priors and payoffs, and that's hardly the point here.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;p&gt;Edit 2: Clarification regarding backwards causality, which seems to confuse people:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Newcomb assumes that Omega is omniscient, which more importantly means that the decision you make right now determines whether Omega has put money in the box or not. Obviously this is backwards causality, and therefore not possible in real life, which is why Nozick doesn't spend too much ink on this.&lt;/p&gt;
&lt;p&gt;But if you rule out the possibility of backwards causality, Omega can only make his prediction of your decision based on all your actions up to the point where it has to decide whether to put money in the box or not. In that case, if you take two people who have so far always acted (decided) identical, but one will one-box while the other one will two-box, Omega &lt;em&gt;cannot&lt;/em&gt; make&amp;#xA0;different predictions for them. And no matter what prediction Omega makes, you don't want to be the one who one-boxes.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;p&gt;Edit 3: Further clarification on the possible problems that could be considered Newcomb:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;There's four types of Newcomb problems:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;Omniscient Omega (backwards causality) - CDT rejects this case, which cannot exist in reality.&lt;/li&gt;
&lt;li&gt;Fallible Omega, but still backwards causality - CDT rejects this case, which cannot exist in reality.&lt;/li&gt;
&lt;li&gt;Infallible Omega, no backwards causality - CDT correctly two-boxes. To improve payouts, CDT would have to have decided differently &lt;em&gt;in the past&lt;/em&gt;, which is not decision theory anymore.&lt;/li&gt;
&lt;li&gt;Fallible Omega, no backwards causality -&amp;#xA0;CDT correctly two-boxes. To improve payouts, CDT would have to have decided differently &lt;em&gt;in the past&lt;/em&gt;, which is not decision theory anymore.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;That's all there is to it.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;
&lt;p&gt;Edit 4: Excerpt from&amp;#xA0;Nozick's &quot;Newcomb's Problem and Two Principles of Choice&quot;:&lt;/p&gt;
&lt;p&gt;&lt;span style=&quot;background-color: #ffffcc; text-align: justify; line-height: 19px;&quot;&gt;&lt;span style=&quot;font-family: Arial, Helvetica, sans-serif;&quot;&gt; &lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Now, at last, to return to Newcomb's example of the predictor. If one believes, for this case, that there is backwards causality, that your choice causes the money to be there or not, that it causes him to have made the prediction that he made, then there is no problem. One takes only what is in the second box. Or if one believes that the way the predictor works is by looking into the future; he, in some sense, sees what you are doing, and hence is no more likely to be wrong about what you do than someone else who is standing there at the time and watching you, and would normally see you, say, open only one box, then there is no problem. You take only what is in the second box. But suppose we establish or take as given that there is no backwards causality, that what you actually decide to do does not affect what he did in the past, that what you actually decide to do is not part of the explanation of why he made the prediction he made. So let us agree that the predictor works as follows: He observes you sometime before you are faced with the choice, examines you with complicated apparatus, etc., and then uses his theory to predict on the basis of this state you were in, what choice you would make later when faced with the choice. Your deciding to do as you do is not part of the explanation of why he makes the prediction he does, though your being in a certain state earlier, is part of the explanation of why he makes the prediction he does, and why you decide as you do.&lt;/p&gt;
&lt;p&gt;I believe that one should take what is in both boxes. I fear that the considerations I have adduced thus far will not convince those proponents of taking only what is in the second box. Furthermore I suspect that an adequate solution to this problem will go much deeper than I have yet gone or shall go in this paper. So I want to pose one question. I assume that it is clear that in the vaccine example, the person should not be convinced by the probability argument, and should choose the dominant action. I assume also that it is clear that in the case of the two brothers, the brother should not be convinced by the probability argument offered. The question I should like to put to proponents of taking only what is in the second box in Newcomb's example (and hence not performing the dominant action) is: what is the difference between Newcomb's example and the other two examples which make the difference between not following the dominance principle, and following it?&lt;/p&gt;
&lt;/blockquote&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/r/discussion/lw/de6/can_anyone_explain_to_me_why_cdt_twoboxes/#comments"&gt;136 comments&lt;/a&gt;
</description>
</item>
<item>
<title>Extremely Counterfactual Mugging or: the gist of Transparent Newcomb</title>
<link>http://lesswrong.com/r/discussion/lw/45s/extremely_counterfactual_mugging_or_the_gist_of/</link>
<guid isPermaLink="true">http://lesswrong.com/r/discussion/lw/45s/extremely_counterfactual_mugging_or_the_gist_of/</guid>
<pubDate>Thu, 10 Feb 2011 02:20:54 +1100</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/Bongo"&gt;Bongo&lt;/a&gt;
&amp;bull;
4 votes
&amp;bull;
&lt;a href="http://lesswrong.com/r/discussion/lw/45s/extremely_counterfactual_mugging_or_the_gist_of/#comments"&gt;79 comments&lt;/a&gt;
&lt;div&gt;&lt;blockquote&gt;
&lt;p&gt;Omega will either award you $1000 or ask you to pay him $100. He will award you $1000 if he predicts you would pay him if he asked. He will ask you to pay him $100 if he predicts you wouldn't pay him if he asked.&amp;#xA0;&lt;/p&gt;
&lt;p&gt;Omega asks you to pay him $100. Do you pay?&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;This problem is roughly isomorphic to the branch of Transparent Newcomb (&lt;a href=&quot;/lw/43t/youre_in_newcombs_box/3h55&quot;&gt;version 1&lt;/a&gt;, &lt;a href=&quot;/lw/42r/punishing_future_crimes/3fiu&quot;&gt;version 2&lt;/a&gt;) where box B is empty, but it's simpler.&lt;/p&gt;
&lt;p&gt;Here's a diagram:&lt;/p&gt;
&lt;p&gt;&lt;img src=&quot;http://i55.tinypic.com/k9i8vq.png&quot; alt=&quot;&quot; border=&quot;0&quot;&gt;&lt;/p&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/r/discussion/lw/45s/extremely_counterfactual_mugging_or_the_gist_of/#comments"&gt;79 comments&lt;/a&gt;
</description>
</item>
<item>
<title>Omega can be replaced by amnesia</title>
<link>http://lesswrong.com/r/discussion/lw/40o/omega_can_be_replaced_by_amnesia/</link>
<guid isPermaLink="true">http://lesswrong.com/r/discussion/lw/40o/omega_can_be_replaced_by_amnesia/</guid>
<pubDate>Wed, 26 Jan 2011 23:31:04 +1100</pubDate>
<description>
Submitted by &lt;a href="http://lesswrong.com/user/Bongo"&gt;Bongo&lt;/a&gt;
&amp;bull;
15 votes
&amp;bull;
&lt;a href="http://lesswrong.com/r/discussion/lw/40o/omega_can_be_replaced_by_amnesia/#comments"&gt;43 comments&lt;/a&gt;
&lt;div&gt;&lt;blockquote&gt;
&lt;p&gt;Let's play a game. Two times, I will give you an amnesia drug and let you enter a room with two boxes inside. Because of the drug, you won't know whether this is the first time you've entered the room. On the first time, both boxes will be empty. On the second time, box A contains $1000, and Box B contains $1,000,000 iff this is the second time and you took only box B the first time. You're in the room, do take both boxes or only box B?&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;This is equivalent to Newcomb's Problem in the sense that any strategy does equally well on both, where by &quot;strategy&quot; I mean a mapping from info to (probability distributions over) actions.&lt;/p&gt;
&lt;p&gt;I suspect that any problem with Omega can be transformed into an equivalent problem with amnesia instead of Omega.&lt;/p&gt;
&lt;p&gt;Does CDT return the winning answer in such transformed problems?&lt;/p&gt;
&lt;p&gt;Discuss.&lt;/p&gt;
&lt;p&gt;&amp;#xA0;&lt;/p&gt;&lt;/div&gt;
&lt;a href="http://lesswrong.com/r/discussion/lw/40o/omega_can_be_replaced_by_amnesia/#comments"&gt;43 comments&lt;/a&gt;
</description>
</item>
</channel>
</rss>