Warrigal comments on Decision theory: Why we need to reduce “could”, “would”, “should” - Less Wrong
Comments (46)
To be exact, the agent must not know "which action maximizes utility", not "which action it will do". If it does whichever action it knows it'll do, it can have Löb-type short circuits where you say "I know you'll shoot yourself" and it says "OK thanks!" and shoots itself.
http://lesswrong.com/lw/t8/you_provably_cant_trust_yourself/
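A toy sketch of the short circuit described above, in Python. Everything here is an assumption made for illustration: the action names, the utility numbers, and both agent functions are hypothetical, and the code does not model Löb's theorem or any proof search. It only shows the self-fulfilling structure: an agent that acts on a claim about its own action will do whatever it is told it will do, while an agent that chooses by utility ignores the claim.

```python
# Hypothetical utilities for two actions (illustrative numbers only).
UTILITY = {"shoot_self": -100, "do_nothing": 0}

def self_fulfilling_agent(believed_action):
    """Does whichever action it 'knows' it will do."""
    return believed_action

def maximizing_agent(believed_action):
    """Ignores the prediction about itself and picks the highest-utility action."""
    return max(UTILITY, key=UTILITY.get)

# Someone tells each agent: "I know you'll shoot yourself."
told = "shoot_self"
short_circuit = self_fulfilling_agent(told)  # accepts the claim and acts on it
robust = maximizing_agent(told)              # evaluates utilities instead
```

The first agent's choice never consults `UTILITY` at all, which is the sense in which the prediction short-circuits the decision.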
Here, let me go back in time and become timtyler and retroactively write the following instead, thereby averting this whole discussion: