Lightwave comments on Desirable Dispositions and Rational Actions - Less Wrong

13 Post author: RichardChappell 17 August 2010 03:20AM


Comments (180)


Comment author: Lightwave 18 August 2010 09:21:46AM 0 points

My take on this is the following: It's easier to see what is meant by disposition if you look at it in terms of AI. Replace the human with an AI, replace "disposition" with "source code", and replace "change your disposition to do some action X" with "rewrite your source code so that it does action X". Of course, it would still want to account for the probability of a glitch, as someone else already suggested.

If an AI that is running CDT expects to encounter a Newcomb-like problem, it would be rational for it to self-modify (in advance) to use a decision theory that one-boxes (i.e. the AI will change its disposition).
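To make the "self-modify in advance" point concrete, here is a rough Python sketch comparing the expected payoff of the two dispositions. Everything specific in it is an assumption for illustration, not something stated in this thread: the payoffs are the usual textbook Newcomb values, and `accuracy` and `glitch` are hypothetical parameters (the latter standing in for the glitch probability mentioned above), treated as independent of each other.

```python
# Illustrative sketch only: the payoffs and parameters below are assumed
# (standard textbook Newcomb values), not taken from the discussion.
BIG_PRIZE = 1_000_000   # opaque box contents if one-boxing was predicted
SMALL_PRIZE = 1_000     # transparent box contents, always available


def expected_payoff(one_boxer: bool, accuracy: float, glitch: float = 0.0) -> float:
    """Expected payoff of adopting a disposition before the prediction is made.

    accuracy: probability the predictor reads the disposition correctly.
    glitch:   probability the agent later acts against its own disposition
              (hypothetical parameter, assumed independent of the prediction).
    """
    # Probability that the predictor has filled the opaque box.
    p_full = accuracy if one_boxer else 1.0 - accuracy
    # Probability that the agent actually ends up taking both boxes.
    p_two_box = glitch if one_boxer else 1.0 - glitch
    # Taking both boxes adds SMALL_PRIZE regardless of the opaque box.
    return p_full * BIG_PRIZE + p_two_box * SMALL_PRIZE


if __name__ == "__main__":
    for label, flag in (("one-box", True), ("two-box", False)):
        print(label, expected_payoff(flag, accuracy=0.99, glitch=0.01))
```

Under these assumed numbers the one-boxing disposition comes out far ahead, which is all the self-modification argument needs. Note that the comparison is between dispositions adopted in advance, not between acts once the boxes are already filled.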

Comment author: RichardChappell 19 August 2010 12:40:44AM 0 points

Likewise, an AI surrounded by threat-fulfillers would rationally self-modify to become a threat-ignorer. (The debate is not about whether these are desirable dispositions to acquire -- that's common ground.) Do you think it follows from this that the act of ignoring a doomsday threat is also rational?