So far as I understand the situation, the SIAI is working on decision theory because they want to be able to create an AI that can be guaranteed not to modify its own decision function.
There are circumstances where CDT agents will self-modify to use a different decision theory (e.g., Parfit's Hitchhiker). In that problem, a CDT agent that would refuse to pay once rescued is never rescued in the first place, so it has an incentive to rewrite its own decision procedure in advance. If this happens (they believe), it will present a risk of goal distortion, which is unFriendly.
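To make the incentive concrete, here's a minimal sketch in Python. The payoff numbers are illustrative assumptions of mine, not anyone's canonical formulation; it just shows why an agent facing a perfect predictor would prefer not to be running pure CDT at the moment of choice:

```python
# Toy model of Parfit's Hitchhiker. Payoffs are made-up numbers,
# chosen only to make the structure of the problem visible.

PAY_COST = 100        # cost of paying the driver once safely in town
DESERT = -1_000_000   # utility of being left to die in the desert

def ex_post_choice_cdt():
    """Once rescued, a CDT agent weighs only the causal consequences
    of its current choice: refusing costs 0, paying costs 100."""
    return "refuse" if 0 > -PAY_COST else "pay"

def ex_post_choice_precommitted():
    """An agent that has rewritten its decision procedure to honor
    the deal pays, even though refusing is causally cheaper ex post."""
    return "pay"

def outcome(decision_procedure):
    """A perfect predictor rescues the agent iff it predicts the
    agent will pay after being rescued."""
    will_pay = decision_procedure() == "pay"
    if will_pay:
        return -PAY_COST   # rescued, then pays as predicted
    return DESERT          # predictor foresees refusal; no rescue

print("CDT agent:          ", outcome(ex_post_choice_cdt))           # -1000000
print("Self-modified agent:", outcome(ex_post_choice_precommitted))  # -100
```

The CDT agent ends up worse off precisely because of the decision theory it runs at the moment of choice, and that gap is what creates the pressure to self-modify.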
Put another way: the objective isn't to get two AIs to cooperate; it's to make it so that an AI won't need to alter its decision function in order to cooperate with another AI (or any other theoretical bargaining partner).
Does that make any sense? As a disclaimer, I definitely do not understand the issues here as well as the SIAI folks working on them.
There are circumstances where CDT agents will self-modify to use a different decision theory (e.g. Parfit's Hitchhiker).
Does that make any sense?
Not to me. But a reference might repair that deficiency on my part.