Contrived infinite-torture scenarios: July 2010

PlaidX

This is our monthly thread for collecting arbitrarily contrived scenarios in which somebody gets tortured for 3^^^^^3 years, or an infinite number of people experience an infinite amount of sorrow, or a baby gets eaten by a shark, etc. and which might be handy to link to in one of our discussions. As everyone knows, this is the most rational and non-obnoxious way to think about incentives and disincentives.

Please post all infinite-torture scenarios separately, so that they can be voted up/down separately. (If they are strongly related, reply to your own comments. If strongly ordered, then go ahead and post them together.)
No more than 5 infinite-torture scenarios per person per monthly thread, please.

Please post all infinite-torture scenarios separately, so that they can be voted up/down separately. (If they are strongly related, reply to your own comments. If strongly ordered, then go ahead and post them together.)
No more than 5 infinite-torture scenarios per person per monthly thread, please.

I don't think there is any reasonable utility function that is consistent with the actions the AI is claiming to have done.

What is your definition of 'reasonable' utility functions, which doesn't reference any other utility functions (such as our own)?

To me a reasonable utility function has to have a degree of self-consistency. A reasonable utility function wouldn't value both doing and undoing the same action simultaneously.

If an entity is using a utility function to determine its actions, then for every action the entity can perform, its utility function must be able to determine a utility value which then determines whether the entity does the action or not. If the utility function does not return a value, then the entity still has to act or not act, so the entity still has a utility function fo... (read more)

31

Contrived infinite-torture scenarios: July 2010

31

31

31

Contrived infinite-torture scenarios: July 2010

31

31