Stuart_Armstrong comments on Continually-adjusted discounted preferences - Less Wrong Discussion

3 Post author: Stuart_Armstrong 06 March 2015 04:03PM

Comments (15)

Comment author: Stuart_Armstrong 09 March 2015 11:54:05AM 0 points

I'd like to hear more about how you think discounting should work in a rational agent, on more conventional topics than time travel.

I don't think discounting should be used at all. Instead, rational facts about the past and future (e.g. expected future wealth) should be used to get discount-like effects.
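One way to see how a fact like expected future wealth can produce a discount-like effect: if wealth is expected to grow and marginal utility diminishes, an extra unit of resources is worth less later than now, with no explicit discount factor in the utility function. A minimal sketch, assuming log utility of wealth and a constant growth rate (both are illustrative assumptions, and `implied_discount_factor` is a hypothetical helper, not something from the post):

```python
def implied_discount_factor(wealth_now: float, growth_rate: float, years: int) -> float:
    """Relative value of one extra unit of wealth received `years` from now.

    Assumes u(w) = log(w), so marginal utility is 1/w (an illustrative
    assumption), and that wealth grows at a constant `growth_rate` per year
    (also an assumption). No explicit time discounting is applied.
    """
    if wealth_now <= 0 or growth_rate <= -1:
        raise ValueError("wealth must be positive and growth_rate above -1")
    future_wealth = wealth_now * (1.0 + growth_rate) ** years
    marginal_now = 1.0 / wealth_now
    marginal_future = 1.0 / future_wealth
    return marginal_future / marginal_now


if __name__ == "__main__":
    # With 3% expected growth, a unit received in 10 years is valued less than a unit today.
    print(implied_discount_factor(100.0, 0.03, 10))
```

With 3% growth over 10 years the ratio comes out at about 0.74, and with zero expected growth it is exactly 1: the "discount" here is entirely a consequence of the wealth forecast.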

However, there are certain agent designs (AIXI, unbounded utility maximisers, etc.) that might need discounting as a practical tool. In those cases, adding this hack could allow them to discount while reducing the negative effects.

Utility can't be stored, and gets re-evaluated for each decision.

Depends. Utility that sums (e.g. total hedonistic utilitarianism, a reward-agent made into a utility maximiser, etc.) does accumulate. Some other variants have utility that accumulates non-linearly. Many non-accumulating utilities might have an accumulating component.
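The distinction can be sketched in a few lines. The function names, the per-step reward list and the 50/50 weighting below are all hypothetical choices for illustration, not definitions from the post:

```python
from typing import Sequence


def summed_utility(rewards: Sequence[float]) -> float:
    """Accumulating utility: every past reward keeps contributing to the total."""
    return sum(rewards)


def final_state_utility(rewards: Sequence[float]) -> float:
    """Non-accumulating utility: only the most recent step matters (illustrative)."""
    return rewards[-1]


def mixed_utility(rewards: Sequence[float], weight: float = 0.5) -> float:
    """A non-accumulating utility with an accumulating component (hypothetical weighting)."""
    return weight * summed_utility(rewards) + (1.0 - weight) * final_state_utility(rewards)


if __name__ == "__main__":
    history = [1.0, 2.0, 3.0]
    extended = history + [0.0]  # one more step that yields nothing
    for utility in (summed_utility, final_state_utility, mixed_utility):
        print(utility.__name__, utility(history), utility(extended))
```

After the extra zero-reward step, the summed utility keeps what it had already accumulated, the final-state utility drops to zero, and the mixed version retains only its accumulating part.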