PhilGoetz comments on Values vs. parameters - Less Wrong

9 Post author: PhilGoetz 17 May 2011 05:53AM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (65)

You are viewing a single comment's thread. Show more comments above.

Comment author: PhilGoetz 17 May 2011 06:10:23PM *  0 points [-]

Besides what I said in this reply to Eliezer,

I'll have to think about that. It's a little too deep for me to process right now.

discount via absolute time as opposed to time relative to "now".

I don't think that will work. Can you explain it in more detail? Distorting time won't prevent reversals of preference just because it makes some plotted curves match.

Comment author: pengvado 21 May 2011 03:19:10AM *  1 point [-]

Distorting time won't prevent reversals of preference just because it makes some plotted curves match.

If your discounting factor is f(t-now) for some function f, then f needs to be translation invariant (modulo positive affine scaling), on pain of preference reversals. The requirement of translation invariance is directly due to the fact that f gets translated by the varying values of "now". For two possible events x1 and x2, the agent compares U(x1)*f(t1-now) vs U(x2)*f(t2-now), where U is the non-discounted utility function, and if the result of that comparison depends on the value of "now" you have problems.

However, if your discounting factor is f(t) simpliciter, then f isn't translated and thus doesn't need to be translation invariant. No single event is ever valued according to multiple different outputs of f. The agent will derive the same preference between any two events regardless of when it computes the decision.