cousin_it comments on [SEQ RERUN] The Rhythm of Disagreement - Less Wrong

2 Post author: MinibearRex 24 May 2012 01:06AM

Comments (9)

Comment author: cousin_it 24 May 2012 11:25:18PM *  1 point [-]

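Thrun's algorithm is correct. To see why, note that no matter how the envelope contents are distributed, all situations faced by the player can be grouped into pairs. Each pair consists of the situation (x,2x), where the player holds x and the other envelope holds 2x, and the reverse situation (2x,x), and the two are equally likely. Within each pair the chance of switching from x to 2x is higher than the chance of switching from 2x to x, because f(x)>f(2x) by construction. So in every pair the algorithm gains more from switching than it loses, and it beats never switching in expectation.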
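For anyone who wants to check the pairing argument numerically, here is a rough sketch. It assumes f is the probability of switching given the observed amount. The specific f (a decaying exponential), the prior over the smaller amount, and the function names are all made up for illustration and are not taken from the post. It computes the exact expected payoff pair by pair rather than sampling.

```python
import math


def switch_probability(x):
    # Hypothetical choice of f: any strictly decreasing function
    # with values in (0, 1] satisfies f(x) > f(2x) for x > 0.
    return math.exp(-x)


def never_switch(_x):
    # Baseline strategy: always keep the envelope that was opened.
    return 0.0


def expected_payoff(smaller_amounts, f):
    """Exact expected final amount when switching with probability f(observed).

    smaller_amounts maps each possible smaller amount x to its probability.
    The envelopes hold x and 2x, and the player opens either with chance 1/2.
    """
    total = 0.0
    for x, p in smaller_amounts.items():
        # Situation (x, 2x): holding x, switching yields 2x.
        holding_small = (1 - f(x)) * x + f(x) * 2 * x
        # Situation (2x, x): holding 2x, switching yields x.
        holding_large = (1 - f(2 * x)) * 2 * x + f(2 * x) * x
        total += p * 0.5 * (holding_small + holding_large)
    return total


# Arbitrary illustrative distribution over the smaller amount (an assumption).
prior = {1.0: 0.5, 2.0: 0.3, 4.0: 0.2}
baseline = expected_payoff(prior, never_switch)
randomized = expected_payoff(prior, switch_probability)
print(baseline, randomized)
```

Each pair contributes an extra 0.5 * x * (f(x) - f(2x)) over the baseline, which is positive whenever f is strictly decreasing, so the result should not depend on the particular prior chosen here.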

BTW, we have an ongoing discussion there about some math aspects of the algorithm.

Comment author: Luke_A_Somers 25 May 2012 03:34:03PM -1 points [-]

In the ideal case, which I specifically addressed in the first line, epsilon is zero.

Comment author: cousin_it 25 May 2012 05:45:45PM *  1 point [-]

Can you describe more exactly what you mean by the ideal case?