You're looking at Less Wrong's discussion board. This includes all posts, including those that haven't been promoted to the front page yet. For more information, see About Less Wrong.

Vaniver comments on [Discussion] The Kelly criterion and consequences for decision making under uncertainty - Less Wrong Discussion

5 Post author: Metus 06 January 2013 02:14AM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (15)

You are viewing a single comment's thread. Show more comments above.

Comment author: Vaniver 07 January 2013 10:45:51PM 1 point [-]

Yes, potential actions are discrete and outcomes are arbitrarily distributed.

It seems like this paper or this paper might be relevant to your interests. (PM me your email if you don't have access to them.)

No, I mean that the Kelly criterion says that allocation to a bet should be proportional to expected value over payoff. If I hold expected value constant and integrate over payoff the integral diverges. Intuitively I would expect to see a finite integral, reflecting that Kelly restricts how much risk I should be willing to take.

Kelly tells you how much risk you should be willing to take for a particular b; integrating over b is not meaningful, since it's integrating over multiple bets. (Note that f is E/b, if E is the expected value, and 1/x diverges. Since p is capped by 1, then E is capped by b, and the maximum risk you should take is betting everything, if p=1 i.e. it's a sure thing.)

If you put a probability p(b) on any particular payout, you might get something meaningful out of integrating p(b)E/b, but it's not clear to me that's the right way to do things.

Interesting. I should try this later.

It won't work out very prettily, but it is instructive. Basically, that tells you how much your bet should have differed from Delta, given what happened. You can then figure out what would have been optimal for that sequence, then do a weighted sum over sequences. (If your utility function isn't scale invariant, and only log is, then you need information on how long the game runs; if you're allowed to change the fraction of your wealth that you put up each time, then it's an entirely different problem.)