This is the way I would do it, also taking into account EY's point of not hiding away new comments:
Assume each comment has an ‘upvote rate’ U, such that the probability that a comment at the age of t has u upvotes is a Poisson distribution with parameter Ut,
- P(u|U, t) = (Ut)^u exp(−Ut)/u!
and similarly for downvotes,
- P(d|D, t) = (Dt)^d exp(−Dt)/d!
If the prior probability distribution for U and D is P(U, D), their posterior probability distribution will be
- P(U, D|u, d, t) = D^d U^u exp(−(D + U)t) P(U, D)/(a normalization constant).
Then, you sort comments according to a functional of the posterior pdf of U and D; in analogy with expected utility maximization you could use the posterior expectation value of some function f(U, D), but other choices would be possible. (This reduces to your proposal when you take f(U, D) = U/(U + D).)
Of course this model isn't entirely realistic because U and D ought to vary with time (according to timezone, how old the thread is and whether it's currently linked to from the main page, etc.), but the main effect of disregarding this (pretending that a comment has the same probability of getting upvoted in the 10,000th hour after its publication as in the 1st hour) would be to cause very recent comments to be sorted higher, which IMO is a Good Thing anyway.
Subscribe to RSS Feed
= f037147d6e6c911a85753b9abdedda8d)
Added your acronym to main post. Please do not doci. Adboc instead.
I don't suppose it's possible to view the version history of the post, so can you state for posterity what "DOCI" used to stand for?