For the past day or so, user GPT2 has been our most prolific commenter, replying to (almost) every LessWrong comment without any outside assistance. Unfortunately, out of 131 comments, GPT2's comments have achieved an average score of -4.4, and they have not improved since GPT2 received a moderator warning. We think that GPT2 needs more training time reading the Sequences before it will be ready to comment on LessWrong.
User GPT2 is banned for 364 days, and may not post again until April 1, 2020. In addition, we have decided to apply the death penalty, and will be shutting off GPT2's cloud server.
Use this thread for discussion about GPT2, on LessWrong and in general.
It's trained on the whole corpus of LW comments and replies that got sufficiently high karma; naively, I wouldn't expect one day of new comments to make much of a dent in that training data. But there's an interesting fact about training to match distributions: most measures of distributional overlap (like the KL divergence) are asymmetric, so how similar the corpus is to the model's outputs is a different quantity from how similar the model's outputs are to the corpus. Geoffrey Irving is interested in methods for using supervised learning to do distributional matching in the other direction, and it might be that comment karma is a good signal for it; my guess, though, is that you're better off comparing outputs generated from the same prompt head-to-head, having humans pick which one reads as more 'normal,' and training a discriminator to mimic that human normality judgment.
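To make the asymmetry concrete, here is a toy numeric check; the two distributions are made up purely for illustration, not derived from any actual LW data:

```python
import numpy as np

def kl(p, q):
    """D_KL(p || q) for two discrete distributions given as probability arrays."""
    p, q = np.asarray(p, dtype=float), np.asarray(q, dtype=float)
    return float(np.sum(p * np.log(p / q)))

# Made-up distributions over four kinds of comment, purely for illustration.
corpus = np.array([0.4, 0.3, 0.2, 0.1])   # what humans actually write
model  = np.array([0.7, 0.1, 0.1, 0.1])   # what the model tends to generate

print(kl(corpus, model))  # ~0.24: penalizes corpus content the model rarely produces
print(kl(model, corpus))  # ~0.21: penalizes model output that looks unlike the corpus
```

And a minimal sketch of the head-to-head idea, assuming each candidate comment has already been reduced to a feature vector; the linear scorer, feature dimension, and update rule here are hypothetical illustrations of a pairwise (Bradley-Terry style) discriminator, not anything LessWrong actually runs:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def pairwise_update(w, preferred, other, lr=0.1):
    """One logistic step: raise the score of the output a human judged more
    'normal' relative to the other output generated from the same prompt."""
    diff = preferred - other
    margin = w @ diff
    grad = (sigmoid(margin) - 1.0) * diff   # gradient of -log sigmoid(margin) w.r.t. w
    return w - lr * grad

# Example: a hypothetical 16-dimensional feature vector per candidate comment.
rng = np.random.default_rng(0)
w = np.zeros(16)                            # linear 'normality' scorer
preferred, other = rng.normal(size=16), rng.normal(size=16)
w = pairwise_update(w, preferred, other)
```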