All of Lizka's Comments + Replies

Lizka20

FYI: the paper is now out. 

See also the LW linkpost: METR: Measuring AI Ability to Complete Long Tasks, and a summary on Twitter

(IMO this is a really cool paper — very grateful to @Thomas Kwa et al. I'm looking forward to digging into the details.)