Lizka

I'm a researcher at Forethought.

Before that, I ran the non-engineering side of the EA Forum and worked on some other content-related tasks at CEA. [More about the Forum/CEA Online job.] 

Most of my content (and a more detailed bio) is on my profile on the EA Forum.

Please feel free to reach out!

Comments

Lizka

FYI: the paper is now out. 

See also the LW linkpost, "METR: Measuring AI Ability to Complete Long Tasks", and a summary on Twitter.

(IMO this is a really cool paper — very grateful to @Thomas Kwa et al. I'm looking forward to digging into the details.)