This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
Tags
LW
Login
METR (org)
Edit
History
Subscribe
Discussion
(0)
Help improve this page
Edit
History
Subscribe
Discussion
(0)
Help improve this page
METR (org)
Random Tag
Contributors
2
Ruby
Formerly ARC Evals
Posts tagged
METR (org)
Most Relevant
2
10
Review of METR’s public evaluation protocol
nahoj
,
JaimeRV
5mo
0
1
153
ARC Evals new report: Evaluating Language-Model Agents on Realistic Autonomous Tasks
Ω
Beth Barnes
1y
Ω
12
1
65
METR is hiring!
Beth Barnes
11mo
1
1
40
ARC Evals: Responsible Scaling Policies
Zach Stein-Perlman
1y
9