METR (org)
Contributors: Ruby
METR was formerly known as ARC Evals.
Posts tagged METR (org)
Review of METR’s public evaluation protocol, by nahoj and JaimeRV (6mo)
ARC Evals new report: Evaluating Language-Model Agents on Realistic Autonomous Tasks, by Beth Barnes (1y)
METR is hiring!, by Beth Barnes (1y)
ARC Evals: Responsible Scaling Policies, by Zach Stein-Perlman (1y)