This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
Wikitags
LW
Login
Subscribe
Discussion
0
AI Benchmarking
Subscribe
Discussion
0
This page is a stub.
Posts tagged
AI Benchmarking
Most Relevant
2
49
FrontierMath Score of o3-mini Much Lower Than Claimed
YafahEdelman
9d
7
2
24
Broken Benchmark: MMLU
awg
2y
5
1
63
Some lessons from the OpenAI-FrontierMath debacle
7vik
2mo
9
1
36
Notable runaway-optimiser-like LLM failure modes on Biologically and Economically aligned AI safety benchmarks for LLMs with simplified observation format
Roland Pihlakas
,
Sruthi Kuriakose
,
shrutidattagupta
10d
6
1
33
Introducing REBUS: A Robust Evaluation Benchmark of Understanding Symbols
Arjun Panickssery
,
agg
1y
0
1
30
Improving Model-Written Evals for AI Safety Benchmarking
Ω
Sunishchal Dev
,
Marius Hobbhahn
5mo
Ω
0
1
20
Auto-Enhance: Developing a meta-benchmark to measure LLM agents’ ability to improve other agents
Ω
Sam F. Brown
,
BasilLabib
,
Codruta (Coco) Lugoj
,
Sai Sasank Y
8mo
Ω
0
1
19
Edge Cases in AI Alignment
Florian_Dietz
3d
2
1
18
Building AI safety benchmark environments on themes of universal human values
Roland Pihlakas
3mo
3
1
18
MMLU’s Moral Scenarios Benchmark Doesn’t Measure What You Think it Measures
Ω
corey morris
1y
Ω
2
1
10
In-Context Scheming: A Run is Worth a Thousand Words
noise-field
20d
0
1
9
Revealing alignment faking with a single prompt
Florian_Dietz
2mo
5
1
9
Understanding Benchmarks and motivating Evaluations
markov
,
Charbel-Raphaël
2mo
0
1
6
Closed-ended questions aren't as hard as you think
electroswing
1mo
0
1
5
Detailed Ideal World Benchmark
Knight Lee
2mo
2