This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
LW
Login
AlexMeinke
Posts
Sorted by New
58
Training AI agents to solve hard problems could lead to Scheming
Ω
2d
Ω
12
103
Me, Myself, and AI: the Situational Awareness Dataset (SAD) for LLMs
4mo
28
93
Apollo Research 1-year update
Ω
6mo
Ω
0
50
A starter guide for evals
Ω
10mo
Ω
2
45
Paper: Tell, Don't Show- Declarative facts influence how LLMs generalize
Ω
1y
Ω
4
Wiki Contributions
Comments
Sorted by
Newest