This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
is fundraising!
LW
$
Login
Govind Pimpale
Posts
Sorted by New
156
Current safety training techniques do not fully transfer to the agent setting
2mo
8
45
~80 Interesting Questions about Foundation Model Agent Safety
2mo
4
69
Analyzing DeepMind's Probabilistic Methods for Evaluating Agent Capabilities
Ω
5mo
Ω
0
Wiki Contributions
Comments
Sorted by
Newest