LESSWRONG
Govind Pimpale
Posts
147
Current safety training techniques do not fully transfer to the agent setting
20d
8
45
~80 Interesting Questions about Foundation Model Agent Safety
1mo
4
69
Analyzing DeepMind's Probabilistic Methods for Evaluating Agent Capabilities
Ω
4mo
Ω
0