This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
is fundraising!
LW
$
Login
mrinank_sharma
Posts
Sorted by New
76
Best-of-N Jailbreaking
Ω
5d
Ω
5
66
Towards Understanding Sycophancy in Language Models
Ω
1y
Ω
0
Review
70
Paper: Understanding and Controlling a Maze-Solving Policy Network
Ω
1y
Ω
0
Review
Wiki Contributions
Comments
Sorted by
Newest