LESSWRONG

HarrietW
40000

Posts

8 · Cooperation and Alignment in Delegation Games: You Need Both! · 10mo · 0
49 · Tall Tales at Different Scales: Evaluating Scaling Trends For Deception In Language Models · Ω · 2y · 0

Wikitag Contributions

No wikitag contributions to display.

Comments

No Comments Found