LESSWRONG
Debate (AI safety technique)
• Applied to NYU Code Debates Update/Postmortem by David Rein 13d ago
• Applied to Debating with More Persuasive LLMs Leads to More Truthful Answers by Akbir Khan 4mo ago
• Applied to OpenAI Credit Account (2510$) by Emirhan BULUT 4mo ago
• Applied to Anthropic Fall 2023 Debate Progress Update by ShayBenMoshe 6mo ago
• Applied to Deception Chess: Game #2 by RobertM 6mo ago
• Applied to AI debate: test yourself against chess 'AIs' by Richard Willis 6mo ago
• Applied to Debate helps supervise human experts [Paper] by RogerDearnaley 6mo ago
• Applied to AI Safety 101 - Chapter 5.1 - Debate by Charbel-Raphaël 7mo ago
• Applied to Evaluating Superhuman Models with Consistency Checks by Daniel Paleka 10mo ago
• Applied to A Proposal for AI Alignment: Using Directly Opposing Models by Arne B 1y ago
• Applied to Empathy bandaid for immediate AI catastrophe by installgentoo 1y ago
• Applied to [New LW Feature] "Debates" by jimrandomh 1y ago