This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
Tags
LW
Login
Subscribe
Discussion
(0)
Truthful AI
Subscribe
Discussion
(0)
This page is a stub.
Posts tagged
Truthful AI
Most Relevant
4
64
Gaming TruthfulQA: Simple Heuristics Exposed Dataset Weaknesses
Ω
TurnTrout
15d
Ω
3
2
72
New, improved multiple-choice TruthfulQA
Ω
Owain_Evans
,
James Chua
,
Steph Lin
15d
Ω
0
2
31
A tension between two prosaic alignment subgoals
Alex Lawsen
2y
8
2
26
How do LLMs give truthful answers? A discussion of LLM vs. human reasoning, ensembles & parrots
Ω
Owain_Evans
10mo
Ω
0
2
12
Truthfulness, standards and credibility
Ω
Joe Collman
3y
Ω
2
1
49
Tall Tales at Different Scales: Evaluating Scaling Trends For Deception In Language Models
Ω
Felix Hofstätter
,
Francis Rhys Ward
,
HarrietW
,
LAThomson
,
Ollie J
,
Patrik Bartak
,
Sam F. Brown
1y
Ω
0