This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
Wikitags
LW
Login
Subscribe
Discussion
0
Truthful AI
Truthful AI
Subscribe
Discussion
0
Summaries
Cancel
Submit
This page is a stub.
Posts tagged
Truthful AI
Most Relevant
4
64
Gaming TruthfulQA: Simple Heuristics Exposed Dataset Weaknesses
Ω
TurnTrout
2mo
Ω
3
2
72
New, improved multiple-choice TruthfulQA
Ω
Owain_Evans
,
James Chua
,
Steph Lin
2mo
Ω
0
2
31
A tension between two prosaic alignment subgoals
Alex Lawsen
2y
8
2
26
How do LLMs give truthful answers? A discussion of LLM vs. human reasoning, ensembles & parrots
Ω
Owain_Evans
1y
Ω
0
2
12
Truthfulness, standards and credibility
Ω
Joe Collman
3y
Ω
2
1
49
Tall Tales at Different Scales: Evaluating Scaling Trends For Deception In Language Models
Ω
Felix Hofstätter
,
Francis Rhys Ward
,
HarrietW
,
LAThomson
,
Ollie J
,
Patrik Bartak
,
Sam F. Brown
1y
Ω
0