AXRP
Written by Multicore, DanielFilan, et al.; last updated 30th Dec 2024
The AI X-Risk Research Podcast is a podcast hosted by Daniel Filan.
See also: Audio, Interviews
Posts tagged AXRP (most relevant first)
AXRP Episode 31 - Singular Learning Theory with Daniel Murfet (DanielFilan, 10mo, 72 karma, 4 comments)
AXRP Episode 27 - AI Control with Buck Shlegeris and Ryan Greenblatt (DanielFilan, 1y, 69 karma, 10 comments)
AXRP Episode 24 - Superalignment with Jan Leike (DanielFilan, 2y, 55 karma, 3 comments)
AXRP Episode 22 - Shard Theory with Quintin Pope (DanielFilan, 2y, 52 karma, 11 comments)
AXRP Episode 19 - Mechanistic Interpretability with Neel Nanda (DanielFilan, 2y, 45 karma, 0 comments)
AXRP Episode 25 - Cooperative AI with Caspar Oesterheld (DanielFilan, 1y, 43 karma, 0 comments)
AXRP Episode 39 - Evan Hubinger on Model Organisms of Misalignment (DanielFilan, 4mo, 41 karma, 0 comments)
AXRP Episode 15 - Natural Abstractions with John Wentworth (DanielFilan, 3y, 34 karma, 1 comment)
AXRP Episode 38.2 - Jesse Hoogland on Singular Learning Theory (DanielFilan, 4mo, 34 karma, 0 comments)
AXRP Episode 33 - RLHF Problems with Scott Emmons (DanielFilan, 9mo, 34 karma, 0 comments)
AXRP Episode 13 - First Principles of AGI Safety with Richard Ngo (DanielFilan, 3y, 25 karma, 1 comment)
AXRP Episode 30 - AI Security with Jeffrey Ladish (DanielFilan, 11mo, 25 karma, 0 comments)
AXRP Episode 14 - Infra-Bayesian Physicalism with Vanessa Kosoy (DanielFilan, 3y, 25 karma, 10 comments)
AXRP Episode 36 - Adam Shai and Paul Riechers on Computational Mechanics (DanielFilan, 6mo, 25 karma, 0 comments)
Video/animation: Neel Nanda explains what mechanistic interpretability is (DanielFilan, 2y, 24 karma, 7 comments)