This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
Tags
LW
Login
Subscribe
Discussion
(0)
(1)
AXRP
Subscribe
Discussion
(0)
(1)
Written by
Multicore
,
DanielFilan
,
et al.
last updated
28th Jan 2025
AI X-
Risk
Research
Podcast
is a podcast hosted by Daniel Filan.
See also:
Audio
,
Interviews
Posts tagged
AXRP
Most Relevant
2
72
AXRP Episode 31 - Singular Learning Theory with Daniel Murfet
Ω
DanielFilan
9mo
Ω
4
2
69
AXRP Episode 27 - AI Control with Buck Shlegeris and Ryan Greenblatt
Ω
DanielFilan
10mo
Ω
10
2
55
AXRP Episode 24 - Superalignment with Jan Leike
Ω
DanielFilan
2y
Ω
3
2
52
AXRP Episode 22 - Shard Theory with Quintin Pope
Ω
DanielFilan
2y
Ω
11
2
45
AXRP Episode 19 - Mechanistic Interpretability with Neel Nanda
Ω
DanielFilan
2y
Ω
0
2
43
AXRP Episode 25 - Cooperative AI with Caspar Oesterheld
Ω
DanielFilan
1y
Ω
0
2
41
AXRP Episode 39 - Evan Hubinger on Model Organisms of Misalignment
Ω
DanielFilan
2mo
Ω
0
2
34
AXRP Episode 38.2 - Jesse Hoogland on Singular Learning Theory
Ω
DanielFilan
2mo
Ω
0
2
34
AXRP Episode 15 - Natural Abstractions with John Wentworth
Ω
DanielFilan
3y
Ω
1
2
34
AXRP Episode 33 - RLHF Problems with Scott Emmons
Ω
DanielFilan
8mo
Ω
0
2
25
AXRP Episode 36 - Adam Shai and Paul Riechers on Computational Mechanics
Ω
DanielFilan
4mo
Ω
0
2
25
AXRP Episode 14 - Infra-Bayesian Physicalism with Vanessa Kosoy
Ω
DanielFilan
3y
Ω
10
2
25
AXRP Episode 30 - AI Security with Jeffrey Ladish
Ω
DanielFilan
9mo
Ω
0
2
24
Video/animation: Neel Nanda explains what mechanistic interpretability is
DanielFilan
2y
7
2
24
AXRP Episode 13 - First Principles of AGI Safety with Richard Ngo
Ω
DanielFilan
3y
Ω
1