x

LESSWRONG
LW

AXRP — LessWrong

AXRP

Edited by Multicore, DanielFilan, et al. last updated 30th Dec 2024

AI X-Risk Research Podcast is a podcast hosted by Daniel Filan.

See also: Audio, Interviews

Add Posts

1

1

Posts tagged AXRP

2

72AXRP Episode 31 - Singular Learning Theory with Daniel Murfet

2y

4

2

69AXRP Episode 27 - AI Control with Buck Shlegeris and Ryan Greenblatt

2y

10

2

55AXRP Episode 24 - Superalignment with Jan Leike

3y

3

2

52AXRP Episode 22 - Shard Theory with Quintin Pope

3y

11

2

45AXRP Episode 19 - Mechanistic Interpretability with Neel Nanda

3y

0

2

43AXRP Episode 25 - Cooperative AI with Caspar Oesterheld

2y

0

2

41AXRP Episode 39 - Evan Hubinger on Model Organisms of Misalignment

1y

0

2

34AXRP Episode 33 - RLHF Problems with Scott Emmons

2y

0

2

34AXRP Episode 15 - Natural Abstractions with John Wentworth

4y

1

2

34AXRP Episode 38.2 - Jesse Hoogland on Singular Learning Theory

1y

0

2

31AXRP Episode 45 - Samuel Albanie on DeepMind’s AGI Safety Approach

7mo

0

2

28AXRP Episode 13 - First Principles of AGI Safety with Richard Ngo

4y

1

2

28AXRP Episode 41 - Lee Sharkey on Attribution-based Parameter Decomposition

8mo

1

2

26AXRP Episode 40 - Jason Gross on Compact Proofs and Interpretability

11mo

0

2

25AXRP Episode 14 - Infra-Bayesian Physicalism with Vanessa Kosoy

4y

10

Load More (15/61)

Add Posts