LESSWRONG
LW

Home All Posts Concepts Library Community

All Posts

Sorted by New

Timeframe:All time Daily Weekly Monthly Yearly Exponential

Sorted by:Magic (New & Upvoted)Top Top (Inflation Adjusted)Recent Comments New Old

Filtered by:All Posts Frontpage Curated Questions Events

Show Low Karma Show Events

284Thoughts on seed oil

13d

106

357Transformers Represent Belief State Geometry in their Residual Stream

3d

80

121My experience using financial commitments to overcome akrasia

10d

31

304The Best Tacit Knowledge Videos on Every Subject

Parker Conley, hans truman

20d

123

77[Linkpost] Practically-A-Book Review: Rootclaim $100,000 Lab Leak Debate

16d

22

1mo

34

248My PhD thesis: Algorithmic Bayesian Epistemology

1mo

14

173Toward a Broader Conception of Adverse Selection

25d

61

200"How could I have thought that faster?"

1mo

30

140Using axis lines for good or evil

2mo

39

231My Clients, The Liars

2mo

85

110Social status part 1/2: negotiations over object-level preferences

2mo

15

57Acting Wholesomely

2mo

64

263Scale Was All We Needed, At First

1mo

31

213CFAR Takeaways: Andrew Critch

2mo

62

139And All the Shoggoths Merely Players

3mo

57

124Updatelessness doesn't solve most problems

2mo

43

211Believing In

3mo

49

108Attitudes about Applied Rationality

3mo

18

160Without fundamental advances, misalignment and catastrophe are the default outcomes of training powerful AI

Jeremy Gillen, peterbarnett

3mo

60

245The case for ensuring that powerful AIs are controlled

ryan_greenblatt, Buck

3mo

66

122A Shutdown Problem Proposal

johnswentworth, David Lorell

3mo

61

350There is way too much serendipity

3mo

56

291Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training

evhub, Carson Denison, Meg, Monte M, David Duvenaud, Nicholas Schiefer, Ethan Perez

4mo

94

131Deep atheism and AI risk

2mo

22

267Gentleness and the artificial Other

4mo

33

96A case for AI alignment being difficult

4mo

53

90Meaning & Agency

4mo

17

259Constellations are Younger than Continents

Jeffrey Heninger

4mo

22

131The Dark Arts

lsusr, Lyrongolem

4mo

49

147Discussion: Challenges with Unsupervised LLM Knowledge Discovery

Seb Farquhar, Vikrant Varma, zac_kenton, gasteigerjo, Vlad Mikulik, Rohin Shah

4mo

21

411Significantly Enhancing Adult Intelligence With Gene Editing May Be Possible

GeneSmith, kman

5mo

162

289Speaking to Congressional staffers about AI risk

2mo

23

155How useful is mechanistic interpretability?

ryan_greenblatt, Neel Nanda, Buck, habryka

4mo

53

309Shallow review of live agendas in alignment & safety

technicalities, Stag

5mo

69

138Moral Reality Check (a short story)

5mo

44

215What are the results of more parental supervision and less outdoor play?

5mo

30

282Social Dark Matter

[DEACTIVATED] Duncan Sabien

5mo

112

255AI Timelines

habryka, Daniel Kokotajlo, Ajeya Cotra, Ege Erdil

6mo

74

185Thinking By The Clock

6mo

27

261The 6D effect: When companies take risks, one email can be very powerful.

5mo

40

104Deception Chess: Game #1

Zane, aphyer, Alex A, AdamYedidia

6mo

19

240Book Review: Going Infinite

6mo

109

238Alignment Implications of LLM Successes: a Debate in One Act

6mo

50

157Holly Elmore and Rob Miles dialogue on AI Safety Advocacy

jacobjacob, Robert Miles, Holly_Elmore

6mo

30

286Towards Monosemanticity: Decomposing Language Models With Dictionary Learning

Zac Hatfield-Dodds

6mo

21

169Thomas Kwa's MIRI research experience

Thomas Kwa, peterbarnett, Vivek Hebbar, Jeremy Gillen, jacobjacob, Raemon

7mo

52

102Cohabitive Games so Far

7mo

116

326Inside Views, Impostor Syndrome, and the Great LARP

7mo

53

481The Talk: a brief explanation of sexual dimorphism

7mo

72