LESSWRONG
LW

Home All Posts Concepts Library Community

All Posts

Sorted by New

Timeframe:All time Daily Weekly Monthly Yearly Exponential

Sorted by:Magic (New & Upvoted)Top Top (Inflation Adjusted)Recent Comments New Old

Filtered by:All Posts Frontpage Curated Questions Events

Show Low Karma Show Events

229Thoughts on seed oil

2d

40

301The Best Tacit Knowledge Videos on Every Subject

Parker Conley, hans truman

9d

111

81[Linkpost] Practically-A-Book Review: Rootclaim $100,000 Lab Leak Debate

5d

22

24d

33

248My PhD thesis: Algorithmic Bayesian Epistemology

20d

14

171Toward a Broader Conception of Adverse Selection

14d

61

197"How could I have thought that faster?"

1mo

31

139Using axis lines for good or evil

1mo

39

228My Clients, The Liars

1mo

85

109Social status part 1/2: negotiations over object-level preferences

1mo

15

57Acting Wholesomely

1mo

64

262Scale Was All We Needed, At First

1mo

31

213CFAR Takeaways: Andrew Critch

2mo

62

139And All the Shoggoths Merely Players

2mo

56

124Updatelessness doesn't solve most problems

2mo

43

208Believing In

2mo

49

106Attitudes about Applied Rationality

3mo

18

159Without fundamental advances, misalignment and catastrophe are the default outcomes of training powerful AI

Jeremy Gillen, peterbarnett

3mo

59

238The case for ensuring that powerful AIs are controlled

ryan_greenblatt, Buck

3mo

66

122A Shutdown Problem Proposal

johnswentworth, David Lorell

3mo

61

349There is way too much serendipity

3mo

56

288Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training

evhub, Carson Denison, Meg, Monte M, David Duvenaud, Nicholas Schiefer, Ethan Perez

3mo

94

130Deep atheism and AI risk

2mo

22

265Gentleness and the artificial Other

3mo

32

96A case for AI alignment being difficult

4mo

53

90Meaning & Agency

4mo

17

259Constellations are Younger than Continents

Jeffrey Heninger

4mo

22

131The Dark Arts

lsusr, Lyrongolem

4mo

49

147Discussion: Challenges with Unsupervised LLM Knowledge Discovery

Seb Farquhar, Vikrant Varma, zac_kenton, gasteigerjo, Vlad Mikulik, Rohin Shah

4mo

21

404Significantly Enhancing Adult Intelligence With Gene Editing May Be Possible

GeneSmith, kman

4mo

162

288Speaking to Congressional staffers about AI risk

2mo

23

155How useful is mechanistic interpretability?

ryan_greenblatt, Neel Nanda, Buck, habryka

3mo

53

307Shallow review of live agendas in alignment & safety

technicalities, Stag

5mo

69

137Moral Reality Check (a short story)

4mo

44

215What are the results of more parental supervision and less outdoor play?

5mo

30

281Social Dark Matter

[DEACTIVATED] Duncan Sabien

5mo

112

252AI Timelines

habryka, Daniel Kokotajlo, Ajeya Cotra, Ege Erdil

5mo

74

185Thinking By The Clock

5mo

27

260The 6D effect: When companies take risks, one email can be very powerful.

5mo

40

104Deception Chess: Game #1

Zane, aphyer, Alex A, AdamYedidia

6mo

19

240Book Review: Going Infinite

6mo

109

238Alignment Implications of LLM Successes: a Debate in One Act

6mo

50

157Holly Elmore and Rob Miles dialogue on AI Safety Advocacy

jacobjacob, Robert Miles, Holly_Elmore

6mo

30

286Towards Monosemanticity: Decomposing Language Models With Dictionary Learning

Zac Hatfield-Dodds

6mo

19

169Thomas Kwa's MIRI research experience

Thomas Kwa, peterbarnett, Vivek Hebbar, Jeremy Gillen, jacobjacob, Raemon

7mo

52

102Cohabitive Games so Far

6mo

116

324Inside Views, Impostor Syndrome, and the Great LARP

6mo

53

481The Talk: a brief explanation of sexual dimorphism

7mo

72

197UDT shows that decision theory is more puzzling than ever

7mo

51

222Sum-threshold attacks

6mo

52