LESSWRONG
is fundraising!
The Best of LessWrong
LW
$

The Best of LessWrong

Here you can find the best posts of LessWrong. When posts turn more than a year old, the LessWrong community reviews and votes on how well they have stood the test of time. These are the posts that have ranked the highest for all years since 2018 (when our annual tradition of choosing the least wrong of LessWrong began).

For the years 2018, 2019 and 2020 we also published physical books with the results of our annual vote, which you can buy and learn more about here.

Sort by:

curatedyear

Rationality

Eliezer Yudkowsky

Local Validity as a Key to Sanity and Civilization

Buck

"Other people are wrong" vs "I am right"

Mark Xu

Strong Evidence is Common

johnswentworth

You Are Not Measuring What You Think You Are Measuring

johnswentworth

Gears-Level Models are Capital Investments

Hazard

How to Ignore Your Emotions (while also thinking you're awesome at emotions)

Scott Garrabrant

Yes Requires the Possibility of No

Scott Alexander

Trapped Priors As A Basic Problem Of Rationality

Duncan Sabien (Deactivated)

Split and Commit

Ben Pace

A Sketch of Good Communication

Eliezer Yudkowsky

Meta-Honesty: Firming Up Honesty Around Its Edge-Cases

Duncan Sabien (Deactivated)

Lies, Damn Lies, and Fabricated Options

Duncan Sabien (Deactivated)

CFAR Participant Handbook now available to all

johnswentworth

What Are You Tracking In Your Head?

Mark Xu

The First Sample Gives the Most Information

Duncan Sabien (Deactivated)

Shoulder Advisors 101

Zack_M_Davis

Feature Selection

abramdemski

Mistakes with Conservation of Expected Evidence

Scott Alexander

Varieties Of Argumentative Experience

Eliezer Yudkowsky

Toolbox-thinking and Law-thinking

alkjash

Babble

Kaj_Sotala

The Felt Sense: What, Why and How

Duncan Sabien (Deactivated)

Cup-Stacking Skills (or, Reflexive Involuntary Mental Motions)

Ben Pace

The Costly Coordination Mechanism of Common Knowledge

Noticing Frame Differences

Duncan Sabien (Deactivated)

Sazen

AnnaSalamon

Reality-Revealing and Reality-Masking Puzzles

Eliezer Yudkowsky

ProjectLawful.com: Eliezer's latest story, past 1M words

Eliezer Yudkowsky

Self-Integrity and the Drowning Child

Jacob Falkovich

The Treacherous Path to Rationality

Scott Garrabrant

Tyranny of the Epistemic Majority

alkjash

More Babble

abramdemski

Most Prisoner's Dilemmas are Stag Hunts; Most Stag Hunts are Schelling Problems

Raemon

Being a Robust Agent

Zack_M_Davis

Heads I Win, Tails?—Never Heard of Her; Or, Selective Reporting and the Tragedy of the Green Rationalists

Benquo

Reason isn't magic

habryka

Integrity and accountability are core parts of rationality

Raemon

The Schelling Choice is "Rabbit", not "Stag"

Diffractor

Threat-Resistant Bargaining Megapost: Introducing the ROSE Value

Raemon

Propagating Facts into Aesthetics

johnswentworth

Simulacrum 3 As Stag-Hunt Strategy

LoganStrohl

Catching the Spark

Jacob Falkovich

Is Rationalist Self-Improvement Real?

Benquo

Excerpts from a larger discussion about simulacra

Zvi

Simulacra Levels and their Interactions

Comment reply: my low-quality thoughts on why CFAR didn't get farther with a "real/efficacious art of rationality"

Eric Raymond

Rationalism before the Sequences

Owain_Evans

The Rationalists of the 1950s (and before) also called themselves “Rationalists”

Optimization

sarahconstantin

The Pavlov Strategy

johnswentworth

Coordination as a Scarce Resource

AnnaSalamon

What should you change in response to an "emergency"? And AI risk

Zvi

Prediction Markets: When Do They Work?

johnswentworth

Being the (Pareto) Best in the World

alkjash

Is Success the Enemy of Freedom? (Full)

jasoncrawford

How factories were made safe

HoldenKarnofsky

All Possible Views About Humanity's Future Are Wild

jasoncrawford

Why has nuclear power been a flop?

Zvi

Simple Rules of Law

Elizabeth

Power Buys You Distance From The Crime

Eliezer Yudkowsky

Is Clickbait Destroying Our General Intelligence?

Scott Alexander

The Tails Coming Apart As Metaphor For Life

Zvi

Asymmetric Justice

Jeffrey Ladish

Nuclear war is unlikely to cause human extinction

Can crimes be discussed literally?

Said Achmiz

The Real Rules Have No Exceptions

Lars Doucet

Lars Doucet's Georgism series on Astral Codex Ten

johnswentworth

When Money Is Abundant, Knowledge Is The Real Wealth

Working With Monsters

jasoncrawford

Why haven't we celebrated any major achievements lately?

abramdemski

The Credit Assignment Problem

Martin Sustrik

Inadequate Equilibria vs. Governance of the Commons

Raemon

The Amish, and Strategic Norms around Technology

Zvi

Blackmail

KatjaGrace

Discontinuous progress in history: an update

Scott Alexander

Rule Thinkers In, Not Out

Jameson Quinn

A voting theory primer for rationalists

HoldenKarnofsky

Nonprofit Boards are Weird

Wei Dai

Beyond Astronomical Waste

World

Ben

The Redaction Machine

Samo Burja

On the Loss and Preservation of Knowledge

Alex_Altair

Introduction to abstract entropy

Martin Sustrik

Swiss Political System: More than You ever Wanted to Know (I.)

johnswentworth

Interfaces as a Scarce Resource

johnswentworth

Transportation as a Constraint

eukaryote

There’s no such thing as a tree (phylogenetically)

Scott Alexander

Is Science Slowing Down?

Martin Sustrik

Anti-social Punishment

Martin Sustrik

Research: Rescuers during the Holocaust

GeneSmith

Toni Kurz and the Insanity of Climbing Mountains

johnswentworth

Book Review: Design Principles of Biological Circuits

Elizabeth

Literature Review: Distributed Teams

Valentine

The Intelligent Social Web

jacobjacob

Unconscious Economics

eukaryote

Spaghetti Towers

Eli Tyre

Historical mathematicians exhibit a birth order effect too

johnswentworth

What Money Cannot Buy

Scott Alexander

Book Review: The Secret Of Our Success

johnswentworth

Specializing in Problems We Don't Understand

KatjaGrace

Why did everything take so long?

Ruby

[Answer] Why wasn't science invented in China?

Scott Alexander

Mental Mountains

Kaj_Sotala

My attempt to explain Looking, insight meditation, and enlightenment in non-mysterious terms

johnswentworth

Evolution of Modularity

johnswentworth

Science in a High-Dimensional World

zhukeepa

How uniform is the neocortex?

Kaj_Sotala

Building up to an Internal Family Systems model

Steven Byrnes

My computational framework for the brain

Natália

Counter-theses on Sleep

abramdemski

What makes people intellectually active?

Bucky

Birth order effect found in Nobel Laureates in Physics

KatjaGrace

Elephant seal 2

JackH

Anti-Aging: State of the Art

Vaniver

Steelmanning Divination

Kaj_Sotala

Book summary: Unlocking the Emotional Brain

Practical

alkjash

Pain is not the unit of Effort

benkuhn

Staring into the abyss as a core life skill

Unreal

Rest Days vs Recovery Days

juliawise

Notes from "Don't Shoot the Dog"

Elizabeth

Luck based medicine: my resentful story of becoming a medical miracle

johnswentworth

How To Write Quickly While Maintaining Epistemic Rigor

Duncan Sabien (Deactivated)

Ruling Out Everything Else

johnswentworth

Paper-Reading for Gears

Wei Dai

Forum participation as a research strategy

To listen well, get curious

HoldenKarnofsky

Useful Vices for Wicked Problems

pjeby

The Curse Of The Counterfactual

Darmani

Leaky Delegation: You are not a Commodity

Adam Zerner

Losing the root for the tree

chanamessinger

The Onion Test for Personal and Institutional Honesty

AnnaSalamon

“PR” is corrosive; “reputation” is not.

Raemon

You Get About Five Words

HoldenKarnofsky

Learning By Writing

Valentine

Noticing the Taste of Lotus

Ruby

Do you fear the rock or the hard place?

johnswentworth

Slack Has Positive Externalities For Groups

Raemon

Limerence Messes Up Your Rationality Real Bad, Yo

mingyuan

Cryonics signup guide #1: Overview

catherio

microCOVID.org: A tool to estimate COVID risk from common activities

orthonormal

The Loudest Alarm Is Probably False

Raemon

"Can you keep this confidential? How do you know?"

Duncan Sabien (Deactivated)

In My Culture

AI Strategy

Ajeya Cotra

Without specific countermeasures, the easiest path to transformative AI likely leads to AI takeover

Daniel Kokotajlo

Cortés, Pizarro, and Afonso as Precedents for Takeover

Daniel Kokotajlo

The date of AI Takeover is not the day the AI takes over

paulfchristiano

What failure looks like

Daniel Kokotajlo

What 2026 looks like

gwern

It Looks Like You're Trying To Take Over The World

Andrew_Critch

What Multipolar Failure Looks Like, and Robust Agent-Agnostic Processes (RAAPs)

paulfchristiano

Another (outer) alignment failure story

Ajeya Cotra

Draft report on AI timelines

Eliezer Yudkowsky

Biology-Inspired AGI Timelines: The Trick That Never Works

HoldenKarnofsky

Reply to Eliezer on Biological Anchors

Richard_Ngo

AGI safety from first principles: Introduction

Daniel Kokotajlo

Fun with +12 OOMs of Compute

Wei Dai

AI Safety "Success Stories"

KatjaGrace

Counterarguments to the basic AI x-risk case

johnswentworth

The Plan

Rohin Shah

Reframing Superintelligence: Comprehensive AI Services as General Intelligence

What an actually pessimistic containment strategy looks like

Eliezer Yudkowsky

MIRI announces new "Death With Dignity" strategy

evhub

Chris Olah’s views on AGI safety

So8res

Comments on Carlsmith's “Is power-seeking AI an existential risk?”

Adam Scholl

Safetywashing

abramdemski

The Parable of Predict-O-Matic

KatjaGrace

Let’s think about slowing down AI

nostalgebraist

human psycholinguists: a critical appraisal

nostalgebraist

larger language models may disappoint you [or, an eternally unfinished draft]

Daniel Kokotajlo

Against GDP as a metric for timelines and takeoff speeds

paulfchristiano

Arguments about fast takeoff

Eliezer Yudkowsky

Six Dimensions of Operational Adequacy in AGI Projects

Technical AI Safety

Andrew_Critch

Some AI research areas and their relevance to existential safety

1a3orn

EfficientZero: How It Works

elspood

Security Mindset: Lessons from 20+ years of Software Security Failures Relevant to AGI Alignment

So8res

Decision theory does not imply that we get to have nice things

TurnTrout

Reward is not the optimization target

johnswentworth

Worlds Where Iterative Design Fails

Vika

Specification gaming examples in AI

Rafael Harth

Inner Alignment: Explain like I'm 12 Edition

evhub

An overview of 11 proposals for building safe advanced AI

johnswentworth

Alignment By Default

johnswentworth

How To Go From Interpretability To Alignment: Just Retarget The Search

The Solomonoff Prior is Malign

paulfchristiano

My research methodology

Eliezer Yudkowsky

The Rocket Alignment Problem

Eliezer Yudkowsky

AGI Ruin: A List of Lethalities

So8res

A central AI alignment problem: capabilities generalization, and the sharp left turn

Inaccessible information

TurnTrout

Seeking Power is Often Convergently Instrumental in MDPs

So8res

On how various plans miss the hard bits of the alignment challenge

abramdemski

Alignment Research Field Guide

paulfchristiano

The strategy-stealing assumption

Veedrac

Optimality is the tiger, and agents are its teeth

Sam Ringer

Models Don't "Get Reward"

johnswentworth

The Pointers Problem: Human Values Are A Function Of Humans' Latent Variables

Buck

Language models seem to be much better than humans at next-token prediction

abramdemski

An Untrollable Mathematician Illustrated

abramdemski

An Orthodox Case Against Utility Functions

johnswentworth

Selection Theorems: A Program For Understanding Agents

Rohin Shah

Coherence arguments do not entail goal-directed behavior

Alex Flint

The ground of optimization

paulfchristiano

Where I agree and disagree with Eliezer

Eliezer Yudkowsky

Ngo and Yudkowsky on alignment difficulty

abramdemski

Embedded Agents

evhub

Risks from Learned Optimization: Introduction

nostalgebraist

chinchilla's wild implications

johnswentworth

Why Agent Foundations? An Overly Abstract Explanation

zhukeepa

Paul's research agenda FAQ

Eliezer Yudkowsky

Coherent decisions imply consistent utilities

paulfchristiano

Open question: are minimal circuits daemon-free?

Causal Scrubbing: a method for rigorously testing interpretability hypotheses [Redwood Research]

TurnTrout

Humans provide an untapped wealth of evidence about alignment

Neel Nanda

A Mechanistic Interpretability Analysis of Grokking

Collin

How "Discovering Latent Knowledge in Language Models Without Supervision" Fits Into a Broader Alignment Scheme

evhub

Understanding “Deep Double Descent”

Quintin Pope

The shard theory of human values

TurnTrout

Inner and outer alignment decompose one hard problem into two extremely hard problems

Eliezer Yudkowsky

Challenges to Christiano’s capability amplification proposal

Scott Garrabrant

Finite Factored Sets

paulfchristiano

ARC's first technical report: Eliciting Latent Knowledge

Diffractor

Introduction To The Infra-Bayesianism Sequence

20222021202020192018All

RationalityPracticalOptimizationWorldAI StrategyTechnical AI SafetyAll

This Can't Go On

We're used to the economy growing a few percent per year. But this is a very unusual situation. Zooming out to all of history, we see that growth has been accelerating, that it's near its historical high point, and that it's faster than it can be for all that much longer. There aren't enough atoms in the galaxy to sustain this rate of growth for even another 10,000 years!

What comes next – stagnation, explosion, or collapse?

#13

How factories were made safe

Back in the early days of factories, workplace injury rates were enormous. Over time, safety engineering took hold, various legal reforms were passed (most notably liability law), and those rates dramatically dropped. This is the story of how factories went from death traps to relatively safe.

#15

Making Vaccine

John made his own COVID-19 vaccine at home using open source instructions. Here's how he did it and why.

14Viliam

Two years later, I suppose we know more than we did when the article was written. I would like to read some postscript explaining how well this article has aged.

11Drake Morrison

A great example of taking the initiative and actually trying something that looks useful, even when it would be weird or frowned upon in normal society. I would like to see a post-review, but I'm not even sure if that matters. Going ahead and trying something that seems obviously useful, but weird and no one else is doing is already hard enough. This post was inspiring.

#17

All Possible Views About Humanity's Future Are Wild

It's wild to think that humanity might expand throughout the galaxy in the next century or two. But it's also wild to think that we definitely won't. In fact, all views about humanity's long-term future are pretty wild when you think about it. We're in a wild situation!

#36

Working With Monsters

A person wakes up from cryonic freeze in a post-apocalyptic future. A "scissor" statement – an AI-generated statement designed to provoke maximum controversy – has led to massive conflict and destruction. The survivors are those who managed to work with people they morally despise.

#40

Lars Doucet's Georgism series on Astral Codex Ten

An in-depth overview of Georgism, a school of political economy that advocates for a Land Value Tax (LVT), aiming to discourage land speculation and rent-seeking behavior; promote more efficient use of land, make housing more affordable, and taxes more efficient.

#45

Why has nuclear power been a flop?

Nuclear power once seemed to be the energy of the future, but has failed to live up to that promise. Why? Jason Crawford summarizes Jack Devanney's book "Why Nuclear Power Has Been a Flop", which blames overregulation driven by unrealistic radiation safety models.