For people who care about falsifiable stakes rather than vibes. TL;DR: All timeline arguments ultimately turn on five quantitative pivots. Pick optimistic answers to three of them and your median forecast collapses into the 2026–2029 range; pick pessimistic answers to any two and you drift past 2040. The pivots (I...
The idea that wealth should be inherited is so ingrained in our thinking that we rarely question it. But step back for a moment, and it becomes a curious thing: this notion that assets, land, and fortunes should be passed down from one generation to the next without scrutiny. The debate over...
> Chris Olah's “What is a Linear Representation? What is a Multidimensional Feature?” (July Circuits Update) gave me pause about the term "one-dimensional feature." I had initially conflated that phrase with the number of dimensions in the activation space (for example, the 768 dimensions in GPT‑2 Small)....
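The distinction above can be made concrete with a toy sketch (all names and shapes here are illustrative, not from the linked post): a "one-dimensional feature" is a single *direction* in activation space, and its value is the scalar projection of an activation onto that direction, regardless of how many dimensions the space itself has.

```python
import numpy as np

rng = np.random.default_rng(0)

# A hypothetical feature direction in GPT-2 Small's 768-dim residual stream.
direction = rng.normal(size=768)
direction /= np.linalg.norm(direction)   # unit-length feature direction

# One cached activation vector from the same 768-dim space.
activation = rng.normal(size=768)

# The feature's value is a single scalar projection: the feature is
# one-dimensional even though the ambient space has 768 dimensions.
feature_value = activation @ direction

print(feature_value.shape)  # () — a scalar, not a 768-dim object
```

So "one-dimensional" describes the feature's own degrees of freedom, not the width of the model's activation vectors.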
I've noticed a pattern in how humans and language models work together, one that emerges whenever we collaborate effectively. It goes like this: someone has an initial idea (step 1). An LLM then generates variations and connections around that idea (step 2). A human needs to look at...
> "What is the use of having developed a science well enough to make predictions if, in the end, all we're willing to do is stand around and wait for them to come true?" F. Sherwood Rowland, in his speech accepting the 1995 Nobel Prize in Chemistry. > "Once...
Executive Summary * I try vector-quantised variational autoencoders (VQ-VAEs) as an alternative compression scheme for transformer activations (as opposed to something like a sparse autoencoder). * Whilst people have danced around this idea before, discrete quantisation has only ever been tried inside the transformer architecture itself, rather than on cached...
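The core operation being proposed, quantising cached activations against a codebook, can be sketched minimally as follows. This is an assumption-laden illustration (codebook size, `d_model`, and batch shapes are invented), not the post's actual method:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical learned codebook: K discrete codes in activation space.
K, d_model = 512, 768
codebook = rng.normal(size=(K, d_model))

# A batch of cached transformer activations (e.g. residual-stream vectors).
activations = rng.normal(size=(32, d_model))

# Nearest-neighbour assignment: each activation maps to the code with the
# smallest squared Euclidean distance (the standard VQ step).
d2 = ((activations[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=-1)
codes = d2.argmin(axis=1)        # (32,) integer indices — the compressed form
quantised = codebook[codes]      # (32, 768) reconstruction from the codebook
```

The compression comes from storing only the integer `codes` (log2(K) bits each) instead of the full float vectors; the open question the post raises is whether this discretisation, applied to *cached* activations rather than inside the architecture, preserves what matters.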