Cole Wyeth

Message

I am a PhD student in computer science at the University of Waterloo, supervised by Professor Ming Li and advised by Professor Marcus Hutter.

My current research is related to applications of algorithmic probability to sequential decision theory (universal artificial intelligence). Recently I have been trying...

5086

315

897

Cole Wyeth

My model of what is going on with LLMs

Epistemic status: You probably already know if you want to read this kind of post, but in case you have not decided: my impression is that people are acting very confused about what we can conclude about scaling LLMs from the evidence, and I believe my mental model cuts through a lot of this confusion - I have tried to rebut what I believe to be misconceptions in a scattershot way, but will attempt to collect the whole picture here. I am a theoretical computer scientist and this is a theory. Soon I want to do some more serious empirical research around it - but be aware that most of my ideas about LLMs have not had the kind of careful, detailed contact with reality that I would like at the time of writing this post. If you're a good engineer (or just think I am dropping the ball somewhere) and are interested in helping dig into this please reach out. This post is not about timelines, though I think it has obvious implications for timelines. We have seen LLMs scale to impressively general performance. This does not mean they will soon reach human level because intelligence is not just a knob that needs to get turned further, it comprises qualitatively distinct functions. At this point it is not plausible that we can precisely predict how far we are from unlocking all remaining functions since it will probably require more insights. The natural guess is that the answer is on the scale of decades. It's important to take a step back and understand the history of how progress in A.I. takes place, following the main line of connectionist algorithms that (in hindsight, back-chaining from the frontier) are load-bearing. This story is relatively old and well-known, but I still need to retell it because I want to make a couple of points clear. First, deep learning has made impressive steps several times over the course of decades. Second, "blind scaling" has contributed substantially but has not been the whole story, conceptual insights piled on top of (and occasionally

111Feb 13, 2025

Cole Wyeth

Message

I am a PhD student in computer science at the University of Waterloo, supervised by Professor Ming Li and advised by Professor Marcus Hutter.

My current research is related to applications of algorithmic probability to sequential decision theory (universal artificial intelligence). Recently I have been trying...

5086

315

897

"We are confused about agency"

Epistemic status: Speculation on agent foundations research culture (which I am pretty deeply engaged with) and whether "we are confused about agency" which I am not sure about. I will take for granted that this is a common refrain, which should be familiar to anyone who is part of the...

Feb 1756

Who is responsible for shutting down rogue AI?

A loss of control scenario would likely result in rogue AI replicating themselves across the internet, as discussed here: https://metr.org/blog/2024-11-12-rogue-replication-threat-model/ Under fast takeoff models, the first rogue AGI posing a serious takeover/extinction risk to humanity would very likely be the last, with no chance for serious opposition (e.g. Sable). This...

Jan 144

AIXI with general utility functions: "Value under ignorance in UAI"

This updated version of my AGI 2025 paper with Marcus Hutter, "Value under ignorance in universal artificial intelligence," studies general utility functions for AIXI. Surprisingly, the (hyper)computability properties have connections to imprecise probability theory! AIXI uses a defective Bayesian mixture called a semimeasure, which is often viewed as expressing a...

Dec 22, 202525

Cognitive Tech from Algorithmic Information Theory

Epistemic status: Compressed aphorisms. This post contains no algorithmic information theory (AIT) exposition, only the rationality lessons that I (think I've) learned from studying AIT / AIXI for the last few years. Many of these are not direct translations of AIT theorems, but rather frames suggested by AIT. In some...

Dec 11, 202541

On the Aesthetic of Wizard Power

Epistemic status: A response to @johnswentworth's "Orienting Towards Wizard Power." This post is about the aesthetic of wizard power, NOT its (nearest) instantiation in the real world, so that fictional evidence is appropriately treated as direct evidence. Wentworth distinguishes the power of kings from the power of wizards. The power...

Dec 4, 202530

Embedded Universal Predictive Intelligence

A team at Google has substantially advanced the theory of embedded agency with a grain of truth (GOT), including new developments on reflective oracles and an interesting alternative construction (the "Reflective Universal Inductor" or RUI). (I was not involved in this work) Abstract: > The standard theory of model-free reinforcement...

Dec 3, 202579

Considering the Relevance of Computational Uncertainty for AI Safety

Epistemic Status: Thoughts collected at the Sydney Mathematical Research Institute's Focus Period on the Mathematical Science of AI Safety, where I am a key participant. Not representative of anyone else's views. So far Agent Foundations (AF) has made steady but perhaps slow progress which has not yielded currently practical AI...

Nov 16, 202520

Load More (7/53)

LESSWRONG
LW

LESSWRONG
LW

Cole Wyeth

Cole Wyeth

Cole Wyeth

My model of what is going on with LLMs

Have LLMs Generated Novel Insights?

Cognitive Tech from Algorithmic Information Theory

Alignment as uploading with more steps

Cole Wyeth

"We are confused about agency"

Who is responsible for shutting down rogue AI?

AIXI with general utility functions: "Value under ignorance in UAI"

Cognitive Tech from Algorithmic Information Theory

On the Aesthetic of Wizard Power

Embedded Universal Predictive Intelligence

Considering the Relevance of Computational Uncertainty for AI Safety

My model of what is going on with LLMs

Have LLMs Generated Novel Insights?

Cognitive Tech from Algorithmic Information Theory

Alignment as uploading with more steps

"We are confused about agency"

Who is responsible for shutting down rogue AI?

AIXI with general utility functions: "Value under ignorance in UAI"

Cognitive Tech from Algorithmic Information Theory

On the Aesthetic of Wizard Power

Embedded Universal Predictive Intelligence

Considering the Relevance of Computational Uncertainty for AI Safety