Cole Wyeth

I am a PhD student in computer science at the University of Waterloo, supervised by Professor Ming Li and advised by Professor Marcus Hutter.

My current research is related to applications of algorithmic probability to sequential decision theory (universal artificial intelligence). Recently I have been trying to start a dialogue between the computational cognitive science and UAI communities. Sometimes I build robots, professionally or otherwise. Another hobby (and a personal favorite of my posts here) is the Sherlockian abduction master list, which is a crowdsourced project seeking to make "Sherlock Holmes" style inference feasible by compiling observational cues. Give it a read and see if you can contribute!

See my personal website colewyeth.com for an overview of my interests and work.

I do ~two types of writing: academic publications and (LessWrong) posts. With the former I try to be careful enough that I can stand by ~all (strong/central) claims in 10 years, usually by presenting a combination of theorems with rigorous proofs and only more conservative intuitive speculation. With the latter, I try to learn enough by writing that I have changed my mind by the time I'm finished - and though I usually include an "epistemic status" to suggest my (final) degree of confidence before posting, the ensuing discussion often changes my mind again. As of mid-2025, I think that the chances of AGI in the next few years are high enough (though still <50%) that it’s best to focus on disseminating safety-relevant research as rapidly as possible, so I’m focusing less on long-term goals like academic success and the associated incentives. That means most of my work will appear online in an unpolished form long before it is published.

Sequences

I recklessly speculate about timelines
Meta-theory of rationality
AIXI Agent foundations
Deliberative Algorithms as Scaffolding

5 · Cole Wyeth's Shortform · 1y · 192 comments

Comments

AI 2027: What Superintelligence Looks Like
Cole Wyeth · 5mo

I expect this to start not happening right away.

So at least we’ll see who’s right soon.

What, if not agency?
Cole Wyeth · 19m

I think I agree with your take on this, Abram.

The most extreme version of an AI not being self-defensive seems like the Greg Egan “Permutation City” scenario, where shutting down a simulation doesn’t even harm anyone inside - that computation just picks some other substrate “out of the dust.”

By the way, this post dovetails interestingly with my latest on alignment as uploading with more steps.

A brief argument against utilitarianism
Cole Wyeth · 2h

We don't confuse metrics for utility functions.

Kabir Kumar's Shortform
Cole Wyeth · 17h

What kind of knowledge specifically are these lawyers looking for?

Alignment as uploading with more steps
Cole Wyeth · 20h

I don’t necessarily disagree that these guesses are plausible, but I don’t think it’s possible to predict exactly what emulation world ends up looking like, and even your high level description of the dynamics looks very likely to be wrong.

The goal is to become one of the early emulations and shape the culture, regulations, technology etc. into a positive and stable form - or at least, into carefully chosen initial conditions.

Alignment as uploading with more steps
Cole Wyeth · 20h

There’s a difference between building a model of a person and using that model as a core element of your decision making algorithm. So what you’re describing seems even weaker than weak necessity.

However, I agree that some of the ideas I’ve sketched are pretty loose. I’m only trying to provide a conceptual frame and work out some of its implications.

Alignment as uploading with more steps
Cole Wyeth · 1d

Skill issue, past me endorses current me. 

Alignment as uploading with more steps
Cole Wyeth · 1d

Yes, I think what you’re describing is basically CIRL? This can potentially achieve incremental uploading. I just see it as technically more challenging than pure imitation learning. However, it seems conceivable that something like CIRL is needed during some kind of “takeoff” phase, when the (imitation learned) agent tries to actively learn how it should generalize by interacting with the original over longer time scales and while operating in the world. That seems pretty hard to get right. 

Alignment as uploading with more steps
Cole Wyeth · 1d

This is interesting, but I again caution that fine-tuning a foundation model is unlikely to result in an emulation which generalizes properly. Same (but worse) for prompting.

Alignment as uploading with more steps
Cole Wyeth · 1d

Maybe, but we usually endorse the way that our values change over time, so this isn’t necessarily a bad thing.

Also, I find it hard to imagine hating my past self so much that I would want to kill him or allow him to be killed. I feel a certain protectiveness and affection for my self of 10 or 15 years ago. So I feel like at least weak upload sufficiency should hold - do you disagree?

Wikitag Contributions

AIXI · 8 months ago · (+11/-174)
Anvil Problem · a year ago · (+119)
Posts (karma · title · age · comment count)

57 · Alignment as uploading with more steps · 1d · 17 comments
16 · Sleeping Experts in the (reflective) Solomonoff Prior (Ω) · 15d · 0 comments
53 · New Paper on Reflective Oracles & Grain of Truth Problem (Ω) · 21d · 0 comments
46 · Launching new AIXI research community website + reading group(s) · 1mo · 2 comments
26 · Pitfalls of Building UDT Agents (Ω) · 2mo · 5 comments
16 · Explaining your life with self-reflective AIXI (an interlude) (Ω) · 2mo · 0 comments
29 · Unbounded Embedded Agency: AEDT w.r.t. rOSI (Ω) · 2mo · 0 comments
19 · A simple explanation of incomplete models (Ω) · 2mo · 1 comment
64 · Paradigms for computation (Ω) · 3mo · 10 comments
31 · LLM in-context learning as (approximating) Solomonoff induction (Ω) · 3mo · 3 comments