abramdemski

The Parable of Predict-O-Matic

I've been thinking more about partial agency. I want to expand on some issues brought up in the comments to my previous post, and on other complications which I've been thinking about. But for now, a more informal parable. (Mainly because this is easier to write than my more technical thoughts.) This relates to oracle AI and to inner optimizers, but my focus is a little different. 1 Suppose you are designing a new invention, a predict-o-matic. It is a wonderous machine which will predict everything for us: weather, politics, the newest advances in quantum physics, you name it. The machine isn't infallible, but it will integrate data across a wide range of domains, automatically keeping itself up-to-date with all areas of science and current events. You fully expect that once your product goes live, it will become a household utility, replacing services like Google. (Google only lets you search the known!) Things are going well. You've got investors. You have an office and a staff. These days, it hardly even feels like a start-up any more; progress is going well. One day, an intern raises a concern. "If everyone is going to be using Predict-O-Matic, we can't think of it as a passive observer. Its answers will shape events. If it says stocks will rise, they'll rise. If it says stocks will fall, then fall they will. Many people will vote based on its predictions." "Yes," you say, "but Predict-O-Matic is an impartial observer nonetheless. It will answer people's questions as best it can, and they react however they will." "But --" the intern objects -- "Predict-O-Matic will see those possible reactions. It knows it could give several different valid predictions, and different predictions result in different futures. It has to decide which one to give somehow." You tap on your desk in thought for a few seconds. "That's true. But we can still keep it objective. It could pick randomly." "Randomly? But some of these will be huge issues! Companies -- no, nations --

365Oct 15, 2019

abramdemski

Message

21719

3970

261

2141

17y

Claude's Bad Primer Fanfic

Spoilers for Primer. I rewatched the movie Primer last night for the nth time. It's one of very few movies I'll rewatch every so often, when I've forgotten some of the details, because the experience of piecing the plot together is so satisfying. I sometimes watch it twice in a...

Feb 823

Condensation & Relevance

(This post elaborates on a few ideas from my review of Sam Eisenstat's Condensation: a theory of concepts. It should be somewhat readable on its own but doesn't fully explain what condensation is on its own; for that, see my review or Sam's paper. The post came out of conversations...

Jan 2334

Understanding Trust: Project Update

This is a brief note on what I did with my funding in 2025, and my plans for 2026, written primarily because Manifund nudged me for an update on my project. I ran my AISC project (which I announced here) with four mentees in Spring 2025: Norman Hsia, Hanna Gabor,...

Jan 1762

Taking LLMs Seriously (As Language Models)

This is my attempt to write down what I would be researching, if I were working directly with LLMs rather than doing Agent Foundations. (I'm open to collaboration on these ideas.) Machine Learning research can occupy different points on a spectrum between science and engineering: science-like research seeks to understand...

Jan 956

Inkhaven Retrospective

This will be the 30th post of at least 500 words I have written this month. (I did somewhat cheat two days ago, by making a 500+ word edit to Legitimate Deliberation, which I also posted independently as a shortform.) Inkhaven has been very much what I was hoping for....

Nov 30, 202540

Does SI Disfavor Computationalism?

cube_flipper of smoothbrains.net recently made something resembling the following argument in a talk. I like the argument because it uses tools of computationalism to argue against computationalism: it argues within the Solomonoff Induction framework, against the computationalist position on phenomenal consciousness. This is my own interpretation of the argument; if...

Nov 30, 202538

Legitimate Deliberation

Legitimate deliberation is an alignment target; an alternative to / variation of Coherent Extrapolated Volition. What could make us trust an AI? Can we imagine a near-future scenario where we might consider some frontier model's outputs to be reliable, rather than needing to check any claims it makes via human...

Nov 27, 202544

Load More (7/295)

LESSWRONG
LW

LESSWRONG
LW

abramdemski

abramdemski

abramdemski

The Parable of Predict-O-Matic

Alignment Research Field Guide

Leaving MIRI, Seeking Funding

Embedded Agents

abramdemski

Claude's Bad Primer Fanfic

Condensation & Relevance

Understanding Trust: Project Update

Taking LLMs Seriously (As Language Models)

Inkhaven Retrospective

Does SI Disfavor Computationalism?

Legitimate Deliberation

The Parable of Predict-O-Matic

Alignment Research Field Guide

Leaving MIRI, Seeking Funding

Embedded Agents

Claude's Bad Primer Fanfic

Condensation & Relevance

Understanding Trust: Project Update

Taking LLMs Seriously (As Language Models)

Inkhaven Retrospective

Does SI Disfavor Computationalism?

Legitimate Deliberation