I think the really interesting interaction between these two frames is when selection pressures lead to predictive capacities. When does this happen? A first guess might be: when the training (selecting) environment is so complicated, and there is so much local variance that the selective loop finds its easiest to instill a predictive agent and let that take care of the local adaptation.
A lot of stuff works like this: you can have generic chess/math heuristics but you need to be able to do local calculations to not fall flat on your face; evolution more or less works like this in mammals and obviously humans, maybe much more; presumably LLMs work like this; our central nervous system/mind works like this wrt individual cells in the body.
Are there other factors that mediate how a selective process can give rise to local predictive agents? What consequences does this transition have? Cancer/parasites/fraud are three instances of one example, what else?
Selective optimization finding predictive optimizers (with a different objective) is the main idea of "Risks from Learned Optimisation" and indeed they have Section 2: Conditions for mesa-optimization
I mean, "selection pressure creates artefacts with learning/predictive capabiltiies" is also just how evolution works. It's selective optimisation creating predictive optimisers all the way down: Even cells and nematodes have learning capabilities. What we humans see as our unique intelligence can be considered belonging to a long and storied genre of pathfinding and future-simulating behaviours, only now carried out in higher and higher dimensional action spaces---and at each step selection is used to grow and develop these capabilties. Given that, it seems reasonable to say that if evolution can be described as a coherent phenomenon at all, it will be a phenomenon that acts on intelligent goal-driven systems. (This comment comes from some notes I wrote down while reading the Moloch essay)
This made me think of Demski's post on selection vs control: https://www.lesswrong.com/posts/ZDZmopKquzHYPRNxq/selection-vs-control
I'm curious - Would you agree or disagree with the description that prediction is selection internally? i.e. we consider different hypotheses then select the ones that fit best?
I'd guess: A predictive optimizer is built out of parts, but its defining feature is that one of the parts is beliefs about external reality. It can contain other parts, like learning or planning algorithms, which can be selective optimizers.
Yeah it seems to me like predictive optimization is often a kind of internal selective optimization.
In 'Emergent' misaligned outcomes' I was trying to gesture at something like the selection effects on non-human, made-of-humans-for-now systems, and how they might incorporate AI capabilities (even non-agentic or corrigible agentic ones) and become 'more than merely/mainly selected' at some point.
Sometimes I find myself reaching for phrases like 'non-agentic AGI embedded in an economic context'. I’m trying to express a fuzzier but maybe more concerning concept that the political or economic environment itself might contain non-human goal-directed or optimising pressures/forces, and particular tyrannical humans or human organisations are just some of the outputs of those forces.
Such an optimising force, since inhuman, is liable to be misaligned by default. In fact this more general version of the argument may apply even if we make corrigible agentic AGI - perhaps obvious in hindsight, this has only become clear to me as I write.
We could suggestively call such an emergent goal-directed system a ‘miscoordination demon’. I would claim that such systems are already substantially reducing the amount of value in the world and will plausibly continue to do so with or without the introduction of AGI. If the introduction of AGI differentially empowers human agency vs such miscoordination demons, we could imagine either being able to subdue them (perhaps permanently), or being subdued by them (perhaps permanently).
I think this is something like a garbled proto gradual disempowerment perspective, and I appreciate the crisper articulation here and in your other writing.
Daniel Dennett has called competence without comprehension what you call selective optimization.
Gradient descent on Atari games
I feel a bit confused about gradient descent being described as a selective process, and thus about this binary. Is gradient descent a selective process? It doesn't seem like it.
All the other examples of selective processes involve... variation and selection: you have a population with variation, the population gets culled, the remaining population has more of some quality, repeat. But gradient descent does not feature this, at least not in a straightforward way. There's no pool of candidates, no acceptance / rejection, no competition, really.
(This might have consequences, for instance, with how gradient descent can work differently from more selective / evolutionary processes. Evolutionary Strategies At Scale for instance, finds that "Evolutionary Strategies" has a different behavior when used to train an LLM than gradient descent. See also.)
But generally this binary feels pretty fuzzy to me; the MECE-ness of it, or membership criteria seems unclear.
I wrote something about this a while back: in short, with a squint gradient descent and natural selection are the same.
From my point of view, one thing that's particularly relevant is that they're both operating locally, with very no/little foresight, over a high-dimensional design space. You could look at GD as selecting among all the possible local steps, and 'competing' them based on the heuristic of their local loss gradient (as approximated by the (sampled) dataset-derived estimator).
Some key practical differences between varying instantiations of GD/NS will be in the effective 'proposal'/generating procedures and 'promotion'/selection heuristics.
This confusion comes about because natural selection has no mechanism to maintain variation. Equivalently, gradient descent can only work with the data provided or in other words it has no "proposal" step like Gibbs sampling or MCMC. So the idea that gradient descent and natural selection are the same feels intuitive to me. (Caveat being frequency-dependent selection.)
It is also known that some models of evolutionary game theory recover Fisher's theorem of natural selection as a consequence of the replicator equation (a model of natural selection) as a gradient flow, see this arxiv paper. (Might have bungled the explanation on this one, so take with some salt.)
I think it's possible that gradient descent works by applying a selection pressure to preexisting circuits in the initial randomization with some finetuning. This would explain why most weights are zero after training as well as stuff like the lottery ticket hypothesis.
This would explain why most weights are zero after training
As far as I know this is just false, though?
It can go the other way too: a predictive process can create a selective one.
For example, a libertarian economist encouraging free markets, a programmer using genetic algorithms, someone setting up an art contest to find the best art, etc...
The art contest actually has multiple layers:
Looking over my favourite posts, I notice that many of them are making specific versions of a more general claim, which is essentially: don’t confuse selective processes for predictive processes.
Here, I’m going to try to make that more general claim, rehash some examples in light of it, and end with a few ambient confusions I think this framework can help with, for the reader to ponder.
When you encounter an entity that is very good at achieving some outcome, there are two very different processes that could be going on under the hood:
It’s not a perfect binary, and often what you see is a mix of the two. In particular, all predictive optimisers have emerged from selective optimisation and often retain some fingerprint.
Selective
Predictive
Weird Mix
Bacteria developing antibiotic resistance
Hacker finding a way to penetrate a secure system
Humans evolving to be good at lying
Gradient descent on Atari games
Tree searching Connect Four
AlphaZero training a policy on its own rollouts
Flowers co-evolving with their pollinators
Humans genetically modifying crops
Humans selectively breeding dogs
Human brains seem to be hardwired to reason about intent, in the same way that we see faces everywhere. The problem is, selective processes behave a bit differently. For example:
So when you try to interpret a system as purely predictive when it’s at least partly selective, you might mistakenly assume it generalises a lot more cleanly than it actually does, that its behaviour is in some meaningful sense intended, or that it can’t be that optimised because you can’t see much computation lying around. These can sometimes be dangerous mistakes.
(The last one in particular gives you a slightly more precise variant of Chesterton's Fence: before scrapping a tradition where nobody can articulate why it's useful, at least ballpark how much optimisation has probably gone into it.)
That’s the whole point. The rest of this post will just be spelling out examples, but feel free to stop if you’ve already got the gist.
This picture is basically right but not quite — can you spot why?
Classic Examples and Confusions
Now that I've given the general claim and a few instantiations, I’m going to close with some other cases where I think this distinction is relevant, for the reader to ponder.
Either by making predictions itself or by being crafted by a predicting entity
The author of this post claims that this is specifically because people find these explanations less intuitive.
One of the most common counterarguments is roughly “nobody wants this so it won’t happen”. I think it has some bite, but not much, and I am still not sure how to bridge that inferential gap — that’s part of what motivated me to write this post up.