All of Andy_McKenzie's Comments + Replies

It seems to me like your model is not necessarily taking into account technical debt sufficiently enough. https://neurobiology.substack.com/p/technical-debt-probably-the-main-roadblack-in-applying-machine-learning-to-medicine

It seems to me like this is the main thing that will slow down the extent to which foundation models can consistently beat newly trained specialized models.

Anecdotally, I know several people who don’t like to use chatgpt because its training cuts off in 2021. This seems like a form of technical debt.

I guess it depends on how easily ada... (read more)

Sounds good, can't find your email address, DM'd you.

Those sound good to me! I donated to your charity (the Animal Welfare Fund) to finalize it. Lmk if you want me to email you the receipt. Here's the manifold market:

Bet

Andy will donate $50 to a charity of Daniel's choice now.

If, by January 2027, there is not a report from a reputable source confirming that at least three companies, that would previously have relied upon programmers, and meet a defined level of success, are being run without the need for human programmers, due to the independent capabilities of an AI developed by Op... (read more)

2Daniel Kokotajlo2y

Sounds good, thank you! Emailing the receipt would be nice.

Sounds good, I'm happy with that arrangement once we get these details figured out.

Regarding the human programmer formality, it seems like business owners would have to be really incompetent for this to be a factor. Plenty of managers have coding experience. If the programmers aren't doing anything useful then they will be let go or new companies will start that don't have them. They are a huge expense. I'm inclined to not include this since it's an ambiguity that seems implausible to me.

Regarding the potential ban by the government, I wasn't r... (read more)

5Daniel Kokotajlo2y

How about this: --Re the first grey area: We rule in your favor here. --Re the second grey area: You decide, in 2027, based on your own best judgment, whether or not it would have happened absent regulation. I can disagree with your judgment, but I still have to agree that you won the bet (if you rule in your favor).

Andy_McKenzie2y30

Understandable. How about this?

Bet

Andy will donate $50 to a charity of Daniel's choice now.

Terms

Reputable Sourc... (read more)

2Daniel Kokotajlo2y

Given your lack of disposable money I think this would be a bad deal for you, and as for me, it is sorta borderline (my credence that the bet will resolve in your favor is something like 40%?) but sure, let's do it. As for what charity to donate to, how about Animal Welfare Fund | Effective Altruism Funds. Thanks for working out all these details! Here are some grey area cases we should work out: --What if there is a human programmer managing the whole setup, but they are basically a formality? Like, the company does technically have programmers on staff but the programmers basically just form an interface between the company and ChatGPT and theoretically if the managers of the company were willing to spend a month learning how to talk to ChatGPT effectively they could fire the human programmers? --What if it's clear that the reason you are winning the bet is that the government has stepped in to ban the relevant sorts of AI?

The basic reasons I expect AGI ruin

Andy_McKenzie2y70

I’m wondering if we could make this into a bet. If by remote workers we include programmers, then I’d be willing to bet that GPT-5/6, depending upon what that means (might be easier to say the top LLMs or other models trained by anyone by 2026?) will not be able to replace them.

8Daniel Kokotajlo2y

I've made several bets like this in the past, but it's a bit frustrating since I don't stand to gain anything by winning -- by the time I win the bet, we are well into the singularity & there isn't much for me to do with the money anymore. What are the terms you have in mind? We could do the thing where you give me money now, and I give it back with interest later.

Geoff Hinton Quits Google

These curves are due to temporary plateaus, not permanent ones. Moore's law is an example of a constraint that seems likely to plateau. I'm talking about takeoff speeds, not eventual capabilities with no resource limitations, which I agree would be quite high and I have little idea of how to estimate (there will probably still be some constraints, like within-system communication constraints).

1Decaeneus2y

Understood, and agreed, but I'm still left wondering about my question as it pertains to the first sigmoidal curve that shows STEM-capable AGI. Not trying to be nitpicky, just wondering how we should reason about the likelihood that the plateau of that first curve is not already far above the current limit of human capability. A reason to think so may be something to do with irreducible complexity making things very hard for us at around the same level that it would make them hard for a (first-gen) AGI. But a reason to think the opposite would be that we have line of sight to a bunch of amazing tech already, it's just a question of allocating the resources to support sufficiently many smart people working out the details. Another reason to think the opposite is that having a system that's (in some sense) directly optimized to be intelligent might just have a plateau drawn from a higher-meaned distribution than one that's optimized for fitness, and develops intelligence as a useful tool in that direction, since the pressure-on-intelligence for that sort of caps out at whatever it takes to dominate your immediate environment.

Andy_McKenzie2y145

Does anyone know of any AI-related predictions by Hinton?

Here's the only one I know of - "People should stop training radiologists now. It's just completely obvious within five years deep learning is going to do better than radiologists because it can get a lot more experience. And it might be ten years but we got plenty of radiologists already." - 2016, slightly paraphrased

This seems like still a testable prediction - by November 2026, radiologists should be completely replaceable by deep learning methods, at least other than regulatory requirements for trained physicians.

4Ilio2y

This is indeed an interesting losing* bet. He was mostly right on the technical side (yes deep learning now do better than the average radiologist on many tasks). He was completely wrong on the societal impact (no we still need to train radiologists). This was the same story with ophthalmologists when deep learning significantly shorten the time needed to perform part of their job: they just spent the saved time on doing more. *16+5=21, not 26 😉

6bvbvbvbvbvbvbvbvbvbvbv2y

Fyi actually radiology is not mostly looking at pictures but doing imagery-guided surgery (for example embolisation) which is significantly harder to automate. Same for family octors : it's not just following guidelines and renewing scripts but a good part is physical examination. I agree that AI can do a lot of what happens in medicine though.

AI doom from an LLM-plateau-ist perspective

Andy_McKenzie2yΩ120

Thanks! I agree with you about all sorts of AI alignment essays being interesting and seemingly useful. My question was more about how to measure the net rate of AI safety research progress. But I agree with you that an/your expert inside view of how insights are accumulating is a reasonable metric. I also agree with you that the acceptance of TAI x-risk in the ML community as a real thing is useful and that - while I am slightly worried about the risk of overshooting, like Scott Alexander describes - this situation seems to be generally improving.

Re... (read more)

4Steven Byrnes2y

Oh, I somehow missed that your original question was about takeoff speeds. When you wrote “algorithmic insights…will lead to dramatically faster AI development”, I misread it as “algorithmic insights…will lead to dramatically more powerful AIs”. Oops. Anyway, takeoff speeds are off-topic for this post, so I won’t comment on them, sorry. :)

1sanxiyn2y

I would not describe development of deep learning as discontinuous, but I would describe it as fast. As far as I can tell, development of deep learning happened by accumulation of many small improvements over time, sometimes humorously described as graduate student descent (better initialization, better activation function, better optimizer, better architecture, better regularization, etc.). It seems possible or even probable that brain-inspired RL could follow the similar trajectory once it took off, absent interventions like changes to open publishing norm.

AI doom from an LLM-plateau-ist perspective

Andy_McKenzie2yΩ26-2

Good essay! Two questions if you have a moment:

1. Can you flesh out your view of how the community is making "slow but steady progress right now on getting ready"? In my view, much of the AI safety community seems to be doing things that have unclear safety value to me, like (a) coordinating a pause in model training that seems likely to me to make things less safe if implemented (because of leading to algorithmic and hardware overhangs) or (b) converting to capabilities work (quite common, seems like an occupational hazard for someone with initially... (read more)

Steven Byrnes2y*Ω5146

Can you flesh out your view of how the community is making "slow but steady progress right now on getting ready"?

I finished writing this less than a year ago, and it seems to be meaningfully impacting a number of people’s thinking, hopefully for the better. I personally feel strongly like I’m making progress on a worthwhile project and would like lots more time to carry it through, and if it doesn’t work out I have others in the pipeline. I continue to have ideas at a regular clip that I think are both important and obvious-in-hindsight, and to notice new

... (read more)

The basic reasons I expect AGI ruin

Andy_McKenzie2y50

I didn't realize you had put so much time into estimating take-off speeds. I think this is a really good idea.

This seems substantially slower than the implicit take-off speed estimates of Eliezer, but maybe I'm missing something.

I think the amount of time you described is probably shorter than I would guess. But I haven't put nearly as much time into it as you have. In the future, I'd like to.

Still, my guess is that this amount of time is enough that there are multiple competing groups, rather than only one. So it seems to me like there w... (read more)

6Daniel Kokotajlo2y

It is substantially slower than the takeoff speed estimates of Eliezer, yes. I'm definitely disagreeing with Eliezer on this point. But as far as I can tell my view is closer to Eliezer's than to Hanson's, at least in upshot. (I'm a bit confused about this--IIRC Hanson also said somewhere that takeoff would last only a couple of years? Then why is he so confident it'll be so broadly distributed, why does he think property rights will be respected throughout, why does he think humans will be able to retire peacefully, etc.?) I also think it's plausible that there will be multiple competing groups rather than one singleton AI, though not more than 80% plausible; I can easily imagine it just being one singleton. I think that even if there are multiple competing groups, however, they are very likely to coordinate to disempower humans. From the perspective of the humans it'll be as if they are an AI singleton, even though from the perspective of the AIs it'll be some interesting multipolar conflict (that eventually ends with some negotiated peaceful settlement, I imagine) After all, this is what happened historically with colonialism. Colonial powers (and individuals within conquistador expeditions) were constantly fighting each other.

Andy_McKenzie2y202

Thanks for writing this up as a shorter summary Rob. Thanks also for engaging with people who disagree with you over the years.

Here's my main area of disagreement:

General intelligence is very powerful, and once we can build it at all, STEM-capable artificial general intelligence (AGI) is likely to vastly outperform human intelligence immediately (or very quickly).

I don't think this is likely to be true. Perhaps it is true of some cognitive architectures, but not for the connectionist architectures that are the only known examples of human-like ... (read more)

1Decaeneus2y

Hi Andy - how are you gauging the likely relative proportions of AI capability sigmoidal curves relative to the current ceiling of human capability? Unless I'm misreading your position, it seems like you are presuming that the sigmoidal curves will (at least initially) top out at a level that is on the same order as human capabilities. What informs this prior? Due to the very different nature of our structural limitations (i.e. a brain that's not too big for a mother's hips to safely carry and deliver, specific energetic constraints, the not-very-precisely-directed nature of the evolutionary process) vs an AGI's system's limitations (which are simply different) it's totally unclear to me why we should expect the AGI's plateaus to be found at close-to-human levels.

Charlie Sanders2y1310

Agreed. A common failure mode in these discussions is to treat intelligence as equivalent to technological progress, instead of as an input to technological progress.

Yes, in five years we will likely have AIs that will be able to tell us exactly where it would be optimal to allocate our scientific research budget. Notably, that does not mean that all current systemic obstacles to efficient allocation of scarce resources will vanish. There will still be the same perverse incentive structure for funding allocated to scientific progress as there is toda... (read more)

Andy_McKenzie2y50

I can see how both Yudkowsky's and Hanson's arguments can be problematic because they either assume fast or slow takeoff scenarios, respectively, and then nearly everything follows from that. So I can imagine why you'd disagree with every one of Hanson's paragraphs based on that. If you think there's something he said that is uncorrelated with the takeoff speed disagreement, I might be interested, but I don't agree with Hanson about everything either, so I'm mainly only interested if it's also central to AI x-risk. I don't want you to waste your time. ... (read more)

4Daniel Kokotajlo2y

I think there are probably disagreements I have with Hanson that don't boil down to takeoff speeds disagreements, but I'm not sure. I'd have to reread the article again to find out. To be clear, I definitely don't expect takeoff to take hours or days. Quantitatively I expect something like what takeoffspeeds.com says when you input the values of the variables I mentioned above. So, eyeballing it, it looks like it takes slightly more than 3 years to go from 20% R&D automation to 100% R&D automation, and then to go from 100% R&D automation to "starting to approach the fundamental physical limits of how smart minds running on ordinary human supercomputers can be" in about 6 months, during which period about 8 OOMs of algorithmic efficiency is crossed. To be clear I don't take that second bit very seriously at all, I think this takeoffspeeds.com model is much better as a model of pre-AGI takeoff than of post-AGI takeoff. But I do think that we'll probably go from AGI to superintelligent AGI in less than six months. How long it takes to get to nanotech or (name your favorite cool sci-fi technology) is less clear to me, but I expect it to be closer to one year than ten, and possibly more like one month. I would love to discuss this more & read attempts to estimate these quantities.

To clarify, when I mentioned growth curves, I wasn't talking about timelines, but rather takeoff speeds.

In my view, rather than indefinite exponential growth based on exploiting a single resource, real-world growth follows sigmoidal curves, eventually plateauing. In the case of a hypothetical AI at a human intelligence level, it would face constraints on its resources allowing it to improve, such as bandwidth, capital, skills, private knowledge, energy, space, robotic manipulation capabilities, material inputs, cooling requirements, legal and regulat... (read more)

Daniel Kokotajlo2y142

I too was talking about takeoff speeds. The website I linked to is takeoffspeeds.com.

Me & the other LWers you criticize do not expect indefinite exponential growth based on exploiting a single resource; we are well aware that real-world growth follows sigmoidal curves. We are well aware of those constraints and considerations and are attempting to model them with things like the model underlying takeoffspeeds.com + various other arguments, scenario exercises, etc.

I agree that much of LW has moved past the foom argument and is solidly on Eliezers side r... (read more)

Andy_McKenzie2y*118

Here's a nice recent summary by Mitchell Porter, in a comment on Robin Hanson's recent article (can't directly link to the actual comment unfortunately):

Robin considers many scenarios. But his bottom line is that, even as various transhuman and posthuman transformations occur, societies of intelligent beings will almost always outweigh individual intelligent beings in power; and so the best ways to reduce risks associated with new intelligences, are socially mediated methods like rule of law, the free market (in which one is free to compete, but also

... (read more)

6Daniel Kokotajlo2y

Wait, how is it not how growth curves have worked historically? I think my position, which is roughly what you get when you go to this website and set the training requirements parameter to 1e30 and software returns to 2.5, is quite consistent with how growth has been historically, as depicted e.g. How Roodman's GWP model translates to TAI timelines - LessWrong (Also I resent the implication that SIAI/MIRI hasn't tended to directly engage with those arguments. The FOOM debate + lots of LW ink has been spilled over it + the arguments were pretty weak anyway & got more attention than they deserved)

Andy_McKenzie2y2-13

AIs can potentially trade with humans too though, that's the whole point of the post.

Especially if the AI's have architectures/values that are human brain-like and/or if humans have access to AI tools, intelligence augmentation, and/or whole brain emulation.

Also, it's not clear why AIs will find it easier to coordinate with one another than humans and humans or humans and AIs. Coordination is hard for game theoretic reasons.

These are all standard points, I'm not saying anything new here.

Why should we expect AIs to coordinate well?

Andy_McKenzie2y11-22

When you write "the AI" throughout this essay, it seems like there is an implicit assumption that there is a singleton AI in charge of the world. Given that assumption, I agree with you. But if that assumption is wrong, then I would disagree with you. And I think the assumption is pretty unlikely.

No need to relitigate this core issue everywhere, just thought this might be useful to point out.

8quetzal_rainbow2y

What's the difference? Multiple AIs can agree to split the universe and gains from disassembling biosphere/building Dyson sphere/whatever and forget to include humanity in negotiations. Unless preferences of AIs are diametrically opposed, they can trade.

7trevor2y

Why is the assumption of a unilateral AI unlikely? That's a very important crux, big if true, and it would be worth figuring out to explain it to people in fewer words so that more people will collide with it. In this post, So8res explicity states: This is well in line with the principle of instrumental convergence, and instrumental convergence seems to be a prerequisite for creating substantial amounts of intelligence. What we have right now is not-very-substantial amounts of intelligence, and hopefully we will only have not-very-substantial amounts of intelligence for a very long time, until we can figure out some difficult problems. But the problem is that a firm might develop substantial amounts of intelligence sooner instead of later.

Answer by Andy_McKenzieFeb 14, 2023104

I agree this is a very important point and line of research. This is how humans deal with sociopaths, after all.

Here’s me asking a similar question and Rob Bensinger’s response: https://www.lesswrong.com/posts/LLRtjkvh9AackwuNB/on-a-list-of-lethalities?commentId=J42Fh7Sc53zNzDWCd

One potential wrinkle is that in a very fast take off world AI’s could potentially coordinate very well because they would basically be the same, or close branches of the same AI.

How much is death a limit on knowledge accumulation?

Andy_McKenzie2y102

"Science advances one funeral at a time" -> this seems to be both generally not true as well as being a harmful meme (because it is a common argument used to argue against life extension research).

https://www.lesswrong.com/posts/fsSoAMsntpsmrEC6a/does-blind-review-slow-down-science

Schizophrenia as a deficiency in long-range cortex-to-cortex communication

Schizophrenia as a deficiency in long-range cortex-to-cortex communication

Interesting, thanks. All makes sense and no need to apologize. I just like it when people write/think about schizophrenia and want to encourage it, even as a side project. IMO, it's a very important thing for our society to think about.

Schizophrenia as a deficiency in long-range cortex-to-cortex communication

A lot of the quotes do find decreased connectivity, but some of them find increased connectivity between certain regions. It makes me think that there's a probability there might be something more complicated than just "increased or decreased", but rather specific types of connections. But that's just a guess, and I think an explanation across all cortical connections is more parsimonious and therefore more likely a priori.

Of your criteria of "things to explain", here are some thoughts:

4.1 The onset of schizophrenia is typically in the late-tee... (read more)

4Steven Byrnes2y

Update: there’s some discussion of antipsychotics in my follow-up post: Model of psychosis, take 2 :)

4Steven Byrnes2y

LOL I’m not focused on this at all. I think I’ve spent a whopping four days of my life thinking hard about schizophrenia—one day in 2021 that didn’t go anywhere, one day last summer where I read a bunch of papers and thought of this hypothesis and felt pretty good about it and then moved on to other things, then one more day like a week later to research and write the blindness + schizophrenia post, and yesterday to write this post. Schizophrenia not a significant personal or professional interest of mine. I am very impressed with myself for fooling you. Or maybe you’re just being polite. :) (Understanding schizophrenia is plausibly indirectly helpful for my professional interests, for various reasons. Also, I have a rule-of-thumb that if I can write a decent blog post in four hours, I should just do it, often it leads to unexpected good things!) Yeah the “things to explain” could have been more accurately titled “aspects of schizophrenia that I can easily think of right now, from either off the top of my head or skimming the wikipedia article”. :-P I think the cognitive deficits are very straightforwardly and naturally predicted by my hypothesis. I wrote something about nicotine but a different commenter said that what I wrote was flagrantly wrong. (I put a warning in the OP.) Guess I need to think about that more. Honestly, I don’t have a great understanding of what nicotine does to the brain in the first place. Something something acetylcholine :-P I haven’t looked into antipsychotics / neuroleptics, and agree that doing so would be an obvious next step, and indeed maybe I should have done it before posting this. Sorry. I’ll put it on my to-do list.

Transcript of Sam Altman's interview touching on AI safety

Interesting theory and very important topic.

I think the best data source here is probably neuroimaging. Here's a recent review: https://www.frontiersin.org/articles/10.3389/fnins.2022.1042814/full. Here are some quotes from that:

For functional studies, be they fluorodeoxyglucose positron emission tomography (FDG PET), rs-fMRI, task-based fMRI, diffusion tensor imaging (DTI) or MEG there generally is hypoactivation and disconnection between brain regions. ...
Histologically this gray matter reduction is accompanied by dendritic and synaptic densi

... (read more)

2Steven Byrnes2y

Thanks! Sorta. I kinda feel like my hypothesis (or something very close to it) really is an elegant explanation for everything about schizophrenia. Of course, I don’t know that—among other things, I don’t know everything about schizophrenia (obviously). I was writing this partly in the hopes that you or other commenters would tell me about aspects of schizophrenia that my hypothesis can’t explain, or contradicts, if such aspects exist. And then I can drop that hypothesis and find something better to believe. :) Do you think that anything in your excerpt contradicts my hypothesis? Seems to be almost entirely decreases in connectivity, right? That said, I don’t put much stock in functional connectivity comparisons anyway—e.g. you don’t really know if those regions are talking directly to each other vs correlated for some other reason, and even leaving that aside, you can’t disentangle what control-vs-SCZ difference is caused by direct physical connectivity differences versus “when schizophrenics are hanging out in the fMRI machine their wandering minds tend to be thinking about different things than when the control group people are hanging out in the fMRI machine”, or whatever. [I very generally find it quite hard to learn anything useful from neuroimaging data, compared to most other types of neuroscience data / evidence. But maybe that’s just me. You do you. :-) ]

My Model Of EA Burnout

Andy_McKenzie2y6-2

A quote I find relevant:

“A happy life is impossible, the highest thing that man can aspire to is a heroic life; such as a man lives, who is always fighting against unequal odds for the good of others; and wins in the end without any thanks. After the battle is over, he stands like the Prince in the re corvo of Gozzi, with dignity and nobility in his eyes, but turned to stone. His memory remains, and will be reverenced as a hero's; his will, that has been mortified all his life by toiling and struggling, by evil payment and ingratitude, is absorbed into Nirvana.” - Arthur Schopenhauer

Andy_McKenzie2y62

Good point.

I know your question was probably just rhetorical, but to answer it regardless -- I was confused in part because it would have made sense to me if he had said it would "better" if AGI timelines were short.

Lots of people want short AGI timelines because they think the alignment problem will be easy or otherwise aren't concerned about it and they want the perceived benefits of AGI for themselves/their family and friends/humanity (eg eliminating disease, eliminating involuntary death, abundance, etc). And he could have just said "better... (read more)

Transcript of Sam Altman's interview touching on AI safety

Andy_McKenzie2y30

One of the main counterarguments here is that the existence of multiple AGIs allows them to compete with one another in ways that could benefit humanity. E.g. policing one another to ensure alignment of the AGI community with human interests. Of course, whether this actually would outweigh your concern in practice is highly uncertain and depends on a lot of implementation details.

Transcript of Sam Altman's interview touching on AI safety