LessWrong team member / moderator. I've been a LessWrong organizer since 2011, with roughly equal focus on the cultural, practical and intellectual aspects of the community. My first project was creating the Secular Solstice and helping groups across the world run their own version of it. More recently I've been interested in improving my own epistemic standards and helping others to do so as well.
Subcruxes of mine here:
Do those feel like subcruxes for you, or are there other ones?
Is there a decent chance an AI takeover is relatively nice?
> This is an existential catastrophe IMO and should be desperately avoided, even if they do leave us a solar system or w/e.
Actually, I think this maybe wasn't cruxy for anyone. I think @ryan_greenblatt said he agreed it didn't change the strategic picture, just some background expectations.
(I maybe don't believe him that he doesn't think it affects the strategic picture? It seemed like his view was fairly sensitive to various things being like 30% likely instead of like 5% or <1%, and it feels like it's part of an overall optimistic package that adds up to being more willing to roll the dice on current proposals? But, I'd probably believe him if he reads his paragraph and is like "I have thought about whether this is a (maybe subconscious) motivation/crux and am confident it isn't.")
If the international governing body starts approving AI development, then aren't we basically just back in the Plan A regime?
I think MIRI's plan is clearly meant to eventually build superintelligence, given that they've stated various times it'd be an existential catastrophe if this never happened – they just think it should happen after a lot of augmentation and carefulness.
A lot of my point here is I just don't really see much difference between Plan A and Shutdown except for "once you've established some real control over AI racing, what outcome are you shooting for near-term?", and I'm confused why Plan A advocates see it as substantially different.
(Or, I think the actual differences are more about "how you expect it to play out in practice, esp. if MIRI-style folk end up being a significant political force." Which is maybe fair, but, it's not about the core proposal IMO.)
"We wouldn't want to pause 30 years, and then do a takeoff very quickly – it's probably better to do a smoother takeoff."
> huh, this one seems kinda relevant to me.
Do you understand why I don't understand why you think that? Like, the MIRI plan is clearly aimed at eventually building superintelligence (I realize the literal treaty doesn't emphasize that, but, it's clear from very public writing in IABIED that it's part of the goal), and I think it's pretty agnostic over exactly how that shakes out.
You... could publish it as a top-level linkpost!
Here's an attempt to recap the previous discussion about "Global Shutdown" vs "Plan A/Controlled Takeoff", trying to skip ahead to the part where we're moving the conversation forward rather than rehashing stuff.
Cruxes that seemed particularly significant (phrased the way they made most sense to me, which is hopefully reasonably ITT passing)
...
How bad is Chinese Superintelligence? For some people, it's a serious crux whether a China-run superintelligence would be dramatically worse in outcome than one run by a democratic country.
...
"The gameboard could change in all kinds of bad ways over 30 years." Nations or companies could suddenly pull out in a disastrous way. If things go down in the near future there's fewer actors to make deals with and it's easier to plan things out.
...
Can we leverage useful work out of significantly-more-powerful-but-nonsuperhuman AIs? Especially since "the gameboard might change a lot", it's useful to get lots of safety research done quickly, and it's easier to do that with more powerful AIs. So, it's useful to continue scaling up until we've got the most powerful AIs we can confidently control. (Whereas Controlled Takeoff skeptics tend to think an AI capable of taking on the hard parts of AI safety research will already be too dangerous and untrustworthy.)
...
Is there a decent chance an AI takeover is relatively nice? Giving the humans the Earth/solar system is just incredibly cheap from a percentage-of-resources standpoint. This does require the AI to genuinely care about and respect our agency in a sort of complete way. But, it only has to care about us a pretty teeny amount.
...
And then, the usual "how doomed are current alignment plans?". My impression is "Plan A" advocates are usually expecting a pretty good chance things go pretty well if humanity is making like a reasonably good faith attempt at controlled takeoff, whereas Controlled Takeoff skeptics are typically imagining "by default this just goes really poorly, you can tell because everyone seems to keep sliding off understanding or caring about the hard parts of the problem".
...
All of those seem like reasonable things for smart, thoughtful people to disagree on. I do think some disagreement about them feels fishy/sus to me, and I have my takes on them, but, I can see where you're coming from.
Three cruxes I still just don't really buy as decision-relevant:
Note: you can get a nice-to-read version of the Treaty at https://www.ifanyonebuildsit.com/treaty. I'm not sure if there are any notable differences between that and the paper, but I'm guessing it's mostly the same.
I'm curious about how the specifics worked in your case.
I think it's been surprisingly close to 50/50: when I specifically flinch away from an idea because it felt impossible, about half the time it turns out to be "actually pretty impossible" and about half the time "okay, actually sort of straightforward if I were trying all the obvious things".
Obviously, if I systematically list out impossible things, there will be way more actually-pretty-impossible things. But somehow, when it actually comes up (sampled from "things I actively wanted to do"), the split is closer to even. Maybe if I got better at dreaming impossible thoughts, more of them would turn out to actually be impossible.
I'm curious about the origin story of you using the Bosnia example here. Is that a history bit you already knew and randomly realized was analogous to your situation? Did you learn the Bosnia example recently and then go "oh, that was kinda like situation X", or did you want to write about situation X and then poke around for a real-world example?
(I just find myself wondering because I don't recall you using that class of example very often and was intrigued by the process; no particular follow-up implication or questions.)
Also, nice post! I think this is one of my favorite Inkhaven posts that I've read.
I feel so happy that "what's your crux?" / "is that cruxy" is common parlance on LW now; it is a meaningful improvement over the prior discourse. Thank you, CFAR and whoever was part of the generation story of that.