Rob Bensinger

Communications @ MIRI. Unless otherwise indicated, my posts and comments here reflect my own views, and not necessarily my employer's. (Though we agree about an awful lot.)

Sequences

2022 MIRI Alignment Discussion

2021 MIRI Conversations

Naturalized Induction

Posts

Sorted by New

21Rob B's Shortform Feed

54MIRI Newsletter #123

25d

98MIRI’s 2024 End-of-Year Update

8mo

195Response to Aschenbrenner's "Situational Awareness"

146When is a mind me?

132

142AI Views Snapshots

91An artificially structured argument for expecting AGI ruin

70AGI ruin mostly rests on strong claims about alignment and deployment, not about society

189The basic reasons I expect AGI ruin

137Four mindset disagreements behind existential risk disagreements in ML

83Yudkowsky on AGI risk on the Bankless podcast

Wikitag Contributions

(+336)

(+498/-274)

(+255/-221)

(+620/-372)

(+47/-113)

(+2464/-1467)

(+5773/-1290)

Functional Decision Theory

(+17/-17)

Comments

Sorted by

Newest

TurnTrout's shortform feed

Rob Bensinger1mo10

yeah, I left off this part but Nate also said

[people having trouble separating them] does maybe enhance my sense that the whole community is desperately lacking in nate!courage, if so many people have such trouble distinguishing between "try naming your real worry" and "try being brazen/rude". (tho ofc part of the phenomenon is me being bad at anticipating reader confusions; the illusion of transparency continues to be a doozy.)

TurnTrout's shortform feed

Rob Bensinger1mo214

Nate messaged me a thing in chat and I found it helpful and asked if I could copy it over:

fwiw a thing that people seem to me to be consistently missing is the distinction between what i was trying to talk about, namely the advice "have you tried saying what you actually think is the important problem, plainly, even once? ideally without broadcasting signals of how it's a socially shameful belief to hold?", and the alternative advice that i was not advocating, namely "have you considered speaking to people in a way that might be described as 'brazen' or 'rude' depending on who's doing the describing?".
for instance, in personal conversation, i'm pretty happy to directly contradict others' views -- and that has nothing to do with this 'courage' thing i'm trying to describe. nate!courage is completely compatible with saying "you don't have to agree with me, mr. senator, but my best understanding of the evidence is [thing i believe]. if ever you're interested in discussing the reasons in detail, i'd be happy to. and until then, we can work together in areas where our interests overlap." there are plenty of ways to name your real worry while being especially respectful and polite! nate!courage and politeness are nearly orthogonal axes, on my view.

TurnTrout's shortform feed

Rob Bensinger1mo104

FWIW, as someone who's been working pretty closely with Nate for the past ten years (and as someone whose preferred conversational dynamic is pretty warm-and-squishy), I actively enjoy working with the guy and feel positive about our interactions.

A case for courage, when speaking of AI danger

Rob Bensinger1mo*1913

(Considering how little cherry-picking they did.)

From my perspective, FWIW, the endorsements we got would have been surprising even if they had been maximally cherry-picked. You usually just can't find cherries like those.

New Endorsements for “If Anyone Builds It, Everyone Dies”

Rob Bensinger1mo113

(That was indeed my first thought when Bernanke said he liked the book; no dice, though.)

New Endorsements for “If Anyone Builds It, Everyone Dies”

Rob Bensinger1mo3019

Yep. And equally, the blurbs would be a lot less effective if the title were more timid and less stark.

Hearing that a wide range of respected figures endorse a book called If Anyone Builds It, Everyone Dies: Why Superhuman AI Would Kill Us All is a potential "holy shit" moment. If the same figures were endorsing a book with a vaguely inoffensive title like Smarter Than Us or The AI Crucible, it would spark a lot less interest (and concern).

New Endorsements for “If Anyone Builds It, Everyone Dies”

Rob Bensinger1mo1511

Yeah, I think people usually ignore blurbs, but sometimes blurbs are helpful. I think strong blurbs are unusually likely to be helpful when your book has a title like If Anyone Builds It, Everyone Dies: Why Superhuman AI Would Kill Us All.

New Endorsements for “If Anyone Builds It, Everyone Dies”

Rob Bensinger1mo152

Aside from the usual suspects (people like Tegmark), we mostly sent the book to people following the heuristic "would an endorsement from this person be helpful?", much more so than "do we know that this person would like the book?". If you'd asked me individually about Church, Schneier, Bernanke, Shanahan, or Spaulding in advance, I'd have put most of my probability on "this person won't be persuaded by the book (if they read it at all) and will come away strongly disagreeing and not wanting to endorse". They seemed worth sharing the book with anyway, and then they ended up liking it (at least enough to blurb it) and some very excited MIRI slack messages ensued.

(I'd have expected Eddy to agree with the book, though I wouldn't have expected him to give a blurb; and I didn't know Wolfsthal well enough to have an opinion.)

Nate has a blog post coming out in the next few days that will say a bit more about "How filtered is this evidence?" (along with other topics), but my short answer is that we haven't sent the book to that many people, we've mostly sent it to people whose AI opinions we didn't know much about (and who we'd guess on priors would be skeptical to some degree), and we haven't gotten many negative reactions at all. (Though we've gotten people who just didn't answer our inquiries, and some of those might have read the book and disliked it enough to not reply.)

New Endorsements for “If Anyone Builds It, Everyone Dies”

Rob Bensinger1mo3927

Now, how much is that evidence about the correctness of the book? Extremely little!

It might not be much evidence for LWers, who are already steeped in arguments and evidence about AI risk. It should be a lot of evidence for people newer to this topic who start with a skeptical prior. Most books making extreme-sounding (conditional) claims about the future don't have endorsements from Nobel-winning economists, former White House officials, retired generals, computer security experts, etc. on the back cover.

New Endorsements for “If Anyone Builds It, Everyone Dies”

Rob Bensinger1mo100

We're still working out some details on the preorder events; we'll have an announcement with more info on LessWrong, the MIRI Newsletter, and our Twitter in the next few weeks.

You don't have to do anything special to get invited to preorder-only events. :) In the case of Nate's LessOnline Q&A, it was a relatively small in-person event for LessOnline attendees who had preordered the book; the main events we have planned for the future will be larger and online, so more people can participate without needing to be in the Bay Area.

(Though we're considering hosting one or more in-person events at some point in the future; if so, those would be advertised more widely as well.)