I explicitly asked Anthropic whether they had a policy of not releasing models significantly beyond the state of the art. They said no, and that they believed Claude 3 was noticeably beyond the state of the art at the time of its release.
The situation at Zaporizhzhia (currently) does not seem to be an impending disaster. The fire is/was in an administrative building. Fires at nuclear power plants can be serious, but the reactor buildings are concrete and would not easily catch fire due to nearby shelling or other external factors.
Some click-seekers on Twitter have made comparisons to Chernobyl. That kind of explosion cannot happen accidentally at Zaporizhzhia (it's a safer power plant design with sturdy containment structures surrounding the reactors). If the Russians wanted to cause a mas...
Sounds like something GPT-3 would say...
Alternatively, aging (like most non-discrete phenotypes) may be omnigenic.
Thanks for posting this, it's an interesting idea.
I'm curious about your second-to-last paragraph: if our current evidence already favored SSA or SIA (for instance, if we knew that an event occurred in the past that had a small chance of creating a huge number of copies of each human, but we also knew that we are not copies), wouldn't that already have been enough to update our credence in SSA or SIA? Or did you mean that there's some other category of possible observations, which is not obviously evidence one way or the other, but which under this UDT framework we could still use to make an update?
I'm curious who is the target audience for this scale...
People who have an interest in global risks will find it simplistic--normally I would think of the use of a color scale as aimed at the general public, but in this case it may be too simple even for the curious layman. The second picture you linked, on the other hand, seems like a much more useful way to categorize risks (two dimensions, severity vs urgency).
I think this scale may have some use in trying to communicate to policy makers who are unfamiliar with the landscape of GCRs, and in parti...
Note also that non-alphanumeric symbols are hard to google. I kind of guessed it from context but couldn't confirm until I saw Kaj's comment.
Separately, and more importantly, the way links are currently displayed makes it hard to tell whether a link has already been visited. Also, if you select text, you can't see the links anymore.
Firefox 57 on Windows 10.
I am encountering some kind of error when opening the links here to rationalsphere and single conversational locus. When I open them, a box pops up that says "Complete your profile" and asks me to enter my email address (even though I used my email to log in in the first place). When I type it in and press submit, I get the error: {"id":"app.mutation_not_allowed","value":"\"usersEdit\" on _id \"BSRa9LffXLw4FKvTY\""}
I think this is an excellent approach to jargon and I appreciate the examples you've given. There is too much tendency, I think, for experts in a field to develop whatever terminology makes their lives easiest (or even in some cases makes them "sound smart") without worrying about accessibility to newcomers.
... but maybe ideally hints at a broader ecosystem of ideas
This sounds useful, but very hard to do in practice... do you know of a case where it's successful?
Thanks for posting!
I haven't read your book yet but I find your work pretty interesting. I hope you won't mind a naive question... you've mentioned non-sunlight-dependent foods like mushrooms and leaf tea. Is it actually possible for a human to survive on foods like this? Has anybody self-experimented with it?
By my calculation, a person who needs 1800 kcals/day would have to eat about 5 kg of mushrooms. Tea (the normal kind, anyway) doesn't look any better.
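Roughly, the arithmetic looks like this (assuming about 35 kcal per 100 g of fresh mushrooms, which is in the right ballpark for many edible varieties but is my assumption, not a figure from the book):

```python
# Back-of-the-envelope check (assumes ~35 kcal per 100 g of fresh mushrooms).
daily_need_kcal = 1800
mushroom_kcal_per_kg = 350  # assumed energy density; varies by species

kg_per_day = daily_need_kcal / mushroom_kcal_per_kg
print(f"{kg_per_day:.1f} kg of mushrooms per day")  # ~5.1 kg
```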
Bacteria fed by natural gas seems like a very promising food source--and one that might even be viab...
You are assuming that all rational strategies are identical and deterministic. In fact, you seem to be using "rational" as a stand-in for "identical", which reduces this scenario to the twin PD. But imagine a world where everyone makes use of the type of superrationality you are positing here--basically, everyone assumes people are just like them. Then any one person who switches to a defection strategy would have a huge advantage. Defecting becomes the rational thing to do. Since everybody is rational, everybody switches to defecting--because this is just a standard one-shot PD. You can't get the benefits of knowing the opponent's source code unless you know the opponent's source code.
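To make the defection incentive concrete, here is a minimal sketch with conventional one-shot PD payoffs (the specific numbers are illustrative assumptions):

```python
# One-shot Prisoner's Dilemma with conventional illustrative payoffs (T > R > P > S).
# payoff[my_move][their_move] = my payoff
payoff = {
    "C": {"C": 3, "D": 0},  # cooperate: R = 3 if they cooperate, S = 0 if they defect
    "D": {"C": 5, "D": 1},  # defect:    T = 5 if they cooperate, P = 1 if they defect
}

for their_move in ("C", "D"):
    best = max(("C", "D"), key=lambda my_move: payoff[my_move][their_move])
    print(f"If the opponent plays {their_move}, my best reply is {best}")

# Defect is the better reply in both cases, so once everyone else is merely
# assumed to play "just like me" without verification, a lone defector wins.
```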
The first section is more or less the standard solution to the open source prisoner's dilemma, and the same as what you would derive from a logical decision theory approach, though with different and less clear terminology than what is in the literature.
The second section, on application to human players, seems flawed to me (as does the claim that it applies to superintelligences who cannot see each other's source code). You claim the following conditions are necessary:
1. A and B are rational
2. A and B know each other's preferences
3. They are each aware of 1
I think many of us "rationalists" here would agree that rationality is a tool for assessing and manipulating reality. I would say much the same about morality. There's not really a dichotomy between morality being "grounded on evolved behavioral patterns" and having "a computational basis implemented somewhere in the brain and accessed through the conscious mind as an intuition". Rather, the moral intuitions we have are computed in our brains, and the form of that computation is determined both by the selection pressures of ev...
I think this is an interesting and useful view, if applied judiciously. In particular, it will always tend to be most relevant for crony beliefs--beliefs that affect the belief-holder's life mainly through other people's opinions of them, like much of politics and some of religion. When it comes to close-up stuff that can cause benefit or harm directly, you will find that most people really do have a model of the world. When you ask someone whether so-and-so would make a good president, the answer is often a signal about their cultural affiliations. Ask th...
This doesn't actually seem to match the description. They only talk about having used one laser, with two stakes, whereas your diagram requires using two lasers. Your setup would be quite difficult to achieve, since you would somehow have to get both lasers perfectly horizontal; I'm not sure a standard laser level would give you this kind of precision. In the version they describe, they level the laser by checking the height of the beam on a second stake. This seems relatively easy.
My guess is they just never did the experiment, or they lied about the result. But it would be kind of interesting to repeat it sometime.
Thanks, that's an interesting perspective. I think even high-level self-modification can be relatively safe with sufficient asymmetry in resources--simulated environments give a large advantage to the original, especially if the successor can be started with no memories of anything outside the simulation. Only an extreme difference in intelligence between the two would overcome that.
Of course, the problem of transmitting values to a successor without giving it any information about the world is a tricky one, since most of the values we care about are linked to reality. But maybe some values are basic enough to be grounded purely in math that applies to any circumstances.
If visible precommitment by B requires it to share the source code for its successor AI, then it would also be giving up any hidden information it has. Essentially both sides have to be willing to share all information with each other, creating some sort of neutral arbitration about which side would have won and at what cost to the other. That basically means creating a merged superintelligence is necessary just to start the bargaining process, since they each have to prove to the other that the neutral arbiter will control all relevant resources to preven...
I've read a couple of Lou Keep's essays in this series and I find his writing style very off-putting. It seems like there's a deep idea about society and social-economic structures buried in there, but it's obscured by a hodgepodge of thesis-antithesis and vague self-reference.
As best I can tell, his point is that irrational beliefs like belief in magic (specifically, protection from bullets) can be useful for a community (by encouraging everyone to resist attackers together) even though it is not beneficial to the individual (since it doesn't prevent deat...
Are you talking about a local game in NY or a correspondence thing?
I like the first idea. But can we really guarantee that after changing its source code to give itself maximum utility, it will stop all other actions? If it has access to its own source code, what ensures that its utility is "maximum" when it can change the limit arbitrarily? And if all possible actions have the same expected utility, an optimizer could output any solution--"no action" would be the trivial one but it's not the only one.
An AI that has achieved all of its goals might still be dangerous, since it would presumably lose all ...
It seems like the ideal leisure activities, then, should combine the social games with games against nature. Sports do this to some extent, but the "game against nature" part is mostly physical rather than intellectual.
Maybe we could improve on that. I'm envisioning some sort of combination of programming and lacrosse, where the field reconfigures itself according to the players' instructions with a 10-second delay...
But more realistically, certain sports are more strategic and intellectual than others. I've seen both tennis and fencing mentione...
AI is good at well-defined strategy games, but (so far) bad at understanding and integrating real-world constraints. I suspect that there are already significant efforts to use narrow AI to help humans with strategic planning, but that these remain secret. For an AGI to defeat that sort of human-computer combination would require considerably superhuman capabilities, which means without an intelligence explosion it would take a great deal of time and resources.
More like driving to the store and driving into the brick wall of the store are adjacent in design space.
Yes, many people intuitively feel that a universe of pleasure and a universe of pain add to a net negative. But I suspect that's just a result of experiencing (and avoiding) lots of sources of extreme pain in our lives, while sources of pleasure tend to be diffuse and relatively rare. The human experience of pleasure is conjunctive because in order to survive and reproduce you must fairly reliably avoid all types of extreme pain. But in a pleasure-maximizing environment, removing pain will be a given.
It's also true that our brains tend to adapt to pleasure over time, but that seems simple to modify once physiological constraints are removed.
Human disutility includes more than just pain too. Destruction of humanity (the flat plain you describe) carries a great deal of negative utility for me, even if I disappear without feeling any pain at all. There's more disutility if all life is destroyed, and more if the universe as a whole is destroyed... I don't think there's any fundamental asymmetry. Pain and pleasure are the most immediate ways of affecting value, and probably the ones that can be achieved most efficiently in computronium, so external states probably don't come into play much at all if you take a purely utilitarian view.
I'm not sure what you mean here by risk aversion. If it's not loss aversion, and it's not due to decreasing marginal value, what is left?
Would you rather have $5 than a 50% chance of getting $4 and a 50% chance of getting $7? That, to me, sounds like the kind of risk aversion you're describing, but I can't think of a reason to want that.
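Spelled out, the gamble is worth more in expectation than the sure $5:

$$0.5 \times \$4 + 0.5 \times \$7 = \$5.50 > \$5.00$$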
You will not bet on just one side, you mean. You already said you'll take both bets because of the guaranteed win. But unless your credence is quite precisely 50%, you could increase your expected value over that status quo (guaranteed $1) by choosing NOT to take one of the bets. If you still take both, or if you now decide to take neither, it seems clear that loss aversion is the reason (unless the amounts are so large that decreasing marginal value has a significant effect).
True, you're sure to make money if you take both bets. But if you think the probability is 51% on odd rather than 50%, you make a better expected value by only taking one side.
Let's reverse this and see if it makes more sense. Say I give you a die that looks normal, but you have no evidence about whether it's fair. Then I offer you a two-sided bet: I'll bet $101 to your $100 that it comes up odd. I'll also offer $101 to your $100 that it comes up even. Assuming that transaction costs are small, you would take both bets, right?
If you had even a small reason to believe that the die was weighted towards even numbers, on the other hand, you would take one of those bets but not the other. So if you take both, you are exhibiting a probability estimate of exactly 50%, even though it is "uncertain" in the sense that it would not take much evidence to move that estimate.
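Here's a minimal sketch of that arithmetic, using the stakes above ($101 against $100 on each side) and a 51% credence on odd:

```python
# Expected value of taking both sides vs. only the favorable one,
# at stakes of $101 (theirs) to $100 (yours) on each side of the die bet.
p_odd = 0.51  # your credence that the die comes up odd

ev_win_on_odd = p_odd * 101 - (1 - p_odd) * 100        # +$2.51
ev_win_on_even = (1 - p_odd) * 101 - p_odd * 100       # -$1.51
ev_both = ev_win_on_odd + ev_win_on_even               # +$1.00, and it's guaranteed

print(f"Only the bet that pays on odd: {ev_win_on_odd:+.2f}")
print(f"Both bets (the sure thing):    {ev_both:+.2f}")
# With any credence other than exactly 50%, dropping the unfavorable side
# beats the guaranteed $1.
```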
Gasoline is an excellent example of this behavior. It consists of a mixture of many different non-polar hydrocarbons with varying densities, some of which would be gaseous outside of solution. It stays mixed indefinitely (assuming you don't let the volatile parts escape) because separation would require a reduction in entropy.
It seems like there's also an issue with risk aversion. In regular betting markets there are enough bets that you can win some and lose some, and the risks can average out. But if you bet substantially on x-risks, you will get only one low-probability payout. Even if you assume you'll actually get that one (relatively large) payout, the marginal value will be greatly decreased. To avoid that problem, people will only be willing to bet small amounts on x-risks. The people betting against them, though, would be willing to make a variety of large bets (each with low payoff) and thereby carry almost no risk.
I guess where we disagree is in our view of how a simulation would be imperfect. You're envisioning something much closer to a perfect simulation, where slightly incorrect boundary conditions would cause errors to propagate into the region that is perfectly simulated. I consider it more likely that if a simulation has any interference at all (such as rewinding to fix noticeable problems) it will be filled with approximations everywhere. In that case the boundary condition errors aren't so relevant. Whether we see an error would depend mainly on whether there are any (which, like I said, is equivalent to asking whether we are "in" a simulation) and whether we have any mechanism by which to detect them.
If it is the case that we are in a "perfect" simulation, I would consider that no different than being in a non-simulation. The concept of being "in a simulation" is useful only insofar as it predicts some future observation. Given the various multiverses that are likely to exist, any perfect simulation an agent might run is probably just duplicating a naturally-occurring mathematical object which, depending on your definitions, already "exists" in baseline reality.
The key question, then, is not whether some simulation of us ...
Does anybody think this will actually help with existential risk? I suspect the goal of "keeping up" or preventing irrelevance after the onset of AGI is pretty much a lost cause. But maybe if it makes people smarter it will help us solve the control problem in time.
I just tried this out for a project I'm doing at work, and I'm finding it very useful--it forces me to think about possible failure modes explicitly and then come up with specific solutions for them, which I guess I normally avoid doing.
Encrypting/obscuring it does help a little bit, but doesn't eliminate the problem, so it's not just that.
I agree with that... personally I have tried several times to start a private journal, and every time I basically end up failing to write down any important thoughts because I am inhibited by the mental image of how someone else might interpret what I write--even though in fact no one will read it. Subconsciously it seems much more "defensible" to write nothing at all, and therefore effectively leave my thoughts unexamined, than to commit to having thought something that might be socially unacceptable.
I've been trying to understand the differences between TDT, UDT, and FDT, but they are not clearly laid out in any one place. The blog post that went along with the FDT paper sheds a little bit of light on it--it says that FDT is a generalization of UDT intended to capture the shared aspects of several different versions of UDT while leaving out the philosophical assumptions that typically go along with it.
That post also describes the key difference between TDT and UDT by saying that TDT "makes the mistake of conditioning on observations" which ...
It does seem like a past tendency to overbuild things is the main cause. Why are the pyramids still standing five thousand years later? Because the only way they knew to build a giant building back then was to make it essentially a squat mound of solid stone. If you wanted to build a pyramid the same size today you could probably do it for 1/1000 of the cost but it would be hollow and it wouldn't last even 500 years.
Even when cars were new they couldn't be overbuilt the way buildings were in prehistory because they still had to be able to move themselves ...
Agreed. There are plenty of liberal views that reject certain scientific evidence for ideological reasons--I'll refrain from examples to avoid getting too political, but it's not a one-sided issue.
This may be partially what has happened with "science" but in reverse. Liberals used science to defend some of their policies, conservatives started attacking it, and now it has become an applause light for liberals--for example, the "March for Science" I keep hearing about on Facebook. I am concerned about this trend because the increasing politicization of science will likely result in both reduced quality of science (due to bias) and decreased public acceptance of even those scientific results that are not biased.
Interesting piece. It seems like coming up with a good human-checkable way to evaluate parsing is pretty fundamental to the problem. You may have noticed already, but Ozora is the only one that didn't figure out that "easily" goes with "parse".
The idea that friendly superintelligence would be massively useful is implicit (and often explicit) in nearly every argument in favor of AI safety efforts, certainly including EY and Bostrom. But you seem to be making the much stronger claim that we should therefore altruistically expend effort to accelerate its development. I am not convinced.
Your argument rests on the proposition that current research on AI is so specific that its contribution toward human-level AI is very small, so small that the modest efforts of EAs (compared to all the massive corpor...
I haven't seen any feminists addressing that particular argument (most are concerned with cultural issues rather than genetic ones) but my initial sense is something like this: a successful feminist society would have 1) education and birth control easily available to all women, and 2) a roughly equal division of the burden of child-rearing between men and women. These changes will remove most of the current incentives that seem likely to cause a lower birth rate among feminists than non-feminists. Of course, it could remain true that feminists tend to be ...
I would argue that the closest real-world analogue is computer hacking. It is a rare ability, but it can bestow a large amount of power on an individual who puts in enough effort and skill. Like magic, it requires almost no help from anyone else. The infrastructure has to be there, but since the infrastructure isn't designed to allow hacking, having the infrastructure doesn't make the ability available to everyone who can pay (like, say, airplanes). If you look at the more fantasy-style sci-fi, science is often treated like magic--one smart scientist can do all sorts of cool stuff on their own. But it's never plausible. With hacking, that romanticization isn't nearly as far from reality.
It seems like the key problem described here is that coalitions of rational people, when they form around scientific propositions, cause the group to become non-scientific out of desire to support the coalition. The example that springs to my mind is climate change, where there is social pressure for scientific-minded people (or even those who just approve of science) to back the rather specific policy of reducing greenhouse gas emissions rather than to probe other aspects of the problem or potential solutions and adaptations.
I wonder if we might solve pro...
Hi Jared, your question about vegetarianism is an interesting one, and I'll give a couple of responses because I'm not sure exactly what direction you're coming from.
I think there's a strong rationalist argument in favor of limiting consumption of meat, especially red meat, on both health and environmental grounds. These issues get more mixed when you look at moderate consumption of chicken or fish. Fish especially is the best available source of healthy fats, so leaving it out entirely is a big trade-off, and the environmental impact of fishing varies a g...
The attempt to analytically model the recalcitrance of Bayesian inference is an interesting idea, but I'm afraid it leaves out some of the key points. Reasoning is not just repeated applications of Bayes' theorem. If it were, everyone would be equally smart except for processing speed and data availability. Rather, the key element is in coming up with good approximations for P(D|H) when data and memory are severely limited. This skill relies on much more than a fast processor, including things like simple but accurate models of the rest of the world, or kn...
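For reference, the update step in question is just Bayes' theorem; the hard part in practice is the likelihood term P(D|H):

$$P(H \mid D) = \frac{P(D \mid H)\,P(H)}{P(D)}$$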
And to elaborate a little bit (based on my own understanding, not what they told me) their RSP sort of says the opposite. To avoid a "race to the bottom" they base the decision to deploy a model on what harm it can cause, regardless of what models other companies have released. So if someone else releases a model with potentially dangerous capabilities, Anthropic can't/won't use that as cover to release something similar that they wouldn't have released otherwise. I'm not certain whether this is the best approach, but I do think it's coherent.