Epistemic status: Shower thoughts, not meant to be rigorous. There seems to be a fundamental difference, one that is difficult to reconcile, between how I (and perhaps others) think about AI risks and the dominant narrative on LessWrong (hereafter the "dominant narrative"). The dominant narrative is that...
For the Open Philanthropy AI Worldview Contest.

Executive Summary

* Conditional on AGI being developed by 2070, we estimate the probability that humanity will suffer an existential catastrophe due to loss of control (LoC) over an AGI system at ~6%, using a quantitative scenario model that attempts to systematically assess...
This post was written during Refine. Thanks to Jonathan Low, Linda Linsefors, Koen Holtman, Aaron Scher, and Nicholas Kees Dupuis for helpful discussion and feedback. Disclaimer: This post reflects my current understanding of the field and may not be an accurate representation of it. Feel free to comment if you...
This post was written as part of Refine. Thanks to Adam Shimi, Alexander Gietelink Oldenziel, and Vanessa Kosoy for helpful discussion and feedback.

Summary

This post aims to:
* Advocate for embedding safety into the development of machine learning models
* Propose a framing for how to think about safety, where...
Thanks to plex for co-authoring the post (co-authors are currently not reflected on the EA Forum when crossposted from LessWrong).

[Image: The AI Safety Communities logo, by DALL-E 2]

The AI safety field has been growing rapidly over the last few years, and more and more communities have been sprouting up all...