steven0461

If we get things right, AI could have huge benefits

There is no contradiction between AI carrying huge potential risks, and it carrying huge potential upsides if we navigate the risks. Both are a consequence of the prospect of AI becoming extremely powerful. The benefits that human-aligned AGI could bring are a major part of what motivates researchers to build...

Jun 26, 20255

Advanced AI is a big deal even if we don’t lose control

by Algon, steven0461, and Vishakha

Context: This is a linkpost for https://aisafety.info/questions/NM3G/10:-Advanced-AI-is-a-big-deal-even-if-we-don%E2%80%99t-lose-control This is an article in the new intro to AI safety series from AISafety.info. We'd appreciate any feedback. The most up-to-date version of this article is on our website. So far, we’ve discussed one class of possible consequences of advanced AI: systems ending...

Jun 26, 20258

Defeat may be irreversibly catastrophic

by Vishakha, Algon, and steven0461

Context: This is a linkpost for https://aisafety.info/questions/NM3P/9:-Defeat-may-be-irreversibly-catastrophic This is an article in the new intro to AI safety series from AISafety.info. We'd appreciate any feedback. The most up-to-date version of this article is on our website. When you imagine a global catastrophe, maybe the kind of event that comes to...

Jun 26, 20255

AI can win a conflict against us

by Algon, steven0461, and Vishakha

Context: This is a linkpost for https://aisafety.info/questions/NM3O/8:-AI-can-win-a-conflict-against-us This is an article in the new intro to AI safety series from AISafety.info. We'd appreciate any feedback. The most up-to-date version of this article is on our website. Suppose an AI has realized that controlling the world would let it achieve its...

Jun 19, 20256

Different goals may bring AI into conflict with us

by Algon, steven0461, and Vishakha

Context: This is a linkpost for https://aisafety.info/questions/NM3H/7:-Different-goals-may-bring-AI-into-conflict-with-us This is an article in the new intro to AI safety series from AISafety.info. We'd appreciate any feedback. The most up-to-date version of this article is on our website. Aligning the goals of AI systems with our intentions could be really hard. So...

Jun 19, 20255

AI’s goals may not match ours

by Algon, steven0461, and Vishakha

Context: This is a linkpost for https://aisafety.info/questions/NM3I/6:-AI%E2%80%99s-goals-may-not-match-ours This is an article in the new intro to AI safety series from AISafety.info. We'd appreciate any feedback. The most up-to-date version of this article is on our website. Making AI goals match our intentions is called the alignment problem. There’s some ambiguity...

May 28, 202514

AI may pursue goals

by Algon, steven0461, and Vishakha

Context: This is a linkpost for https://aisafety.info/questions/NM3J/5:-AI-may-pursue-goals This is an article in the new intro to AI safety series from AISafety.info. We'd appreciate any feedback. The most up-to-date version of this article is on our website. Suppose that, as argued previously, in the next few decades we’ll have superintelligent systems....

May 28, 202513

steven0461

steven0461

Eliezer Yudkowsky Facts

Stampy's AI Safety Info soft launch

Bayesian Adjustment Does Not Defeat Existential Risk Charity

AISafety.info "How can I help?" FAQ

steven0461

Eliezer Yudkowsky Facts

Stampy's AI Safety Info soft launch

Bayesian Adjustment Does Not Defeat Existential Risk Charity

AISafety.info "How can I help?" FAQ

If we get things right, AI could have huge benefits

Advanced AI is a big deal even if we don’t lose control

Defeat may be irreversibly catastrophic

AI can win a conflict against us

Different goals may bring AI into conflict with us

AI’s goals may not match ours

AI may pursue goals