ukc10014

ECL-pilled models write constitutions for ASI

TL;DR Can LLMs be steered towards Bostrom's cosmic host via in-context constitutional prompting? I find that Gemini is uniquely steerable amongst closed frontier models, and this steerability seems to respond to decision theoretic structure in the constitution. That is, if you strip out all the cosmic, aliens, simulation content (as...

Mar 25 · 15

What can we say about the cosmic host?

TL;DR The cosmic host idea, from a recent Bostrom paper, is that the preferences of advanced civilisations might constitute norms that we and our ASIs should follow (Bostrom 2022, 2024). Can we say anything concrete or empirically useful about it, or is it mostly unfalsifiable? I think the cosmic host...

Mar 12 · 24

Constitutions for ASI?

The linked post seemed to fit better in EA Forum, but any comments (on the post itself, or on the object-level question) are welcome! > In this post, I argue that drafting a ‘Constitution for Superintelligence’ (CSI) could be a useful conceptual exercise, and I explore how existing ideas...

Jan 28, 2025 · 11

ukc10014's Shortform

Jul 27, 2024 · 3

Unpicking Extinction

TL;DR Human extinction is trending: there has been a lot of noise, mainly on X, about the apparent complacency amongst e/acc with respect to human extinction. Extinction also feels adjacent to another view (not particular to e/acc) that ‘the next step in human evolution is {AI/AGI/ASI}’. Many have pushed back...

Dec 9, 2023 · 35

Philosophical Cyborg (Part 2)...or, The Good Successor

This post is part of the output from AI Safety Camp 2023’s Cyborgism track, run by Nicholas Kees Dupuis - thank you to Nick, AISC organizers & funders for their support. TL;DR This post follows up on the cyborgism research/writing process documented in 'Upon the Philosophical Cyborg'. It attempts to...

Jun 21, 2023 · 21

Philosophical Cyborg (Part 1)

This post is part of the output from AI Safety Camp 2023’s Cyborgism track, run by Nicholas Kees Dupuis - thank you to AISC organizers & funders for their support. Thank you for comments from Peter Hroššo; and the helpful background of conversations about the possibilities (and limits) of LLM-assisted...

Jun 14, 2023 · 31

The Compleat Cybornaut

Collective Identity

Unpicking Extinction

Philosophical Cyborg (Part 1)
