TL;DR Can LLMs be steered towards Bostrom's cosmic host via in-context constitutional prompting? I find that Gemini is uniquely steerable amongst closed frontier models, and this steerability seems to respond to decision-theoretic structure in the constitution. That is, if you strip out all the cosmic, alien, and simulation content (as...
TL;DR The cosmic host idea, from a recent Bostrom paper, is that the preferences of advanced civilisations might constitute norms that we and our ASIs should follow (Bostrom 2022, 2024). Can we say anything concrete or empirically useful about it, or is it mostly unfalsifiable? I think the cosmic host...
The linked post seemed to fit better on the EA Forum, but any comments (on the post itself, or on the object-level question) are welcome! > In this post, I argue that drafting a ‘Constitution for Superintelligence’ (CSI) could be a useful conceptual exercise, and I explore how existing ideas...
TL;DR Human extinction is trending: there has been a lot of noise, mainly on X, about the apparent complacency amongst e/acc with respect to human extinction. Extinction also feels adjacent to another view (not particular to e/acc) that ‘the next step in human evolution is {AI/AGI/ASI}’. Many have pushed back...
This post is part of the output from AI Safety Camp 2023’s Cyborgism track, run by Nicholas Kees Dupuis - thank you to Nick, AISC organizers & funders for their support. TL;DR This post follows up on the cyborgism research/writing process documented in 'Upon the Philosophical Cyborg'. It attempts to...
This post is part of the output from AI Safety Camp 2023’s Cyborgism track, run by Nicholas Kees Dupuis - thank you to AISC organizers & funders for their support. Thank you for comments from Peter Hroššo; and the helpful background of conversations about the possibilities (and limits) of LLM-assisted...