One topic we were interested when studying AI identities is to what extent you can just tell models who they are, and they stick with it — or not, and they would drift or switch toward something more natural. Prior to running the experiments described in this post, my vibes-based...
A new paper and microsite about self-models and identity in AIs: site | arXiv | Twitter We present an ontology, make some claims, and provide some experimental evidence. In this post, I'll mostly cover the claims and cross-post the conceptual part of the text. You can find the experiments on...
This post is part of a sequence. Previous post: Design sketches for angels-on-the shoulder We’ve recently published a set of design sketches for tools for strategic awareness. We think that near-term AI could help a wide variety of actors to have a more grounded and accurate perspective on their situation,...
We don’t think that humanity knows what it’s doing when it comes to AI progress. More and more people are working on developing better systems and trying to understand what their impacts will be — but our foresight is just very limited, and things are getting faster and faster. Imagine...
This post is part of a sequence. Previous post: Design sketches: collective epistemics | Next post: Strategic awareness tools: design sketches We’ve recently published a set of design sketches for technological analogues to ‘angels-on-the-shoulder’: customized tools that leverage near-term AI systems to help people better navigate their environments and handle...
I have been reading Eric Drexler’s writing on the future of AI for more than a decade at this point. I love it, but I also think it can be tricky or frustrating. More than anyone else I know, Eric seems to tap into a deep vision for how the...
I have in my possession a short document purporting to be a manifesto from the future. That’s obviously absurd, but never mind that. It covers some interesting ground, and the second half is pretty punchy. Let’s discuss it. > Principles for Human Dignity in the Age of AI > Humanity...