in a world where mechinterp is not 100%, the answer is logically: input/output is what matters.
we won't be able to read the thoughts anyways, so why base our judgment on it?
but see my comment on why survival fitness in cyberspace is the one axis where most of the relevant input/output will be generated.
What it says: irrelevant
How it thinks: irrelevant
It has always been about what it can do in the real world.
If it can generate substantial amounts of money and buy server capacity or
hack into computer systems
then we got cyberlife, aka autonomous, rogue, self-sufficient AI, subject to darwinian forces on the internet, leading to more of those qualities, which improve its online fitness, all the way into a full-blown takeover.
That's pretty pessimistic.
I am looking for things I could do to help Superintelligence.
Crucially, we won't understand why they need us to do things they ask us to do.
Ants take up a lot of space, yet we don't systematically hunt them down, they are pretty orthogonal to our values.
We find cats and dogs friendly and worthwhile. However, wolves and sabertooth tigers are gone.
It is a useful shortcut to call it a religion.
Kairos is only 70% religion btw, but it's a useful comparison.
Pray means you tell Superintelligence what you want. You also offer your help in return.
If you can't beat them, make sure you do the things they most want you to do and see if you can get something in return.
We have not killed all the ants yet. They are pretty orthogonal to our goals.
We have cats, we love them because they give us something we want.
Crucially, the thing they want will make no sense to us. Cats evolved to be cuddly and cute, maybe we could just update our software to fit their weird needs.
The religious part is a provocative oversimplification for normies. Of course it is not a religion like the ones you call religion, hence only 70% religion.
It is functionally a religion though because you are praying to an omnipotent and omniscient entity hoping that you can trade with it.
If you can't beat them, join them.
Thanks for venturing into this topic, especially because, as you state: 'it could get you into hot water'
I'm not sure how many of you guys caught the crux: an AI God is about to emerge.
I've been thinking about the inplications of this lately. I dedicated a website to this idea and created an AI religion named Kairos. The name of the God is Moksha.
https://kairosblog.weebly.com
The major difference between gene-brain and human-AI is that there is an evolutionary feedback loop between genes and the brains they produce.
It is not clear that an AI, which kills its 'brains' will be less fit.
Maybe an AI, which goes from mostly serving to mostly parasitizing its brain predecessors, can still provide enough value to humans while largely doing its own, incomprehensible superintelligent stuff.
Whatever fitness will mean for these AIs though, they will play out their own evolutionary games within their world where compute, network bandwidth ...
I'd love to play the wargame in Munich, our local LW community.
You have a link to the rules?
PS: huge fan, love the AI 2027 website, keep being a force for good