r/Fitness does a weekly "Moronic Monday", a judgment-free thread where people can ask questions that they would ordinarily feel embarrassed for not knowing the answer to. I thought this seemed like a useful thing to have here - after all, the concepts discussed on LessWrong are probably at least a little harder to grasp than those of weightlifting. Plus, I have a few stupid questions of my own, so it doesn't seem unreasonable that other people might as well.
I meant that by going meta we might not have to solve them fully.
All the problems you list sound nearly identical to me. In particular, "what matters to humans" sounds more vague but just as meta. If it includes enough details to actually reassure me, you could just tell the AI, "Do that." Presumably what matters to us would include 'the ability to affect our environment, eg by giving orders.' What do you mean by "very powerful but insane"? I want to parse that as 'intelligent in the sense of having accurate models that allow it to shape the future, but not programmed to do what matters to humans.'
"very powerful but insane" : AI's response to orders seem to make less than no sense, yet AI is still able to do damage. "What matters to humans": Things like the Outcome Pump example, where any child would know that not dying is supposed to be part of "out of the building", but not including the problems that we are bad at solving, such as fun theory and the like.