nyan_sandwich comments on AI box: AI has one shot at avoiding destruction - what might it say? - Less Wrong Discussion
You are viewing a comment permalink. View the original post to see all comments and the full post content.
You are viewing a comment permalink. View the original post to see all comments and the full post content.
Comments (354)
I think we are suffering from hindsight bias a lot in evaluating whether you'd type "AI DESTROYED"
Let's play a different game. Privately flip a coin. If heads, you're friendly, if tails, you're a paperclip maximizer. Reply to this post with your gambit, and people can try to guess whether you are friendly (talk to AI, RELEASE AI) or unfriendly (AI DESTROYED).
Let's see if anyone can get useful information out of the AI without getting pwned or nuking a friendly AI.
"What's your favorite color? My favorite color is paperclips."