Jiro comments on The Strangest Thing An AI Could Tell You - Less Wrong
You are viewing a comment permalink. View the original post to see all comments and the full post content.
You are viewing a comment permalink. View the original post to see all comments and the full post content.
Comments (574)
If it is possible for me to have a "denial" function, that doesn't just apply to flat earths and talking cows. I could equally well have a "denial" function that makes me blind to properly reading the AI's code and properly auditing the AI. I would never have a reason to prefer "I have the specific denial function the AI told me about" to "I have a denial function that prevents me from seeing how bad the AI is".