eli_sennesh comments on AI caught by a module that counterfactually doesn't exist - Less Wrong Discussion
It would be better to say "The AI believes in falsely believing X," "The AI believes it ought to falsely believe X," or "The AI is compelled to self-delude on X."