DaveX comments on The self-unfooling problem - Less Wrong

14 Post author: RichardKennaway 11 October 2011 08:36AM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (30)

You are viewing a single comment's thread.

Comment author: DaveX 12 October 2011 03:27:04AM 0 points [-]

I'm confused about the "hide" part of the initial task, or the "fooling" that needs to be unfooled. The objective function rewards ineffective fooling.

It seems you simply mean "store" such that you can find it.

Comment author: Vaniver 12 October 2011 03:35:58PM 4 points [-]

Congrats, you got the joke!