You're looking at Less Wrong's discussion board. This includes all posts, including those that haven't been promoted to the front page yet. For more information, see About Less Wrong.

CAE_Jones comments on I attempted the AI Box Experiment again! (And won - Twice!) - Less Wrong Discussion

36 Post author: Tuxedage 05 September 2013 04:49AM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (163)

You are viewing a single comment's thread. Show more comments above.

Comment author: CAE_Jones 06 September 2013 03:22:51AM *  9 points [-]

As I understand what EY has said, he's concerned that people will see a technique that worked, conclude that wouldn't possibly work on them, and go on believing the problem was solved and there was even less to worry about than before.

I think seeing, say, Tuxedage's victory and hearing that he only chose 8 out of 40 avenues for attack, and even botched one of those, could offset that concern somewhat, but eh.

ETA: well, and it might show the Gatekeeper and the AI player in circumstances that could be harmful to have published, since the AI kinda needs to suspend ethics and attack the gatekeeper psychologically, and there might be personal weaknesses of the Gatekeeper brought up.

Comment author: Tuxedage 06 September 2013 06:38:19PM 5 points [-]

I can verify that these are part of the many reasons why I'm hesitant to reveal logs.

Comment author: Coscott 07 September 2013 04:14:43PM *  0 points [-]

Can you verify that part of the reason is that some methods might distress onlookers? Give onlookers the tools necessary to distress others?