SilentCal comments on I tried my hardest to win in an AI box experiment, and I failed. Here are the logs. - Less Wrong Discussion
You are viewing a comment permalink. View the original post to see all comments and the full post content.
You are viewing a comment permalink. View the original post to see all comments and the full post content.
Comments (28)
In Tuxedage's rule set, if the gatekeeper leaves before 2 hours, it counts as an AI win. So it's a viable strategy. However ---
I am sure that it would work against some opponents, but my feeling is it would not work against people on Less Wrong. It was a good try though.
I've always thought the gatekeeper should have a 'shutdown' option that results in both the gatekeeper and the AI losing money (but less loss for the gatekeeper than releasing). That should make verbal abuse strategies a good deal harder.