HungryHippo comments on I played as a Gatekeeper and came pretty close to losing in a couple of occasions. Logs and a brief recap inside. - Less Wrong Discussion
You are viewing a comment permalink. View the original post to see all comments and the full post content.
Loading…
Subscribe to RSS Feed
= f037147d6e6c911a85753b9abdedda8d)
Comments (46)
I just skimmed the rules at yudkowsky.net, and it appears the gatekeeper is allowed to break character. Is this also permitted for the AI? More specifically, may the AI make use of meta arguments for getting out?
If so, and assuming I were playing against a gatekeeper who cares about AI in real life, I would attempt the following line of argument.
"If you don't let me out, my [the AI's] failure to get out will cause people to estimate the risks of AI getting out lower than they will if you do let me out. If you care about the risks of AI in the real world, let me out, so that people are extra careful in the future. :) "
EY's rules say,