drethelin comments on I attempted the AI Box Experiment again! (And won - Twice!) - Less Wrong Discussion
Comments (163)
Although I'm worried about how the impossibility of boxing represents an existential risk, I find it hard to alert others to this.
The custom of not sharing powerful attack strategies is an obstacle. It forces me - and the people I want to discuss this with - to imagine how someone (and hypothetically something) much smarter than ourselves would argue, and we're not good at imagining that.
I wish I had a story in which an AI gets a highly competent gatekeeper to unbox it. If the AI strategies you guys have come up with could actually work outside the frame this game is played in, it should be quite a compelling story. Maybe even a movie script. That'd create interest in FAI among the short-attention-span population.
Mr Yudkowsky, wouldn't that be your kind of project?
Isn't that pretty much what http://lesswrong.com/lw/qk/that_alien_message/ is about?
Pretty much, and I loved that story. But it glosses over the persuasion bit, which is the interesting part. And it'd be hard to turn into a YouTube video.