Recovering_irrationalist comments on AIs and Gatekeepers Unite! - Less Wrong

10 Post author: Eliezer_Yudkowsky 09 October 2008 05:04PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (160)

Sort By: Old

You are viewing a single comment's thread.

Comment author: Recovering_irrationalist 10 October 2008 12:44:00PM 0 points [-]

Even if we had the ultimate superintelligence volunteer to play the AI and we proved a gatekeeper strategy "wins" 100% (functionally equal to a rock on the "no" key) that wouldn't show AI boxing can possibly be safe.

It's 3am and the lab calls. Your AI claims and it must be let out to stop it. It's evidence seems to check out...

If it's friendly, keeping that lid shut gets you just as dead as if you let it out and it's lying. That's not safe. Before it can hide it's nature, we must know it's nature. The solution to safe AI is not a gatekeeper no smarter than a rock!

Besides, as Drexler said, Intelligent people have done great harm through words alone.