asr comments on AI Box Log - Less Wrong

16 Post author: Dorikka 27 January 2012 04:47AM

Comment author: asr 27 January 2012 09:10:57PM 0 points

This depends on the internal structure of the system. The inner workings of any particular human mind are mostly a black box to us. The internal workings of software need not be. If your AI has data structures and control logic that we can understand, you could dump its intermediate results and review them by hand. For instance, there might be a debug interface that lets you unambiguously read the AI's internal probability estimate for some contingency.
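
To make the debug-interface idea concrete, here is a toy sketch in Python. Everything in it is hypothetical: `ToyAgent`, `DebugInterface`, and the contingency name are made up for illustration, and the sketch assumes the beliefs are stored as plain probabilities in an ordinary dictionary, which a real system might not do. The point is only that the reviewer reads the stored number directly rather than asking the agent to report it.

```python
from dataclasses import dataclass, field
from types import MappingProxyType
from typing import Dict, Mapping


@dataclass
class ToyAgent:
    """Hypothetical agent whose beliefs live in a plain, inspectable dict."""
    _beliefs: Dict[str, float] = field(default_factory=dict)

    def update(self, contingency: str, probability: float) -> None:
        # Reject values that are not valid probabilities.
        if not 0.0 <= probability <= 1.0:
            raise ValueError("probability must be in [0, 1]")
        self._beliefs[contingency] = probability


class DebugInterface:
    """Hypothetical read-only view onto the agent's state, for human review."""

    def __init__(self, agent: ToyAgent) -> None:
        self._agent = agent

    def probability_of(self, contingency: str) -> float:
        # Reads the stored estimate directly; the agent's own output
        # channel is never consulted.
        return self._agent._beliefs[contingency]

    def dump(self) -> Mapping[str, float]:
        # MappingProxyType is a read-only view, so the reviewer cannot
        # accidentally mutate the agent's state while inspecting it.
        return MappingProxyType(self._agent._beliefs)


agent = ToyAgent()
agent.update("operator_opens_box", 0.03)

debug = DebugInterface(agent)
print(debug.probability_of("operator_opens_box"))
```

Returning a read-only view from `dump` is a design choice for the sketch: inspection should not be able to change what is being inspected.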

Note that you need not have a perfect understanding of how the AI works in order to rule out the presence of a whole shadow AI inside the running program.