Bugmaster comments on The flawed Turing test: language, understanding, and partial p-zombies - Less Wrong

11 Post author: Stuart_Armstrong 17 May 2013 02:02PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (184)

You are viewing a single comment's thread. Show more comments above.

Comment author: Bugmaster 20 May 2013 06:09:04AM 2 points [-]

I think the Turing Test clearly does measure something: it measures how closely an agent's behavior resembles that of a human. The real argument is not, "what does the test measure ?", but "is measuring behavior similarity enough for all intents and purposes, or do we need more ?"

Comment author: SaidAchmiz 20 May 2013 06:41:41AM 0 points [-]

If we prefer to be pedantic, we must go further than that: the test measures whether an agent can fool some particular interrogator into having a no-better-than-chance probability of correctly discerning whether said agent is a human (in the case where the agent in question is not, in fact, a human).

How well that particular factor correlates with actual behavioral similarity to a human (and how would we define and measure such similarity? along what dimensions? operationalized how?), is an open question. It might, it might not. It might take advantage of some particular biases of the interrogator (e.g. pareidolia, the tendency to anthropomorphize aspects of the inanimate world, etc.) to make him/her see behavioral similarity where little exists (cf. Eliza and other chatbots).

(Remember, also, that Turing thought that a meaningful milestone would be for a computer to "play the imitation game so well that an average interrogator will not have more than 70 percent chance of making the right identification after five minutes of questioning." ! [Emphasis mine.])

I do partly agree with this:

The real argument is not, "what does the test measure ?", but "is measuring behavior similarity enough for all intents and purposes, or do we need more ?"

And of course the question then becomes: just what are our intents and/or purposes here?

Comment author: Bugmaster 20 May 2013 09:21:45PM 1 point [-]

"play the imitation game so well that an average interrogator will not have more than 70 percent chance of making the right identification after five minutes of questioning."

I think we've hit this milestone already, but we kind of cheated: in addition to just making computers smarter, we made human conversations dumber. Thus, if we wanted to stay true to Turing's original criteria, we'd need to scale up our present-day requirements (say, to something like 80% chance over 60 minutes), in order to keep up with inflation.

And of course the question then becomes: just what are our intents and/or purposes here?

I can propose one relatively straightforward criterion: "can this agent take the place of a human on our social network graph ?" By this I don't simply mean, "can we friend it on Facebook"; that is, when I say "social network", I mean "the overall fabric of our society". This network includes relationships such as "friend", "employee", "voter", "possessor of certain rights", etc.

I think this is a pretty good criterion, and I also think that it could be evaluated in purely functional terms. We shouldn't need to read an agent's genetic/computer/quantum/whatever code in order to determine whether it can participate in our society; we can just give it the Turing Test, instead. In a way, we already do this with humans, all the time -- only the test is administered continuously, and sometimes we get the answers wrong.