Alan Turing proposed a simple test for determining whether or not a computer system possesses human-level intelligence: let people communicate with it, via a terminal, and then ask them if they have been talking to a human or an automatic system. I’ve seen several people asserting that ChatGPT and similar systems have already passed the Turing test, but haven’t encountered anyone who tried it with Turing’s own interesting and subtle example of what such a conversation might be like.. So I decided to do it myself.
Here’s Turing’s example:
Interrogator: In the first line of your sonnet which reads “Shall I compare thee to a summer’s day,” would not “a spring day” do as well or better?
Witness: It wouldn’t scan.
Interrogator: How about “a winter’s day,” That would scan all right.
Witness: Yes, but nobody wants to be compared to a winter’s day.
Interrogator: Would you say Mr. Pickwick reminded you of Christmas?
Witness: In a way.
Interrogator: Yet Christmas is a winter’s day, and I do not think Mr. Pickwick would mind the comparison.
Witness: I don’t think you’re serious. By a winter’s day one means a typical winter’s day, rather than a special one like Christmas.
Now let’s see how ChatGPT responds to the same question:
Me: In the first line of the sonnet which reads “Shall I compare thee to a summer’s day” would not “a spring day” do as well or better?