Note: This work is a close collaboration between me and Nathan Labenz (@Nathan). Many thanks to him for starting this particular piece of research and letting me tag along, and to @JustisMills for help navigating the submission process and for feedback on the draft. If you're interested in running experiments yourself, check out our Replit instance or read the code on GitHub.
Introduction
Theory of mind (ToM) is the ability to attribute mental states to ourselves and others; accurately predicting other people’s beliefs, intents, desires and emotions is key to navigating everyday life. Understanding whether LLMs have theory of mind is an interesting theoretical question. However, the question of whether AI can...