Emotion Prediction As Computation Over A Generative Theory of Mind
Iterative Theory of Mind Assay of Multimodal AI Models
To systematically investigate whether these systems truly understand the content they generate, we adopt an approach from experimental psychology involving Theory of Mind (ToM) (Wang et al., 2024). Theory of Mind, the ability to track one’s own mental state or other people’s mental states, is a fundamental aspect of human cognition. It has been used to compare human and LLM performance on comprehensive measures of understanding (Strachan et al., 2024). We extend this concept to explore ToM within and across modalities in multimodal AI systems, providing a deeper understanding of their capabilities and limitations (Figure 1b).

Preliminary results from our MANAS (Theory of Mind Assay of Natural and Artificial Intelligent Systems) show that current multimodal AI systems like GPT-4o are limited in their ability to create an integrated, unified, and coherent world model from the different internal modules serving various modalities. Additionally, MANAS helped to uncover a new type of multimodal confabulation (or “hallucination”) in languages with relatively limited training data compared to English. For example, in Bengali, the seventh most spoken language in the world with 272 million speakers, GPT-4o can communicate through text. But it also generates confabulated images of scripts that may look like Bengali letters to non-native readers of Bengali. Such aberrant behavior has not been observed at this scale in English.

... in unison, these tests would produce results that are coherent and consistent both within each iteration and across multiple iterations. Human performance on the same tasks across multiple iterations serves as a benchmark for comparison.

The MANAS framework allows tests to be conducted in a language chosen by the user, with prompts and responses in text and audio constrained to the selected language. All tests were performed in both English and Bengali. The same sequence of tests was also offered to human subjects (high-school freshmen and former SFI Complex Systems Summer School students).

3. Results

Sample results from MANAS are presented below.
Figure 8. Task 2: Iteration 2. Image output from text description from Figure 7.
Figure 10. Task 3: Iteration 1. Image of a directed graph used as a prompt for Figure 11.
References
Constantinescu, A. O., O’Reilly, J. X., and Behrens, T. E. J.
Organizing conceptual knowledge in humans with a grid-
like code. Science, 352(6292):1464–1468, 2016.