Perceptions to Beliefs: Exploring Precursory Inferences for Theory of Mind in Large Language Models

Jung, Chani; Kim, Dongkwan; Jin, Jiho; Kim, Jiseon; Seonwoo, Yeon; Choi, Yejin; Oh, Alice; Kim, Hyunwoo

arXiv:2407.06004 (cs)

[Submitted on 8 Jul 2024 (v1), last revised 6 Nov 2024 (this version, v3)]

Abstract: While humans naturally develop theory of mind (ToM), the capability to understand other people's mental states and beliefs, state-of-the-art large language models (LLMs) underperform on simple ToM benchmarks. We posit that we can extend our understanding of LLMs' ToM abilities by evaluating key human ToM precursors (perception inference and perception-to-belief inference) in LLMs. We introduce two datasets, Percept-ToMi and Percept-FANToM, to evaluate these precursory inferences for ToM in LLMs by annotating characters' perceptions on ToMi and FANToM, respectively. Our evaluation of eight state-of-the-art LLMs reveals that the models generally perform well in perception inference while exhibiting limited capability in perception-to-belief inference (e.g., lack of inhibitory control). Based on these results, we present PercepToM, a novel ToM method leveraging LLMs' strong perception inference capability while supplementing their limited perception-to-belief inference. Experimental results demonstrate that PercepToM significantly enhances LLMs' performance, especially in false belief scenarios.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2407.06004 [cs.CL]
	(or arXiv:2407.06004v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2407.06004

Submission history

From: Chani Jung
[v1] Mon, 8 Jul 2024 14:58:29 UTC (8,625 KB)
[v2] Tue, 9 Jul 2024 09:11:18 UTC (8,608 KB)
[v3] Wed, 6 Nov 2024 22:07:06 UTC (8,627 KB)
