Analyzing machine-learned representations: A natural language case study

Dasgupta, Ishita; Guo, Demi; Gershman, Samuel J.; Goodman, Noah D.

Computer Science > Computation and Language

arXiv:1909.05885 (cs)

[Submitted on 12 Sep 2019]

Title:Analyzing machine-learned representations: A natural language case study

Authors:Ishita Dasgupta, Demi Guo, Samuel J. Gershman, Noah D. Goodman

View PDF

Abstract:As modern deep networks become more complex, and get closer to human-like capabilities in certain domains, the question arises of how the representations and decision rules they learn compare to the ones in humans. In this work, we study representations of sentences in one such artificial system for natural language processing. We first present a diagnostic test dataset to examine the degree of abstract composable structure represented. Analyzing performance on these diagnostic tests indicates a lack of systematicity in the representations and decision rules, and reveals a set of heuristic strategies. We then investigate the effect of the training distribution on learning these heuristic strategies, and study changes in these representations with various augmentations to the training set. Our results reveal parallels to the analogous representations in people. We find that these systems can learn abstract rules and generalize them to new contexts under certain circumstances -- similar to human zero-shot reasoning. However, we also note some shortcomings in this generalization behavior -- similar to human judgment errors like belief bias. Studying these parallels suggests new ways to understand psychological phenomena in humans as well as informs best strategies for building artificial intelligence with human-like language understanding.

Comments:	This article supersedes a previous article arXiv:1802.04302
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1909.05885 [cs.CL]
	(or arXiv:1909.05885v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1909.05885

Submission history

From: Ishita Dasgupta [view email]
[v1] Thu, 12 Sep 2019 18:03:17 UTC (703 KB)

Computer Science > Computation and Language

Title:Analyzing machine-learned representations: A natural language case study

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Analyzing machine-learned representations: A natural language case study

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators