Few-Shot Semantic Parsing with Language Models Trained On Code

Shin, Richard; Van Durme, Benjamin

arXiv:2112.08696 (cs)

[Submitted on 16 Dec 2021 (v1), last revised 29 May 2022 (this version, v2)]

Abstract: Large language models can perform semantic parsing with little training data when prompted with in-context examples. It has been shown that this can be improved by formulating the problem as paraphrasing into canonical utterances, which casts the underlying meaning representation into a controlled, natural-language-like representation. Intuitively, such models can more easily output canonical utterances because they are closer to the natural language used for pre-training. Recently, models also pre-trained on code, like OpenAI Codex, have risen in prominence. For semantic parsing tasks where we map natural language into code, such models may prove more adept. In this paper, we test this hypothesis and find that Codex performs better on such tasks than equivalent GPT-3 models. We evaluate on Overnight and SMCalFlow and find that, unlike GPT-3, Codex performs about as well when targeting meaning representations directly as when targeting canonical utterances, perhaps because meaning representations are structured similarly to code in these datasets.
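The two prompting targets contrasted in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the `build_prompt` helper, the `Input:`/`Output:` layout, and the example utterances and targets are hypothetical and are not taken from the paper, Overnight, or SMCalFlow. The sketch only builds the prompt strings and does not call any model.

```python
from typing import List, Tuple


def build_prompt(examples: List[Tuple[str, str]], query: str) -> str:
    """Join (utterance, target) pairs into a few-shot prompt ending at the open slot."""
    blocks = [f"Input: {utterance}\nOutput: {target}" for utterance, target in examples]
    # The final block leaves the output empty for the language model to complete.
    blocks.append(f"Input: {query}\nOutput:")
    return "\n\n".join(blocks)


# Hypothetical in-context examples (not drawn from Overnight or SMCalFlow).
# Target style 1: canonical utterances, a controlled natural-language-like form.
canonical_examples = [
    ("what meetings do I have tomorrow", "find events whose date is tomorrow"),
    ("anything on my calendar friday", "find events whose date is friday"),
]
# Target style 2: meaning representations written directly in a code-like form.
meaning_examples = [
    ("what meetings do I have tomorrow", "(find_events (date tomorrow))"),
    ("anything on my calendar friday", "(find_events (date friday))"),
]

query = "do I have any meetings on monday"
canonical_prompt = build_prompt(canonical_examples, query)
meaning_prompt = build_prompt(meaning_examples, query)

print(canonical_prompt)
print()
print(meaning_prompt)
```

Only the target side of each example differs between the two prompts, which is the variable the abstract describes comparing across GPT-3 and Codex.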

Comments:	NAACL 2022
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2112.08696 [cs.CL]
	(or arXiv:2112.08696v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2112.08696

Submission history

From: Richard Shin
[v1] Thu, 16 Dec 2021 08:34:06 UTC (28 KB)
[v2] Sun, 29 May 2022 15:47:04 UTC (41 KB)
