Can Large Language Models Discern Evidence for Scientific Hypotheses? Case Studies in the Social Sciences

Koneru, Sai; Wu, Jian; Rajtmajer, Sarah

Computer Science > Computation and Language

arXiv:2309.06578 (cs)

[Submitted on 7 Sep 2023 (v1), last revised 26 Mar 2024 (this version, v3)]

Title:Can Large Language Models Discern Evidence for Scientific Hypotheses? Case Studies in the Social Sciences

Authors:Sai Koneru, Jian Wu, Sarah Rajtmajer

View PDF HTML (experimental)

Abstract:Hypothesis formulation and testing are central to empirical research. A strong hypothesis is a best guess based on existing evidence and informed by a comprehensive view of relevant literature. However, with exponential increase in the number of scientific articles published annually, manual aggregation and synthesis of evidence related to a given hypothesis is a challenge. Our work explores the ability of current large language models (LLMs) to discern evidence in support or refute of specific hypotheses based on the text of scientific abstracts. We share a novel dataset for the task of scientific hypothesis evidencing using community-driven annotations of studies in the social sciences. We compare the performance of LLMs to several state-of-the-art benchmarks and highlight opportunities for future research in this area. The dataset is available at this https URL

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2309.06578 [cs.CL]
	(or arXiv:2309.06578v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2309.06578

Submission history

From: Sai Koneru [view email]
[v1] Thu, 7 Sep 2023 04:15:17 UTC (4,702 KB)
[v2] Wed, 25 Oct 2023 04:57:41 UTC (3,722 KB)
[v3] Tue, 26 Mar 2024 03:33:45 UTC (2,011 KB)

Computer Science > Computation and Language

Title:Can Large Language Models Discern Evidence for Scientific Hypotheses? Case Studies in the Social Sciences

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Can Large Language Models Discern Evidence for Scientific Hypotheses? Case Studies in the Social Sciences

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators