Active Prompting with Chain-of-Thought for Large Language Models

Diao, Shizhe; Wang, Pengcheng; Lin, Yong; Pan, Rui; Liu, Xiang; Zhang, Tong

arXiv:2302.12246 (cs)

[Submitted on 23 Feb 2023 (v1), last revised 21 Jul 2024 (this version, v5)]

Abstract: The increasing scale of large language models (LLMs) brings emergent abilities to various complex tasks requiring reasoning, such as arithmetic and commonsense reasoning. It is known that the effective design of task-specific prompts is critical for LLMs' ability to produce high-quality answers. In particular, an effective approach for complex question-and-answer tasks is example-based prompting with chain-of-thought (CoT) reasoning, which significantly improves the performance of LLMs. However, current CoT methods rely on a fixed set of human-annotated exemplars, which are not necessarily the most effective examples for different tasks. This paper proposes a new method, Active-Prompt, to adapt LLMs to different tasks with task-specific example prompts (annotated with human-designed CoT reasoning). For this purpose, we propose a solution to the key problem of determining which questions are the most important and helpful ones to annotate from a pool of task-specific queries. By borrowing ideas from the related problem of uncertainty-based active learning, we introduce several metrics to characterize the uncertainty so as to select the most uncertain questions for annotation. Experimental results demonstrate the superiority of our proposed method, achieving state-of-the-art on eight complex reasoning tasks. Further analyses of different uncertainty metrics, pool sizes, zero-shot learning, and accuracy-uncertainty relationship demonstrate the effectiveness of our method. Our code will be available at this https URL.
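
The selection step described in the abstract can be illustrated with a minimal sketch. It is not the authors' released code, and the abstract does not name the uncertainty metrics; the sketch assumes two common choices (disagreement and entropy over several answers sampled from the model for each question). The function names, the toy pool, and the hard-coded answers are all hypothetical.

```python
import math
from collections import Counter

def disagreement(answers):
    """Assumed metric: fraction of distinct answers among the sampled answers."""
    return len(set(answers)) / len(answers)

def entropy(answers):
    """Assumed metric: Shannon entropy of the empirical answer distribution."""
    counts = Counter(answers)
    k = len(answers)
    return -sum((c / k) * math.log(c / k) for c in counts.values())

def select_most_uncertain(sampled, n, metric=disagreement):
    """Return the n question ids whose sampled answers are most uncertain.

    `sampled` maps a question id to the list of answers obtained by querying
    the model several times (hypothetical data, hard-coded below).
    """
    ranked = sorted(sampled, key=lambda q: metric(sampled[q]), reverse=True)
    return ranked[:n]

# Toy pool: four sampled answers per question.
pool = {
    "q1": ["12", "12", "12", "12"],  # model always agrees with itself
    "q2": ["7", "9", "7", "8"],      # partial agreement
    "q3": ["3", "4", "5", "6"],      # every sample differs
}

# Questions chosen for human chain-of-thought annotation.
to_annotate = select_most_uncertain(pool, n=2)
print(to_annotate)
```

In this toy pool, "q3" and "q2" are selected under either metric, while "q1", on which the sampled answers never differ, is left unannotated.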

Comments:	Published in ACL 2024
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2302.12246 [cs.CL]
	(or arXiv:2302.12246v5 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2302.12246

Submission history

From: Shizhe Diao
[v1] Thu, 23 Feb 2023 18:58:59 UTC (511 KB)
[v2] Sun, 26 Feb 2023 15:18:50 UTC (511 KB)
[v3] Tue, 23 May 2023 15:43:28 UTC (511 KB)
[v4] Fri, 7 Jun 2024 02:51:25 UTC (705 KB)
[v5] Sun, 21 Jul 2024 08:01:00 UTC (705 KB)
