Prompt Optimization of Large Language Model for Interactive Tasks without Gradient and Demonstrations

Ouyang, Siqi; Li, Lei

Computer Science > Computation and Language

arXiv:2305.15064v1 (cs)

[Submitted on 24 May 2023 (this version), latest version 26 Oct 2023 (v3)]

Title:Prompt Optimization of Large Language Model for Interactive Tasks without Gradient and Demonstrations

Authors:Siqi Ouyang, Lei Li

View PDF

Abstract:Large language models (LLMs) have demonstrated remarkable language proficiency, but they face challenges when solving interactive tasks independently. Existing methods either rely on gradient access, which is often inaccessible in state-of-the-art LLMs like GPT-4, or necessitate diverse and high-quality in-context demonstrations. In this study, we propose LLM-PO, a novel approach that enables LLMs to address these tasks without gradient access or extensive demonstrations. The key idea is to maintain a text-based plan and ask LLMs to reflect on pros and cons of the current plan based on experience collected with it, to update the plan, and to collect more experiences with the new plan. Experiments on HotpotQA demonstrate that LLM-PO achieves higher or on par success rates compared to in-context learning (ICL) baselines while requiring less inference cost.

Comments:	Draft. Work in Progress
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2305.15064 [cs.CL]
	(or arXiv:2305.15064v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2305.15064

Submission history

From: Siqi Ouyang [view email]
[v1] Wed, 24 May 2023 11:52:23 UTC (3,160 KB)
[v2] Fri, 20 Oct 2023 18:27:33 UTC (3,068 KB)
[v3] Thu, 26 Oct 2023 16:44:39 UTC (3,068 KB)

Computer Science > Computation and Language

Title:Prompt Optimization of Large Language Model for Interactive Tasks without Gradient and Demonstrations

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Prompt Optimization of Large Language Model for Interactive Tasks without Gradient and Demonstrations

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators