Towards Neural Synthesis for SMT-Assisted Proof-Oriented Programming

Chakraborty, Saikat; Ebner, Gabriel; Bhat, Siddharth; Fakhoury, Sarah; Fatima, Sakina; Lahiri, Shuvendu; Swamy, Nikhil

Computer Science > Programming Languages

arXiv:2405.01787 (cs)

[Submitted on 3 May 2024 (v1), last revised 4 Sep 2024 (this version, v3)]

Title:Towards Neural Synthesis for SMT-Assisted Proof-Oriented Programming

Authors:Saikat Chakraborty, Gabriel Ebner, Siddharth Bhat, Sarah Fakhoury, Sakina Fatima, Shuvendu Lahiri, Nikhil Swamy

View PDF HTML (experimental)

Abstract:Proof-oriented programs mix computational content with proofs of program correctness. However, the human effort involved in programming and proving is still substantial, despite the use of Satisfiability Modulo Theories (SMT) solvers to automate proofs in languages such as F*. Seeking to spur research on using AI to automate the construction of proof-oriented programs, we curate a dataset of 600K lines of open-source F* programs and proofs, including software used in production systems ranging from Windows and Linux to Python and Firefox. Our dataset includes around 32K top-level F* definitions, each representing a type-directed program and proof synthesis problem producing a definition given a formal specification expressed as an F* type. We provide a program fragment checker that queries F* to check the correctness of candidate solutions. We also report on an extended version of our dataset containing a total of 940K lines of programs and proofs, with a total of 54k top-level F* definitions. We believe this is the largest corpus of SMT-assisted program proofs coupled with a reproducible program-fragment checker. Grounded in this dataset, we investigate the use of AI to synthesize programs and their proofs in F*, with promising results. Our main finding in that the performance of fine-tuned smaller language models (such as Phi-2 or StarCoder) compare favorably with large language models (such as GPT-4), at a much lower computational cost. We also identify various type-based retrieval augmentation techniques and find that they boost performance significantly. With detailed error analysis and case studies, we identify potential strengths and weaknesses of models and techniques and suggest directions for future improvements.

Comments:	47th International Conference on Software Engineering
Subjects:	Programming Languages (cs.PL); Artificial Intelligence (cs.AI); Software Engineering (cs.SE)
Cite as:	arXiv:2405.01787 [cs.PL]
	(or arXiv:2405.01787v3 [cs.PL] for this version)
	https://doi.org/10.48550/arXiv.2405.01787

Submission history

From: Saikat Chakraborty [view email]
[v1] Fri, 3 May 2024 00:14:33 UTC (695 KB)
[v2] Tue, 3 Sep 2024 17:11:31 UTC (758 KB)
[v3] Wed, 4 Sep 2024 19:32:16 UTC (758 KB)

Computer Science > Programming Languages

Title:Towards Neural Synthesis for SMT-Assisted Proof-Oriented Programming

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Programming Languages

Title:Towards Neural Synthesis for SMT-Assisted Proof-Oriented Programming

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators