Ranking LLM-Generated Loop Invariants for Program Verification

Chakraborty, Saikat, Lahiri, Shuvendu K., Fakhoury, Sarah, Musuvathi, Madanlal, Lal, Akash, Rastogi, Aseem, Senthilnathan, Aditya, Sharma, Rahul, Swamy, Nikhil

Oct-18-2023–arXiv.org Artificial Intelligence

Synthesizing inductive loop invariants is fundamental to automating program verification. In this work, we observe that Large Language Models (such as gpt-3.5 or gpt-4) are capable of synthesizing loop invariants for a class of programs in a 0-shot setting, yet require several samples to generate the correct invariants. This can lead to a large number of calls to a program verifier to establish an invariant. To address this issue, we propose a {\it re-ranking} approach for the generated results of LLMs. We have designed a ranker that can distinguish between correct inductive invariants and incorrect attempts based on the problem definition. The ranker is optimized as a contrastive ranker. Experimental results demonstrate that this re-ranking mechanism significantly improves the ranking of correct invariants among the generated candidates, leading to a notable reduction in the number of calls to a verifier.

program verification, ranking llm-generated loop invariant

arXiv.org Artificial Intelligence

Oct-18-2023

arXiv.org PDF

Add feedback

Genre:
- Research Report (0.69)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.44)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found