Simplifying Formal Proof-Generating Models with ChatGPT and Basic Searching Techniques

Han, Sangjun, Hur, Taeil, Hur, Youngmi, Lee, Kathy Sangkyung, Lee, Myungyoon, Lim, Hyojae

Feb-7-2025–arXiv.org Artificial Intelligence

The challenge of formal proof generation has a rich history, but with modern techniques, we may finally be at the stage of making actual progress in real-life mathematical problems. This paper explores the integration of ChatGPT and basic searching techniques to simplify generating formal proofs, with a particular focus on the miniF2F dataset. We demonstrate how combining a large language model like ChatGPT with a formal language such as Lean, which has the added advantage of being verifiable, enhances the efficiency and accessibility of formal proof generation. Despite its simplicity, our best-performing Lean-based model surpasses all known benchmarks with a 31.15% pass rate. We extend our experiments to include other datasets and employ alternative language models, showcasing our models' comparable performance in diverse settings and allowing for a more nuanced analysis of our results. Our findings offer insights into AI-assisted formal proof generation, suggesting a promising direction for future research in formal mathematical proof.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

Feb-7-2025

arXiv.org PDF

Add feedback

Country:
- Europe
  - Germany (0.04)
  - Austria > Upper Austria
    - Linz (0.04)
- Asia > South Korea
  - Seoul > Seoul (0.04)

Genre:
- Research Report > New Finding (1.00)

Industry:
- Education > Educational Setting (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Large Language Model (1.00)
    - Chatbot (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found