Towards Automated Functional Equation Proving: A Benchmark Dataset and A Domain-Specific In-Context Agent
Buali, Mahdi, Hoehndorf, Robert
–arXiv.org Artificial Intelligence
Automated Theorem Proving (ATP) faces significant challenges due to the vast action space and the computational demands of proof generation. Recent advances have utilized Large Language Models (LLMs) for action selection in ATP, but these methods often require substantial computational resources. This study introduces the Functional Equation Automated Solver (FEAS), an agent that builds on the COPRA in-context learning framework within the Lean environment. FEAS innovates by refining prompt generation and response parsing mechanisms, integrating domain-specific heuristics for functional equations, and introducing the FunEq dataset--a rigorously curated collection of functional equation problems categorized into three difficulty levels. The agent's performance is evaluated against established baselines using this dataset, demonstrating improvements in theorem proving accuracy, particularly with the integration of functional equation-specific heuristics. Our results highlight the effectiveness of FEAS in generating and formalizing high-level proof strategies into Lean proofs, emphasizing the potential of tailored approaches in domain-specific ATP challenges.
arXiv.org Artificial Intelligence
Jul-5-2024
- Country:
- Asia
- Indonesia > Java
- Yogyakarta > Yogyakarta (0.04)
- Middle East > Saudi Arabia (0.04)
- Indonesia > Java
- Asia
- Genre:
- Research Report > New Finding (0.34)
- Technology: