Towards Finding Longer Proofs
Zombori, Zsolt, Csiszárik, Adrián, Michalewski, Henryk, Kaliszyk, Cezary, Urban, Josef
–arXiv.org Artificial Intelligence
We present a reinforcement learning (RL) based guidance system for automated theorem proving geared towards Finding Longer Proofs (FLoP). FLoP focuses on generalizing from short proofs to longer ones of similar structure. To achieve that, FLoP uses state-of-the-art RL approaches that were previously not applied in theorem proving. In particular, we show that curriculum learning significantly outperforms previous learning-based proof guidance on a synthetic dataset of increasingly difficult arithmetic problems.
arXiv.org Artificial Intelligence
May-30-2019
- Country:
- Europe (1.00)
- North America > United States
- California (0.28)
- New York > New York County
- New York City (0.14)
- Oceania > Australia
- New South Wales > Sydney (0.14)
- Genre:
- Industry:
- Energy (0.46)
- Leisure & Entertainment > Games (0.46)
- Technology: