Formal Theorem Proving by Rewarding LLMs to Decompose Proofs Hierarchically
