Reinforced Large Language Model is a formal theorem prover