Project proposal: A modular reinforcement-learning-based automated theorem prover