A Deep Reinforcement Learning based Approach to Learning Transferable Proof Guidance Strategies