TacticZero: Learning to Prove Theorems from Scratch with Deep Reinforcement Learning