End-to-End Reinforcement Learning for Torque Based Variable Height Hopping