Deep Reinforcement Learning for Heat Pump Control