Constrained Reinforcement Learning for Safe Heat Pump Control