A Safe Reinforcement Learning Algorithm for Supervisory Control of Power Plants