Constrained Optimal Fuel Consumption of HEV: A Constrained Reinforcement Learning Approach