Data-Driven Policy Mapping for Safe RL-based Energy Management Systems