Non-conflicting Energy Minimization in Reinforcement Learning based Robot Control