Learning Fair Policies in Multiobjective (Deep) Reinforcement Learning with Average and Discounted Rewards