Learning Optimal Fair Policies