Learning Fair Ranking Policies via Differentiable Optimization of Ordered Weighted Averages