Policy Optimization with Advantage Regularization for Long-Term Fairness in Decision Systems

Neural Information Processing Systems 

Long-term fairness is an important consideration in designing and deploying learning-based decision systems in high-stakes decision-making contexts.