State-Visitation Fairness in Average-Reward MDPs

Ghalme, Ganesh, Nair, Vineet, Patil, Vishakha, Zhou, Yilun

Feb-14-2021–arXiv.org Artificial Intelligence

Fairness has emerged as an important concern in automated decision-making in recent years, especially when these decisions affect human welfare. In this work, we study fairness in temporally extended decision-making settings, specifically those formulated as Markov Decision Processes (MDPs). Our proposed notion of fairness ensures that each state's long-term visitation frequency is more than a specified fraction. In an average-reward MDP (AMDP) setting, we formulate the problem as a bilinear saddle point program and, for a generative model, solve it using a Stochastic Mirror Descent (SMD) based algorithm. The proposed solution guarantees a simultaneous approximation on the expected average-reward and the long-term state-visitation frequency. We validate our theoretical results with experiments on synthetic data.

algorithm, algorithm 1, fairness, (15 more...)

arXiv.org Artificial Intelligence

Feb-14-2021

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Massachusetts > Middlesex County > Cambridge (0.04)
- Asia > Middle East
  - Israel (0.04)

Genre:
- Research Report (0.64)

Industry:
- Social Sector (0.46)
- Transportation
  - Passenger (0.68)
  - Ground > Road (0.46)

Technology:
- Information Technology
  - Data Science > Data Mining
    - Big Data (0.46)
  - Artificial Intelligence
    - Representation & Reasoning
      - Optimization (0.46)
      - Mathematical & Statistical Methods (0.46)
    - Machine Learning > Learning Graphical Models
      - Undirected Networks > Markov Models (0.66)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found