Expected Scalarised Returns Dominance: A New Solution Concept for Multi-Objective Decision Making

Hayes, Conor F., Verstraeten, Timothy, Roijers, Diederik M., Howley, Enda, Mannion, Patrick

Jun-2-2021–arXiv.org Artificial Intelligence

In many real-world scenarios, the utility of a user is derived from the single execution of a policy. In this case, to apply multi-objective reinforcement learning, the expected utility of the returns must be optimised. Various scenarios exist where a user's preferences over objectives (also known as the utility function) are unknown or difficult to specify. In such scenarios, a set of optimal policies must be learned. However, settings where the expected utility must be maximised have been largely overlooked by the multi-objective reinforcement learning community and, as a consequence, a set of optimal solutions has yet to be defined. In this paper we address this challenge by proposing first-order stochastic dominance as a criterion to build solution sets to maximise expected utility. We also propose a new dominance criterion, known as expected scalarised returns (ESR) dominance, that extends first-order stochastic dominance to allow a set of optimal policies to be learned in practice. We then define a new solution concept called the ESR set, which is a set of policies that are ESR dominant. Finally, we define a new multi-objective distributional tabular reinforcement learning (MOT-DRL) algorithm to learn the ESR set in a multi-objective multi-armed bandit setting.

esr criterion, utility function, value distribution, (15 more...)

arXiv.org Artificial Intelligence

Jun-2-2021

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - New York (0.04)
  - Iowa (0.04)
  - Massachusetts > Middlesex County
    - Cambridge (0.04)
- Europe
  - Belgium (0.04)
  - Netherlands (0.04)
  - United Kingdom > England
    - Oxfordshire > Oxford (0.04)
    - Cambridgeshire > Cambridge (0.04)
  - Spain > Andalusia
    - Cádiz Province > Cadiz (0.04)
  - Ireland > Connaught
    - County Galway > Galway (0.04)
- Asia
  - Singapore (0.04)
  - Japan > Honshū
    - Kansai > Kyoto Prefecture > Kyoto (0.04)

Genre:
- Research Report (0.64)

Industry:
- Health & Medicine
  - Pharmaceuticals & Biotechnology (0.48)
  - Therapeutic Area > Vaccines (0.30)

Technology:
- Information Technology
  - Game Theory (1.00)
  - Decision Support Systems (1.00)
  - Artificial Intelligence
    - Representation & Reasoning > Agents (1.00)
    - Machine Learning > Reinforcement Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found