Multi-objective dynamic programming with limited precision
Mandow, L., de la Cruz, J. L. Pérez, Pozas, N.
Markov decision processes (MDP) are a well-known conceptual tool useful for modelling sequential decision processes and have been widely used in real-world applications such as adaptive production control (see e. g. Kuhnle et al. (2020)), equipment maintenance (see Barde et al. (2019), Liu et al. (2019)) or robot planning (see Veeramani et al. (2020)), to name a few. Usual optimization procedures take into account just a scalar value to be maximized. However, in many cases the objective function is more accurately described by a vector (see e. g. Gen and Lin (2014), Zhang and Xu (2017)) and multi-objective optimization must be applied.
Sep-17-2020