BayesianRiskMarkovDecisionProcesses
–Neural Information Processing Systems
Markov decision process (MDP) is a paradigm for modeling sequential decision making under uncertainty. From a modeling perspective, some parameters of MDPs are unknown and need to be estimated from data. In this paper, we consider MDPs where transition probability and cost parametersarenotknown.
Neural Information Processing Systems
Feb-9-2026, 16:57:25 GMT