stackon
Elaboration Tolerant Representation of Markov Decision Process via Decision-Theoretic Extension of Probabilistic Action Language pBC+
We extend probabilistic action language pBC+ with the notion of utility as in decision theory. The semantics of the extended pBC+ can be defined as a shorthand notation for a decision-theoretic extension of the probabilistic answer set programming language LPMLN. Alternatively, the semantics of pBC+ can also be defined in terms of Markov Decision Process (MDP), which in turn allows for representing MDP in a succinct and elaboration tolerant way as well as to leverage an MDP solver to compute pBC+. The idea led to the design of the system pbcplus2mdp, which can find an optimal policy of a pBC+ action description using an MDP solver.
Envelope-based Planning in Relational MDPs
Gardiol, Natalia H., Kaelbling, Leslie P.
A mobile robot acting in the world is faced with a large amount of sensory data and uncertainty in its action outcomes. Indeed, almost all interesting sequential decision-making domains involve large state spaces and large, stochastic action sets. We investigate a way to act intelligently as quickly as possible in domains where finding a complete policy would take a hopelessly long time.
Envelope-based Planning in Relational MDPs
Gardiol, Natalia H., Kaelbling, Leslie P.
A mobile robot acting in the world is faced with a large amount of sensory data and uncertainty in its action outcomes. Indeed, almost all interesting sequential decision-making domains involve large state spaces and large, stochastic action sets. We investigate a way to act intelligently as quickly as possible in domains where finding a complete policy would take a hopelessly long time.
Envelope-based Planning in Relational MDPs
Gardiol, Natalia H., Kaelbling, Leslie P.
A mobile robot acting in the world is faced with a large amount of sensory dataand uncertainty in its action outcomes. Indeed, almost all interesting sequentialdecision-making domains involve large state spaces and large, stochastic action sets. We investigate a way to act intelligently asquickly as possible in domains where finding a complete policy would take a hopelessly long time. This approach, Relational Envelopebased Planning(REBP) tackles large, noisy problems along two axes.