Correlation Priors for Reinforcement Learning
Alt, Bastian, Šošić, Adrian, Koeppl, Heinz
–Neural Information Processing Systems
Many decision-making problems naturally exhibit pronounced structures inherited from the characteristics of the underlying environment. In a Markov decision process model, for example, two distinct states can have inherently related semantics or encode resembling physical state configurations. This often implies locally correlated transition dynamics among the states. In order to complete a certain task in such environments, the operating agent usually needs to execute a series of temporally and spatially correlated actions. Though there exists a variety of approaches to capture these correlations in continuous state-action domains, a principled solution for discrete environments is missing.
Neural Information Processing Systems
Mar-19-2020, 02:31:25 GMT
- Technology: