Learning Markov State Abstractions for Deep Reinforcement Learning

Oct-10-2024, 04:41:12 GMT–Neural Information Processing Systems

A fundamental assumption of reinforcement learning in Markov decision processes (MDPs) is that the relevant decision process is, in fact, Markov. However, when MDPs have rich observations, agents typically learn by way of an abstract state representation, and such representations are not guaranteed to preserve the Markov property. We introduce a novel set of conditions and prove that they are sufficient for learning a Markov abstract state representation. We then describe a practical training procedure that combines inverse model estimation and temporal contrastive learning to learn an abstraction that approximately satisfies these conditions. Our novel training objective is compatible with both online and offline training: it does not require a reward signal, but agents can capitalize on reward information when available.
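The abstract names the two ingredients of the training objective, inverse model estimation and temporal contrastive learning, but not their exact form. The sketch below is one plausible reading, not the authors' implementation. It assumes a linear encoder, a cross-entropy loss for predicting the action from a pair of consecutive abstract states, and a binary discriminator that separates real transitions from pairs whose next state was shuffled within the batch. Every name here (`combined_loss`, `enc_weights`, `inv_weights`, `disc_weights`, `alpha`, `beta`) is hypothetical, and no reward signal is used, in line with the abstract.

```python
import math
import random


def linear(weights, vec):
    """Apply a weight matrix (a list of rows) to a vector."""
    return [sum(w * v for w, v in zip(row, vec)) for row in weights]


def log_softmax(logits):
    """Numerically stable log-softmax over a list of logits."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(l - m) for l in logits))
    return [l - log_z for l in logits]


def log_sigmoid(x):
    """Numerically stable log(sigmoid(x))."""
    if x >= 0:
        return -math.log1p(math.exp(-x))
    return x - math.log1p(math.exp(x))


def inverse_model_loss(inv_weights, z, z_next, action):
    """Cross-entropy for predicting the action taken between z and z_next."""
    logits = linear(inv_weights, z + z_next)
    return -log_softmax(logits)[action]


def contrastive_loss(disc_weights, z, z_next, is_real):
    """Binary cross-entropy for telling real transitions from fake ones."""
    score = sum(w * v for w, v in zip(disc_weights, z + z_next))
    return -log_sigmoid(score) if is_real else -log_sigmoid(-score)


def combined_loss(batch, enc_weights, inv_weights, disc_weights,
                  alpha=1.0, beta=1.0, rng=random):
    """Weighted sum of the two losses over a batch of (obs, action, next_obs).

    Assumption: fake pairs are built by shuffling next states within the
    batch, so a fake may occasionally coincide with a real transition.
    """
    encoded = [(linear(enc_weights, obs), action, linear(enc_weights, next_obs))
               for obs, action, next_obs in batch]
    shuffled = [z_next for _, _, z_next in encoded]
    rng.shuffle(shuffled)

    inv_total = 0.0
    con_total = 0.0
    for (z, action, z_next), z_fake in zip(encoded, shuffled):
        inv_total += inverse_model_loss(inv_weights, z, z_next, action)
        con_total += contrastive_loss(disc_weights, z, z_next, True)
        con_total += contrastive_loss(disc_weights, z, z_fake, False)

    n = len(batch)
    return alpha * inv_total / n + beta * con_total / (2 * n)
```

In a real system the encoder, inverse model, and discriminator would be neural networks trained jointly by gradient descent on this loss. The linear maps only keep the sketch dependency-free. Because the loss needs only (observation, action, next observation) triples, the same function applies to a fixed offline dataset or to transitions collected online.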

deep reinforcement learning, learning markov state abstraction, representation

Genre:
- Instructional Material (0.63)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)