Equivalence Between Wasserstein and Value-Aware Model-based Reinforcement Learning

Kavosh Asadi, Evan Cater, Dipendra Misra, Michael L. Littman

arXiv.org Machine Learning 

Learning a generative model is a key component of model-based reinforcement learning. Though learning a good model in the tabular setting is a simple task, learning a useful model in the approximate setting is challenging. Recently, Farahmand et al. (2017) proposed a value-aware model learning (VAML) objective that captures the structure of the value function during model learning. Using tools from Lipschitz continuity, we show that minimizing the VAML objective is in fact equivalent to minimizing the Wasserstein metric.
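As an illustration not drawn from the paper, the Wasserstein metric mentioned above can be sketched for the simplest case: for two discrete distributions on a shared one-dimensional support, the 1-Wasserstein (earth mover's) distance equals the integral of the absolute difference of their CDFs. The function name and example values below are hypothetical:

```python
# Illustrative sketch (not from the paper): 1-Wasserstein distance between
# two discrete distributions on the same sorted 1-D support, computed via
# the CDF characterization W1(p, q) = integral |F_p(x) - F_q(x)| dx.

def wasserstein_1d(support, p, q):
    """W1 between probability vectors p and q on a shared sorted support."""
    total = 0.0
    cdf_gap = 0.0
    for i in range(len(support) - 1):
        cdf_gap += p[i] - q[i]          # running difference of CDFs
        total += abs(cdf_gap) * (support[i + 1] - support[i])
    return total

# Point masses at 0 and 1: moving unit mass a distance of 1 costs 1.
print(wasserstein_1d([0.0, 1.0], [1.0, 0.0], [0.0, 1.0]))  # 1.0
```

This CDF form only applies in one dimension; in general the metric is defined through an optimal-transport problem over joint distributions.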
