Contrasting Exploration in Parameter and Action Space: A Zeroth-Order Optimization Perspective

Vemula, Anirudh, Sun, Wen, Bagnell, J. Andrew

Jan-31-2019–arXiv.org Machine Learning

Black-box optimizers that explore in parameter space have often been shown to outperform more sophisticated action space exploration methods developed specifically for the reinforcement learning problem. We examine these black-box methods closely to identify the situations in which they are worse than action space exploration methods and those in which they are superior. Through simple theoretical analyses, we prove that the complexity of exploration in parameter space depends on the dimensionality of the parameter space, while the complexity of exploration in action space depends on both the dimensionality of the action space and the horizon length. We also demonstrate this empirically by comparing simple exploration methods on several model problems, including Contextual Bandit, Linear Regression and Reinforcement Learning in continuous control.
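To make the contrast concrete, here is a minimal sketch of the two styles of exploration on a one-step linear regression problem. It is an illustration under stated assumptions, not the authors' algorithm: the two-point zeroth-order estimator, the squared loss, and all function names (`parameter_space_gradient`, `action_space_gradient`, `relative_error`) are hypothetical choices made for this example. The parameter-space estimator perturbs the whole weight matrix, while the action-space estimator perturbs only the prediction and then applies the chain rule through the linear model.

```python
import numpy as np


def loss(action, target):
    """Squared error of a predicted action (an assumed loss for this sketch)."""
    return float(np.sum((action - target) ** 2))


def unit_direction(rng, dim):
    """Sample a direction uniformly from the unit sphere in R^dim."""
    v = rng.standard_normal(dim)
    return v / np.linalg.norm(v)


def parameter_space_gradient(W, x, y, delta, rng):
    """Two-point zeroth-order estimate obtained by perturbing the parameters W."""
    u = unit_direction(rng, W.size).reshape(W.shape)
    diff = loss((W + delta * u) @ x, y) - loss((W - delta * u) @ x, y)
    # The scale factor is the parameter dimension W.size.
    return W.size * diff / (2.0 * delta) * u


def action_space_gradient(W, x, y, delta, rng):
    """Two-point estimate obtained by perturbing the action a = W x."""
    a = W @ x
    u = unit_direction(rng, a.size)
    diff = loss(a + delta * u, y) - loss(a - delta * u, y)
    # The scale factor is the (usually much smaller) action dimension.
    grad_action = a.size * diff / (2.0 * delta) * u
    # Chain rule through the linear model: dL/dW = (dL/da) x^T.
    return np.outer(grad_action, x)


def true_gradient(W, x, y):
    """Analytic gradient of the squared loss, used only for comparison."""
    return np.outer(2.0 * (W @ x - y), x)


def relative_error(estimator, W, x, y, n_samples=4000, delta=0.1, seed=0):
    """Relative error of the averaged estimate against the analytic gradient."""
    rng = np.random.default_rng(seed)
    mean = sum(estimator(W, x, y, delta, rng) for _ in range(n_samples)) / n_samples
    g = true_gradient(W, x, y)
    return float(np.linalg.norm(mean - g) / np.linalg.norm(g))
```

In this sketch the horizon is a single step, so the action-space estimator only pays for the action dimension, while the parameter-space estimator pays for every entry of `W`. With a 2-dimensional action and a 20-dimensional input, averaging the same number of samples should leave the action-space estimate noticeably closer to the analytic gradient. The sketch does not model longer horizons, which is where the abstract says the cost of action-space exploration grows.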
