Parameterized Exploration

Jul-13-2019–arXiv.org Artificial Intelligence

We introduce Parameterized Exploration (PE), a simple family of methods for model-based tuning of the exploration schedule in sequential decision problems. Unlike common heuristics for exploration, our method accounts for the time horizon of the decision problem as well as the agent's current state of knowledge of the dynamics of the decision problem. We show our method as applied to several common exploration techniques has superior performance relative to un-tuned counterparts in Bernoulli and Gaussian multi-armed bandits, contextual bandits, and a Markov decision process based on a mobile health (mHealth) study. We also examine the effects of the accuracy of the estimated dynamics model on the performance of PE.

data mining, machine learning, reinforcement learning, (21 more...)

arXiv.org Artificial Intelligence

Jul-13-2019

arXiv.org PDF

Add feedback

Country:
- North America > United States > North Carolina (0.14)

Genre:
- Research Report (1.00)

Industry:
- Health & Medicine
  - Pharmaceuticals & Biotechnology (0.48)
  - Therapeutic Area > Endocrinology
    - Diabetes (0.47)

Technology:
- Information Technology
  - Data Science > Data Mining
    - Big Data (0.67)
  - Artificial Intelligence > Machine Learning
    - Reinforcement Learning (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found