Reinforcement Learning with Multiple Experts: A Bayesian Model Combination Approach

Gimelfarb, Michael, Sanner, Scott, Lee, Chi-Guhn

Dec-31-2018–Neural Information Processing Systems

Potential based reward shaping is a powerful technique for accelerating convergence of reinforcement learning algorithms. Typically, such information includes an estimate of the optimal value function and is often provided by a human expert or other sources of domain knowledge. However, this information is often biased or inaccurate and can mislead many reinforcement learning algorithms. In this paper, we apply Bayesian Model Combination with multiple experts in a way that learns to trust a good combination of experts as training progresses. This approach is both computationally efficient and general, and is shown numerically to improve convergence across discrete and continuous domains and different reinforcement learning algorithms.

artificial intelligence, bayesian inference, reinforcement learning, (16 more...)

Neural Information Processing Systems

Dec-31-2018

Conferences PDF

Add feedback

Country:
- North America > Canada > Ontario > Toronto (0.28)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning
    - Learning Graphical Models > Directed Networks
      - Bayesian Learning (1.00)
    - Reinforcement Learning (1.00)
  - Representation & Reasoning > Uncertainty
    - Bayesian Inference (1.00)

Duplicate Docs Excel Report

Title
Reinforcement Learning with Multiple Experts: A Bayesian Model Combination Approach
Reinforcement Learning with Multiple Experts: A Bayesian Model Combination Approach

Similar Docs Excel Report more

Title	Similarity	Source
None found