Reinforcement Learning with Multiple Experts: A Bayesian Model Combination Approach
Michael Gimelfarb, Scott Sanner, Chi-Guhn Lee
Neural Information Processing Systems
Potential-based reward shaping incorporates prior domain knowledge in the form of additional rewards provided during training to speed up convergence of reinforcement learning algorithms, without changing the optimal policies (Ng et al. [1999]).
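As a rough illustration of the shaping idea, the sketch below adds the standard potential-based term F(s, s') = γΦ(s') − Φ(s) from Ng et al. [1999] to an environment reward. The potential function, the integer state encoding, and the goal state are assumptions made for this example only; they are not taken from the paper.

```python
def potential(state, goal=10):
    """Hypothetical potential: negative distance to an assumed goal state."""
    return -abs(goal - state)


def shaped_reward(reward, state, next_state, gamma=0.99):
    """Environment reward plus the shaping term gamma * Phi(s') - Phi(s)."""
    return reward + gamma * potential(next_state) - potential(state)


def discounted_shaping_sum(states, gamma=0.99):
    """Discounted sum of the shaping terms alone along a trajectory."""
    total = 0.0
    for t in range(len(states) - 1):
        shaping = gamma * potential(states[t + 1]) - potential(states[t])
        total += (gamma ** t) * shaping
    return total
```

Along any trajectory the shaping terms telescope to γ^T Φ(s_T) − Φ(s_0), which depends only on the endpoints. This is the intuition behind the claim that the optimal policies are left unchanged.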
Nov-18-2025, 14:47:04 GMT