Efficient Potential-based Exploration in Reinforcement Learning using Inverse Dynamic Bisimulation Metric

Neural Information Processing Systems 

While a number of RL methods have been proposed to boost exploration by designing an intrinsic reward signal as exploration bonus.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found