Model

Neural Information Processing Systems 

We further show that optimistic posterior sampling can control this Hellinger distance, when we measure model error via data likelihood. This technique allows us to design and analyze unified posterior sampling algorithms with state-of-the-art sample complexity guarantees for many model-based RL settings.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found