Model
–Neural Information Processing Systems
We further show that optimistic posterior sampling can control this Hellinger distance, when we measure model error via data likelihood. This technique allows us to design and analyze unified posterior sampling algorithms with state-of-the-art sample complexity guarantees for many model-based RL settings.
Neural Information Processing Systems
Feb-12-2026, 12:16:05 GMT