Offline Meta Reinforcement Learning - Identifiability Challenges and Effective Data Collection Strategies

Neural Information Processing Systems 

Here, we take a Bayesian RL (BRL) view, and seek to learn a Bayes-optimal policy from the offline data.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found