Offline Meta Reinforcement Learning - Identifiability Challenges and Effective Data Collection Strategies
–Neural Information Processing Systems
Here, we take a Bayesian RL (BRL) view, and seek to learn a Bayes-optimal policy from the offline data.
Neural Information Processing Systems
Oct-9-2025, 14:26:01 GMT
- Country:
- Asia > Middle East
- Israel (0.04)
- North America > United States
- Massachusetts (0.04)
- Asia > Middle East
- Genre:
- Research Report (1.00)
- Industry:
- Education (0.46)