MOPO: Model-based Offline Policy Optimization
–Neural Information Processing Systems
However, standard model-based RL methods, designed for the online setting, do not provide an explicit mechanism to avoid the offline setting's distributional shift issue.
Neural Information Processing Systems
Nov-14-2025, 21:04:09 GMT
- Country:
- Asia > China
- Shandong Province > Dongying (0.04)
- North America
- Canada (0.04)
- United States
- California > Santa Clara County
- Palo Alto (0.04)
- Massachusetts (0.04)
- California > Santa Clara County
- Asia > China
- Genre:
- Research Report (0.68)
- Technology: