DualDICE: Behavior-Agnostic Estimation of Discounted Stationary Distribution Corrections
Ofir Nachum, Yinlam Chow, Bo Dai, Lihong Li
–Neural Information Processing Systems
Neural Information Processing Systems
Mar-26-2025, 22:48:14 GMT
Ofir Nachum, Yinlam Chow, Bo Dai, Lihong Li
–Neural Information Processing Systems
Neural Information Processing Systems
Mar-26-2025, 22:48:14 GMT