Variance Reduced Policy Evaluation with Smooth Function Approximation
Hoi-To Wai, Mingyi Hong, Zhuoran Yang, Zhaoran Wang, Kexin Tang
–Neural Information Processing Systems
Neural Information Processing Systems
Mar-23-2025, 19:16:43 GMT
Hoi-To Wai, Mingyi Hong, Zhuoran Yang, Zhaoran Wang, Kexin Tang
–Neural Information Processing Systems
Neural Information Processing Systems
Mar-23-2025, 19:16:43 GMT