A Omitted Statements and Proofs
–Neural Information Processing Systems
To obtain a conservative value estimation, we follow the suggestions given by Fujimoto et al. (2019) and Liu et al. (2020) to prune the unseen state-action pairs in
Neural Information Processing Systems
Aug-14-2025, 13:56:05 GMT