A Regularized Approach to Sparse Optimal Policy in Reinforcement Learning
Wenhao Yang, Xiang Li, Zhihua Zhang
–Neural Information Processing Systems
Even if the optimal policy is obtained precisely, it is often the case the optimal policy is deterministic.
Neural Information Processing Systems
Feb-12-2026, 00:07:26 GMT