A Regularized Approach to Sparse Optimal Policy in Reinforcement Learning