Fat-to-Thin Policy Optimization: Offline RL with Sparse Policies

Open in new window