Approximation Benefits of Policy Gradient Methods with Aggregated States

Open in new window