Policy Optimization as Wasserstein Gradient Flows

Open in new window