A Novel Framework for Policy Mirror Descent with General Parameterization and Linear Convergence

Open in new window