Policy Optimization finds Nash Equilibrium in Regularized General-Sum LQ Games

Open in new window