On the Global Convergence of Fitted Q-Iteration with Two-layer Neural Network Parametrization
