Convergence of Actor-Critic Methods with Multi-Layer Neural Networks

Feb-8-2026, 15:47:06 GMT–Neural Information Processing Systems

The early theory of actor-critic methods considered convergence using linear function approximators for the policy and value functions. Recent work has established convergence using neural network approximators with a single hidden layer. In this work we are taking the natural next step and establish convergence using deep neural networks with an arbitrary number of hidden layers, thus closing a gap between theory and practice. We show that actor-critic updates projected on a ball around the initial condition will converge to a neighborhood where the average of the squared gradients is O (1 / m) + O (ϵ), with m being the width of the neural network and ϵ the approximation quality of the best critic neural network over the projected set.

artificial intelligence, machine learning, min 2, (16 more...)

Neural Information Processing Systems

Feb-8-2026, 15:47:06 GMT

Conferences PDF

Add feedback

Country:
- North America > United States
  - Massachusetts > Suffolk County > Boston (0.04)
- Europe > United Kingdom
  - England > Cambridgeshire > Cambridge (0.04)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)

Duplicate Docs Excel Report

Title
Convergence of Actor-Critic Methods with Multi-Layer Neural Networks

Similar Docs Excel Report more

Title	Similarity	Source
None found