Convergence of Actor-Critic Methods with Multi-Layer Neural Networks

Apr-25-2026, 15:44:47 GMT–Neural Information Processing Systems

The early theory of actor-critic methods considered convergence using linear function approximators for the policy and value functions. Recent work has established convergence using neural network approximators with a single hidden layer. In this work we are taking the natural next step and establish convergence using deep neural networks with an arbitrary number of hidden layers, thus closing a gap between theory and practice. We show that actor-critic updates projected on a ball around the initial condition will converge to a neighborhood where the average of the squared gradients is O(1/ m)+O(ϵ), with mbeing the width of the neural network and ϵthe approximation quality of the best critic neural network over the projected set.

artificial intelligence, machine learning, min 2, (16 more...)

Neural Information Processing Systems

Apr-25-2026, 15:44:47 GMT

Conferences PDF

Add feedback

Country:
- North America > United States (0.28)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)

Duplicate Docs Excel Report

Title
Convergence of Actor-Critic Methods with Multi-Layer Neural Networks

Similar Docs Excel Report more

Title	Similarity	Source
None found