On the Sample Complexity of Actor-Critic Method for Reinforcement Learning with Function Approximation

Open in new window