Optimal Sample Complexity for Single Time-Scale Actor-Critic with Momentum

Open in new window