Review for NeurIPS paper: Improving Sample Complexity Bounds for (Natural) Actor-Critic Algorithms

Jan-23-2025, 02:04:22 GMT–Neural Information Processing Systems

Additional Feedback: The authors' response has addressed my questions. I will keep my score. This is a natural question to ask, so it could be worth an explanation somewhere. However, this paper suggests a slower rate by a factor of (1-\gamma) {-2}. What could cause the difference and how could the theory here guide development of deep RL algorithms?

actor-critic algorithm, neurips paper, sample complexity bound

Neural Information Processing Systems

Jan-23-2025, 02:04:22 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.70)