Reviews: Finite-Time Performance Bounds and Adaptive Learning Rate Selection for Two Time-Scale Reinforcement Learning

Neural Information Processing Systems 

NeurIPS 2019, Sun Dec 8th through Sat the 14th, 2019, at Vancouver Convention Center

Paper 2626: "Finite-Time Performance Bounds and Adaptive Learning Rate Selection for Two Time-Scale Reinforcement Learning"

The reviewers unanimously support acceptance. We strongly encourage the authors to consider the reviewers' suggestions when preparing the camera-ready version.