Provably Global Convergence of Actor-Critic: A Case for Linear Quadratic Regulator with Ergodic Cost

Zhuoran Yang, Yongxin Chen, Mingyi Hong, Zhaoran Wang

Neural Information Processing Systems 

Moreover, weassumethatthedimensionsd and k are fixedthroughoutthispaper.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found