Safe Policy Optimization with Local Generalized Linear Function Approximations

Neural Information Processing Systems 

We propose a novel algorithm, SPO-LF, that optimizes an agent's policy while

Similar Docs  Excel Report  more

TitleSimilaritySource
None found