Promoting Stochasticity for Expressive Policies via a Simple and Efficient Regularization Method

Neural Information Processing Systems 

Based on our regularization, we propose an off-policy actor-critic algorithm.
