Regularizing Trajectory Optimization with Denoising Autoencoders

Boney, Rinu, Palo, Norman Di, Berglund, Mathias, Ilin, Alexander, Kannala, Juho, Rasmus, Antti, Valpola, Harri

Mar-18-2020, 21:32:33 GMT–Neural Information Processing Systems

Trajectory optimization using a learned model of the environment is one of the core elements of model-based reinforcement learning. This procedure often suffers from exploiting inaccuracies of the learned model. We propose to regularize trajectory optimization by means of a denoising autoencoder that is trained on the same trajectories as the model of the environment. We show that the proposed regularization leads to improved planning with both gradient-based and gradient-free optimizers. We also demonstrate that using regularized trajectory optimization leads to rapid initial learning in a set of popular motor control tasks, which suggests that the proposed approach can be a useful tool for improving sample efficiency.

denoising autoencoder, learning, regularizing trajectory optimization

Neural Information Processing Systems

Mar-18-2020, 21:32:33 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)