Towards Generalization and Simplicity in Continuous Control
Rajeswaran, Aravind, Lowrey, Kendall, Todorov, Emanuel V., Kakade, Sham M.
–Neural Information Processing Systems
The remarkable successes of deep learning in speech recognition and computer vision have motivated efforts to adapt similar techniques to other problem domains, including reinforcement learning (RL). Consequently, RL methods have produced rich motor behaviors on simulated robot tasks, with their success largely attributed to the use of multi-layer neural networks. This work is among the first to carefully study what might be responsible for these recent advancements. Our main result calls this emerging narrative into question by showing that much simpler architectures -- based on linear and RBF parameterizations -- achieve comparable performance to state of the art results. We not only study different policy representations with regard to performance measures at hand, but also towards robustness to external perturbations.
Neural Information Processing Systems
Feb-14-2020, 19:11:10 GMT
- Technology: