Training Diverse High-Dimensional Controllers by Scaling Covariance Matrix Adaptation MAP-Annealing
Tjanaka, Bryon, Fontaine, Matthew C., Lee, David H., Kalkar, Aniruddha, Nikolaidis, Stefanos
–arXiv.org Artificial Intelligence
Pre-training a diverse set of neural network controllers in simulation has enabled robots to adapt online to damage in robot locomotion tasks. However, finding diverse, high-performing controllers requires expensive network training and extensive tuning of a large number of hyperparameters. On the other hand, Covariance Matrix Adaptation MAP-Annealing (CMA-MAE), an evolution strategies (ES)-based quality diversity algorithm, does not have these limitations and has achieved state-of-the-art performance on standard QD benchmarks. However, CMA-MAE cannot scale to modern neural network controllers due to its quadratic complexity. We leverage efficient approximation methods in ES to propose three new CMA-MAE variants that scale to high dimensions. Our experiments show that the variants outperform ES-based baselines in benchmark robotic locomotion tasks, while being comparable with or exceeding state-of-the-art deep reinforcement learning-based quality diversity algorithms.
arXiv.org Artificial Intelligence
Sep-15-2023
- Country:
- North America > United States
- New York > New York County
- New York City (0.05)
- California
- Los Angeles County > Los Angeles (0.28)
- San Diego County > San Diego (0.04)
- New York > New York County
- North America > United States
- Genre:
- Research Report > New Finding (0.47)
- Technology:
- Information Technology > Artificial Intelligence
- Robots (1.00)
- Machine Learning
- Evolutionary Systems (0.90)
- Reinforcement Learning (0.87)
- Neural Networks (0.73)
- Information Technology > Artificial Intelligence