Multitask Neuroevolution for Reinforcement Learning with Long and Short Episodes

Zhang, Nick, Gupta, Abhishek, Chen, Zefeng, Ong, Yew-Soon

Nov-13-2022–arXiv.org Artificial Intelligence

Studies have shown evolution strategies (ES) to be a promising approach for reinforcement learning (RL) with deep neural networks. However, the issue of high sample complexity persists in applications of ES to deep RL over long horizons. This paper is the first to address the shortcoming of today's methods via a novel neuroevolutionary multitasking (NuEMT) algorithm, designed to transfer information from a set of auxiliary tasks (of short episode length) to the target (full length) RL task at hand. The auxiliary tasks, extracted from the target, allow an agent to update and quickly evaluate policies on shorter time horizons. The evolved skills are then transferred to guide the longer and harder task towards an optimal policy. We demonstrate that the NuEMT algorithm achieves data-efficient evolutionary RL, reducing expensive agent-environment interaction data requirements. Our key algorithmic contribution in this setting is to introduce, for the first time, a multitask skills transfer mechanism based on the statistical importance sampling technique. In addition, an adaptive resource allocation strategy is utilized to assign computational resources to auxiliary tasks based on their gleaned usefulness. Experiments on a range of continuous control tasks from the OpenAI Gym confirm that our proposed algorithm is efficient compared to recent ES baselines.

evolutionary algorithm, machine learning, reinforcement learning, (16 more...)

arXiv.org Artificial Intelligence

Nov-13-2022

arXiv.org PDF

Add feedback

Country:
- North America
  - United States
    - New York
      - Richmond County > New York City (0.04)
      - Queens County > New York City (0.04)
      - New York County > New York City (0.04)
      - Kings County > New York City (0.04)
      - Bronx County > New York City (0.04)
    - California > San Diego County
      - Carlsbad (0.04)
  - Puerto Rico > San Juan
    - San Juan (0.04)
  - Canada > British Columbia
    - Metro Vancouver Regional District > Vancouver (0.04)
- Europe
  - Sweden > Stockholm
    - Stockholm (0.04)
  - France > Hauts-de-France
    - Nord > Lille (0.04)
  - Belgium > Flanders
    - West Flanders > Bruges (0.04)
- Asia
  - Singapore (0.04)
  - China (0.04)
  - Middle East > Jordan (0.04)
  - Macao (0.04)
  - Indonesia > Bali (0.04)

Genre:
- Research Report > New Finding (0.67)

Industry:
- Leisure & Entertainment > Games (0.67)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Reinforcement Learning (1.00)
  - Evolutionary Systems (1.00)
  - Neural Networks > Deep Learning (0.57)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found