Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning
Steinparz, Christian; Schmied, Thomas; Paischer, Fabian; Dinu, Marius-Constantin; Patil, Vihang; Bitto-Nemling, Angela; Eghbal-zadeh, Hamid; Hochreiter, Sepp
arXiv.org Artificial Intelligence
In lifelong learning, an agent learns throughout its entire life without resets in a constantly changing environment, as humans do. Lifelong learning therefore raises a range of research problems, among them continual domain shifts, which result in non-stationary rewards and environment dynamics. These non-stationarities are difficult to detect and cope with because they are continuous. Exploration strategies and learning methods are therefore required that can track steady domain shifts and adapt to them. We propose Reactive Exploration to track and react to continual domain shifts in lifelong reinforcement learning and to update the policy correspondingly. To this end, we conduct experiments to investigate different exploration strategies. We empirically show that representatives of the policy-gradient family are better suited for lifelong learning, as they adapt more quickly to distribution shifts than Q-learning. As a result, policy-gradient methods profit the most from Reactive Exploration and show good results in lifelong learning with continual domain shifts. Our code is available at: https://github.com/ml-jku/reactive-exploration.
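The general idea of reacting to a domain shift with more exploration can be illustrated with a toy example. The sketch below is not the authors' implementation (that lives in the repository linked above); it is a minimal two-armed bandit, written under our own assumptions, in which the better arm flips partway through. The function name `run_bandit`, the surprise threshold, the learning rate, and the decay factor are all hypothetical choices made for illustration. When `reactive` is enabled, a large reward-prediction error temporarily raises the exploration rate, which then decays back to its baseline.

```python
import random


def run_bandit(steps=2000, shift_at=1000, seed=0, reactive=True):
    """Toy non-stationary bandit: the rewarding arm flips at `shift_at`.

    Returns the mean reward collected after the shift.
    All constants here are illustrative assumptions, not values from the paper.
    """
    rng = random.Random(seed)
    q = [0.0, 0.0]             # running reward estimate per arm
    base_eps = 0.02            # baseline exploration rate
    eps = base_eps
    lr = 0.1                   # step size for the running estimates
    surprise_threshold = 0.5   # prediction error that counts as a "shift"
    reward_after_shift = 0.0

    for t in range(steps):
        # Non-stationarity: arm 0 pays before the shift, arm 1 afterwards.
        means = (1.0, 0.0) if t < shift_at else (0.0, 1.0)

        # Epsilon-greedy action selection.
        if rng.random() < eps:
            action = rng.randrange(2)
        else:
            action = 0 if q[0] >= q[1] else 1

        reward = means[action] + rng.gauss(0.0, 0.1)
        error = abs(reward - q[action])          # reward-prediction error
        q[action] += lr * (reward - q[action])

        if reactive:
            if error > surprise_threshold:
                eps = min(1.0, error)            # large surprise: explore more
            else:
                eps = max(base_eps, eps * 0.9)   # calm: decay back to baseline

        if t >= shift_at:
            reward_after_shift += reward

    return reward_after_shift / (steps - shift_at)


if __name__ == "__main__":
    print("reactive:    ", run_bandit(reactive=True))
    print("non-reactive:", run_bandit(reactive=False))
```

Running the script prints the post-shift mean reward for both variants, so the effect of the surprise-driven exploration boost can be compared for different seeds and shift points. A bandit has no state or dynamics, so this only mirrors the reward side of the non-stationarity described in the abstract.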
Sep-22-2022
- Country:
- South America > Chile
- Oceania > Australia
- Victoria > Melbourne (0.04)
- New South Wales > Sydney (0.04)
- North America
- United States
- New York > New York County
- New York City (0.04)
- Massachusetts > Hampshire County
- Amherst (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- California > Los Angeles County
- Long Beach (0.04)
- Puerto Rico > San Juan
- San Juan (0.04)
- Canada
- Ontario > Toronto (0.04)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- Europe
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Spain > Galicia
- A Coruña Province > Santiago de Compostela (0.04)
- Germany > North Rhine-Westphalia
- Upper Bavaria > Munich (0.04)
- France > Auvergne-Rhône-Alpes
- Austria
- Vienna (0.14)
- Upper Austria > Linz (0.04)
- Africa > Ethiopia
- Addis Ababa > Addis Ababa (0.04)
- Genre:
- Instructional Material (0.96)
- Research Report
- New Finding (1.00)
- Experimental Study (0.67)
- Industry:
- Education > Educational Setting > Continuing Education (0.96)
- Technology: