Stochastic Search In Changing Situations
Abdolmaleki, Abbas (University of Aveiro) | Simoes, David (University of Aveiro) | Lau, Nuno (University of Aveiro) | Reis, Luis Paulo (University of Minho) | Price, Bob (PARC) | Neumann, Gerhard (Technische Universität Darmstadt)
Stochastic search algorithms are black-box optimizer of an objective function. They have recently gained a lot of attention in operations research, machine learning and policy search of robot motor skills due to their ease of use and their generality. However, when the task or objective function slightly changes, many stochastic search algorithms require complete re-learning in order to adapt thesolution to the new objective function or the new context. As such, we consider the contextual stochastic search paradigm. Here, we want to find good parameter vectors for multiple related tasks, where each task is described by a continuous context vector. Hence, the objective function might change slightly for each parameter vector evaluation. In this paper, we investigate a contextual stochastic search algorithm known as Contextual Relative Entropy Policy Search (CREPS), an information-theoretic algorithm that can learn from multiple tasks simultaneously. We show the application of CREPS for simulated robotic tasks.
Feb-4-2017
- Technology: