An Optimization Framework for Task Sequencing in Curriculum Learning
Foglino, Francesco, Leonetti, Matteo
Abstract--Curriculum learning is gaining popularity in (deep) reinforcement learning. Current methods for automatic task sequencing for curriculum learning in reinforcement learning provided initial heuristic solutions, with little to no guarantee on their quality. We introduce an optimization framework for task sequencing composed of the problem definition, several candidate performance metrics for optimization, and three benchmark algorithms. We experimentally show that the two most commonly used baselines (learning with no curriculum, and with a random curriculum) perform worse than a simple greedy algorithm. Furthermore, we show theoretically and demonstrate experimentally that the three proposed algorithms provide increasing solution quality at moderately increasing computational complexity, and show that they constitute better baselines for curriculum learning in reinforcement learning. Reinforcement Learning (RL) has recently been successfully applied to a number of tasks whose complexity would have appeared overwhelming only a few years ago [1], [2]. In such large and complex environments, classical exploration strategies designed for Markov Decision Processes (MDPs), aiming at visiting every state the most efficiently, are inadequate. One approach actively investigated is the use of transfer learning [3] to generalize from previous similar tasks, and more recently the application of transfer learning to sequences of tasks of increasing complexity forming a curriculum . Curriculum Learning is often employed in (Deep) RL [4], [5] to let the agent progress more quickly towards better behaviors, but curricula are mostly designed by hand. Curriculum learning has the potential to greatly increase the quality of the behavior discovered by the agent. However, at the moment, creating an appropriate curriculum requires significant human intuition.
Jan-31-2019
- Country:
- Europe > United Kingdom (0.14)
- Genre:
- Research Report (0.64)
- Industry:
- Education (0.68)
- Information Technology (0.68)
- Leisure & Entertainment > Games
- Computer Games (1.00)
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning > Reinforcement Learning (1.00)
- Representation & Reasoning
- Agents (1.00)
- Optimization (0.84)
- Search (1.00)
- Information Technology > Artificial Intelligence