The First AI4TSP Competition: Learning to Solve Stochastic Routing Problems

Bliek, Laurens, da Costa, Paulo, Afshar, Reza Refaei, Zhang, Yingqian, Catshoek, Tom, Vos, Daniël, Verwer, Sicco, Schmitt-Ulms, Fynn, Hottung, André, Shah, Tapan, Sellmann, Meinolf, Tierney, Kevin, Perreault-Lafleur, Carl, Leboeuf, Caroline, Bobbio, Federico, Pepin, Justine, Silva, Warley Almeida, Gama, Ricardo, Fernandes, Hugo L., Zaefferer, Martin, López-Ibáñez, Manuel, Irurozki, Ekhine

arXiv.org Artificial Intelligence 

The TSP is one of the classical combinatorial optimization problems, with many variants inspired by real-world applications. This first competition asked the participants to develop algorithms to solve a time-dependent orienteering problem with stochastic weights and time windows (TD-OPSWTW). It focused on two types of learning approaches: surrogate-based optimization and deep reinforcement learning. In this paper, we describe the problem, the setup of the competition, the winning methods, and give an overview of the results. The winning methods described in this work have advanced the state-of-the-art in using AI for stochastic routing problems. Overall, by organizing this competition we have introduced routing problems as an interesting problem setting for AI researchers. The simulator of the problem has been made open-source and can be used by other researchers as a benchmark for new AI methods.