Entropy-Driven Curriculum for Multi-Task Training in Human Mobility Prediction

Fang, Tianye, Luo, Xuanshu, Werner, Martin

arXiv.org Artificial Intelligence 

--The increasing availability of big mobility data from ubiquitous portable devices enables human mobility prediction through deep learning approaches. However, the diverse complexity of human mobility data impedes model training, leading to inefficient gradient updates and potential underfitting. This paper presents a unified training framework that integrates entropy-driven curriculum and multi-task learning to address these challenges. The proposed entropy-driven curriculum learning strategy quantifies trajectory predictability based on Lempel-Ziv compression and organizes training from simple to complex for faster convergence and enhanced performance. The multi-task training simultaneously optimizes the primary location prediction alongside auxiliary estimation of movement distance and direction for learning realistic mobility patterns, and improve prediction accuracy through complementary supervision signals. Extensive experiments conducted in accordance with the HuMob Challenge demonstrate that our approach achieves state-of-the-art performance on GEO-BLEU (0.354) and DTW (26.15) metrics with up to 2.92-fold convergence speed compared to training without curriculum learning. The inherent regularity of human mobility data, which exhibits predictability of individual mobility patterns across diverse populations and travel distances [1], provides the foundation for numerous location-based applications, including urban planning and management, transportation optimization, epidemic modeling, and recommendation systems [2]-[7]. With the proliferation of pervasive user devices with passive location acquisition capabilities, unprecedented volumes of human mobility data have been collected, enabling data-driven approaches, particularly sequential deep learning models, to effectively extract human mobility patterns [8]-[11]. In comparison to handcrafted pattern matching [12]-[14] and Markov models [15]-[17], deep learning methods generally achieve superior long-term prediction performance.