Combining time-series and textual data for taxi demand prediction in event areas: a deep learning approach

Rodrigues, Filipe, Markou, Ioulia, Pereira, Francisco

arXiv.org Machine Learning 

Accurate time-series forecasting is vital for numerous areas of application such as transportation, energy, finance, economics, etc. However, while modern techniques are able to explore large sets of temporal data to build forecasting models, they typically neglect valuable information that is often available under the form of unstructured text. Although this data is in a radically different format, it often contains contextual explanations for many of the patterns that are observed in the temporal data. In this paper, we propose two deep learning architectures that leverage word embeddings, convolutional layers and attention mechanisms for combining text information with time-series data. We apply these approaches for the problem of taxi demand forecasting in event areas. Using publicly available taxi data from New York, we empirically show that by fusing these two complementary cross-modal sources of information, the proposed models are able to significantly reduce the error in the forecasts. Keywords: Deep learning, Data fusion, Cross modality learning, Time series forecasting, Textual data, Taxi demand, Special events, Urban mobility 1. Introduction Understanding what drives the travel behavior of people is a key research topic for developing effective and efficient intelligent transportation systems that adapt to the travel demand. However, typical approaches focus only on capturing recurrent mobility trends that relate to habitual/routine behaviour [1], and on exploiting short-term correlations with recent observation patterns [2, 3]. While this type of approaches can be successful for long-term planning applications or for modeling demand in non-eventful areas such as residential neighborhoods, in lively and highly dynamic areas that are prone to the occurrence of multiple special events, such as music concerts, sports games, festivals, parades and protests, these approaches fail to accurately model mobility demand [4]. As we move towards the deployment of autonomous vehicles, understanding and being able to anticipate mobility demand becomes crucial, especially in shared-mobility scenarios, as this allows for properly managing fleets and increasing user-satisfaction. In order to capture the effects of events, one can exploit the vast amount of information that is shared online about what is planned to take place in the city. However, most of this information is typically in the form of unstructured natural-language text.

Duplicate Docs Excel Report

Title
None found

Similar Docs  Excel Report  more

TitleSimilaritySource
None found