Robust Taxi Fare Prediction Under Noisy Conditions: A Comparative Study of GAT, TimesNet, and XGBoost
–arXiv.org Artificial Intelligence
--Precise fare prediction is crucial in ride-hailing platforms and urban mobility systems. This study examines three machine learning models--Graph Attention Networks (GA T), XGBoost, and TimesNet--to evaluate their predictive capabilities for taxi fares using a real-world dataset comprising over 55 million records. Both raw (noisy) and denoised versions of the dataset are analyzed to assess the impact of data quality on model performance. The study evaluated the models along multiple axes, including predictive accuracy, calibration, uncertainty estimation, out-of-distribution (OOD) robustness, and feature sensitivity. We also explore pre-processing strategies, including KNN imputation, Gaussian noise injection, and autoencoder-based denoising. The study reveals critical differences between classical and deep learning models under realistic conditions, offering practical guidelines for building robust and scalable models in urban fare prediction systems. Index T erms--T axi Fare Prediction, Machine Learning, Graph Attention Network, XGBoost, Time Series, Uncertainty Estimation, Ensemble Models, Kolmogorov-Smirnov (KS), Out-of-Distribution (OOD). A. Background and Motivation Accurately estimating taxi fares plays a pivotal role in intelligent transportation systems and urban mobility planning.
arXiv.org Artificial Intelligence
Jul-29-2025