Modeling Freight Mode Choice Using Machine Learning Classifiers: A Comparative Study Using the Commodity Flow Survey (CFS) Data
Uddin, Majbah, Anowar, Sabreena, Eluru, Naveen
–arXiv.org Artificial Intelligence
This study explores the usefulness of machine learning classifiers for modeling freight mode choice. We investigate eight commonly used machine learning classifiers, namely Naive Bayes, Support Vector Machine, Artificial Neural Network, K-Nearest Neighbors, Classification and Regression Tree, Random Forest, Boosting and Bagging, along with the classical Multinomial Logit model. US 2012 Commodity Flow Survey data are used as the primary data source; we augment it with spatial attributes from secondary data sources. The performance of the classifiers is compared based on prediction accuracy results. The current research also examines the role of sample size and training-testing data split ratios on the predictive ability of the various approaches. In addition, the importance of variables is estimated to determine how the variables influence freight mode choice. The results show that the tree-based ensemble classifiers perform the best. Specifically, Random Forest produces the most accurate predictions, closely followed by Boosting and Bagging. With regard to variable importance, shipment characteristics, such as shipment distance, industry classification of the shipper and shipment size, are the most significant factors for freight mode choice decisions.
arXiv.org Artificial Intelligence
Feb-1-2024
- Country:
- Asia (0.68)
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.14)
- North America > United States
- Florida > Orange County
- Orlando (0.14)
- Missouri > Boone County
- Columbia (0.14)
- Florida > Orange County
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Government (0.93)
- Transportation
- Freight & Logistics Services (0.94)
- Ground > Road (1.00)
- Infrastructure & Services (0.68)
- Passenger (1.00)
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning
- Decision Tree Learning (1.00)
- Learning Graphical Models > Directed Networks
- Bayesian Learning (0.48)
- Neural Networks (1.00)
- Performance Analysis > Accuracy (0.67)
- Statistical Learning
- Nearest Neighbor Methods (0.55)
- Support Vector Machines (0.55)
- Representation & Reasoning > Uncertainty
- Fuzzy Logic (0.46)
- Machine Learning
- Information Technology > Artificial Intelligence