Improving the accuracy of freight mode choice models: A case study using the 2017 CFS PUF data set and ensemble learning techniques
Liu, Diyi, Lim, Hyeonsup, Uddin, Majbah, Liu, Yuandong, Han, Lee D., Hwang, Ho-ling, Chin, Shih-Miao
–arXiv.org Artificial Intelligence
The US Census Bureau has collected two rounds of experimental data from the Commodity Flow Survey, providing shipment-level characteristics of nationwide commodity movements, published in 2012 (i.e., Public Use Microdata) and in 2017 (i.e., Public Use File). With this information, data-driven methods have become increasingly valuable for understanding detailed patterns in freight logistics. In this study, we used the 2017 Commodity Flow Survey Public Use File data set to explore building a high-performance freight mode choice model, considering three main improvements: (1) constructing local models for each separate commodity/industry category; (2) extracting useful geographical features, particularly the derived distance of each freight mode between origin/destination zones; and (3) applying additional ensemble learning methods such as stacking or voting to combine results from local and unified models for improved performance. The proposed method achieved over 92% accuracy without incorporating external information, an over 19% increase compared to directly fitting Random Forests models over 10,000 samples. Furthermore, SHAP (Shapely Additive Explanations) values were computed to explain the outputs and major patterns obtained from the proposed model. The model framework could enhance the performance and interpretability of existing freight mode choice models.
arXiv.org Artificial Intelligence
Feb-12-2024
- Country:
- South America > Brazil
- Rio Grande do Sul (0.04)
- North America
- United States
- Michigan (0.04)
- Maryland (0.04)
- Tennessee
- Knox County > Knoxville (0.14)
- Anderson County > Oak Ridge (0.04)
- New York > New York County
- New York City (0.04)
- California > Los Angeles County
- Los Angeles (0.14)
- Cuba > Holguín Province
- Holguín (0.04)
- United States
- Europe
- Switzerland (0.04)
- Netherlands (0.04)
- Monaco (0.04)
- Greece (0.04)
- Denmark (0.04)
- Asia
- South Korea > Seoul
- Seoul (0.04)
- China > Jiangsu Province
- Nanjing (0.04)
- South Korea > Seoul
- South America > Brazil
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Health & Medicine (0.95)
- Information Technology (0.93)
- Energy (0.93)
- Transportation
- Freight & Logistics Services (1.00)
- Ground > Road (0.93)
- Government > Regional Government
- Technology:
- Information Technology
- Data Science > Data Mining (1.00)
- Artificial Intelligence > Machine Learning
- Statistical Learning (1.00)
- Performance Analysis > Accuracy (1.00)
- Ensemble Learning (0.89)
- Decision Tree Learning (0.88)
- Neural Networks (0.68)
- Information Technology