A Hierarchical Reinforcement Learning Based Optimization Framework for Large-scale Dynamic Pickup and Delivery Problems

Dec-24-2025, 21:32:56 GMT–Neural Information Processing Systems

The Dynamic Pickup and Delivery Problem (DPDP) is an essential problem in the logistics domain, which is NP-hard. The objective is to dynamically schedule vehicles among multiple sites to serve the online generated orders such that the overall transportation cost could be minimized. The critical challenge of DPDP is the orders are not known a priori, i.e., the orders are dynamically generated in real-time. To address this problem, existing methods partition the overall DPDP into fixed-size sub-problems by caching online generated orders and solve each sub-problem, or on this basis to utilize the predicted future orders to optimize each sub-problem further. However, the solution quality and efficiency of these methods are unsatisfactory, especially when the problem scale is very large. In this paper, we propose a novel hierarchical optimization framework to better solve large-scale DPDPs.

dynamic pickup and delivery problem, hierarchical reinforcement learning, large-scale dynamic pickup, (9 more...)

Neural Information Processing Systems

Dec-24-2025, 21:32:56 GMT

Conferences Web Page

Add feedback

Industry:
- Transportation > Freight & Logistics Services (0.97)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.58)