Goto

Collaborating Authors

 Search





A Hierarchical Reinforcement Learning Based Optimization Framework for Large-scale Dynamic Pickup and Delivery Problems Yi Ma

Neural Information Processing Systems

To address this problem, existing methods partition the overall DPDP into fixed-size sub-problems by caching online generated orders and solve each sub-problem, or on this basis to utilize the predicted future orders to optimize each sub-problem further. However, the solution quality and efficiency of these methods are unsatisfactory, especially when the problem scale is very large.




A Supplementary Material

Neural Information Processing Systems

Thus, Proposition 1 yields a smallest PI-explanation.Proposition 2. PI-explanations of an XLC can be enumerated with log-linear delay. This means that all PIexplanations will be found. Thus, all leaves correspond to PI-explanations. Figure 5: Percentage of important "hits" of explanations produced by Anchor and SHAP .


hyperparameter tuning for each individual encoding, (preliminary) experiments on the DARTS search space, and

Neural Information Processing Systems

We thank the reviewers for their helpful reviews. Please see the details below. See the figure below for the results of Reg. We now provide preliminary results for experiments on the DARTS search space. See the figure below (top right).



A General Large Neighborhood Search Framework for Solving Integer Linear Programs Jialin Song

Neural Information Processing Systems

This paper studies a strategy for data-driven algorithm design for large-scale combinatorial optimization problems that can leverage existing state-of-the-art solvers in general purpose ways. The goal is to arrive at new approaches that can reliably outperform existing solvers in wall-clock time. We focus on solving integer linear programs, and ground our approach in the large neighborhood search (LNS) paradigm, which iteratively chooses a subset of variables to optimize while leaving the remainder fixed. The appeal of LNS is that it can easily use any existing solver as a subroutine, and thus can inherit the benefits of carefully engineered heuristic or complete approaches and their software implementations. We show that one can learn a good neighborhood selector using imitation and reinforcement learning techniques. Through an extensive empirical validation in bounded-time optimization, we demonstrate that our LNS framework can significantly outperform compared to state-of-the-art commercial solvers such as Gurobi.