Goto

Collaborating Authors

 cvrp



564127c03caab942e503ee6f810f54fd-Supplemental.pdf

Neural Information Processing Systems

This paper solves three NP-hard routing problems, traveling salesman problem (TSP), prize collecting TSP (PCTSP), and capacitated vehicle routing problem (CVRP). This section provides detailed descriptions of PCTSP and CVRP (for TSP, see section 3). The PCTSP is similar to TSP, while there are differences in that we do not have to visit all the nodes and that the destination is not the first node but the depot node, i.e., a tour is not a cycle. Let N be the number of nodes. The problem instance of PCTSP is s = {(xi,ฮปi,ยตi)}N+1i=1, where the xi R2 is in 2D euclidean coordinates, ฮปi R is the penalty of unvisited node, and ยตi R is the prize of visited node. The L(ฯ€|s) is the tour length, and ฮป(ฯ€|s) is the total penalty of the unvisited nodes.



0cddb777d3441326544e21b67f41bdc8-Supplemental-Conference.pdf

Neural Information Processing Systems

In this section, we prove the Theorem 2.1, which states a problem P and its' orthogonal transformed problem Q(P) = {{Qxi}Ni=1,f}have identical optimal solutions if Qis orthogonal matrix: QQT = QTQ = I. As we mentioned in Section 2.2, reward R is a function of a1:T (solution sequences), ||xi xj||i,j {1,...N} (relative distances) and f (nodes features). And Let R (P)is optimal value of problem P: i.e. Then, the remaining proof is to show Q(P)has an identical solution set with P. Let optimal solution set ฮ  (P) = {ฯ€i(P)}Mi=1, where ฯ€i(P)indicates optimal solution of P and M is the number of heterogeneous optimal solution. Conversely, For any ฯ€i(P) ฮ  (P), they have sample optimal value with Q(P): R(ฯ€i(P);P) = R (P) = R (Q(P)) Thus, ฯ€i(P) ฮ  (Q(P)).