DIMES: A Differentiable Meta Solver for Combinatorial Optimization Problems

Neural Information Processing Systems 

DRL solvers can only scale to a few hundreds of nodes for combinatorial optimization problems on graphs, such as the Traveling Salesman Problem (TSP).