Reinforcement Learning to Solve NP-hard Problems: an Application to the CVRP