Reinforcement Learning with Combinatorial Actions: An Application to Vehicle Routing