Exploratory Combinatorial Optimization with Reinforcement Learning