Structured Reinforcement Learning for Combinatorial Decision-Making