Reinforcement Learning with Discrete Diffusion Policies for Combinatorial Action Spaces