Reversible Action Design for Combinatorial Optimization with Reinforcement Learning