Action-Gap Phenomenon in Reinforcement Learning