Spotlight Slides: Action-Gap Phenomenon in Reinforcement Learning

Amir-massoud Farahmand

Neural Information Processing Systems 

Even if we don't know the exact quality ( value) of each choice ( action) Not a big deal if we choose the wrong one!