Measuring and Characterizing Generalization in Deep Reinforcement Learning