Gotta Learn Fast: A New Benchmark for Generalization in RL