Benchmarking Bonus-Based Exploration Methods on the Arcade Learning Environment
