Bigger, Better, Faster: Human-level Atari with human-level efficiency
Schwarzer, Max, Obando-Ceron, Johan, Courville, Aaron, Bellemare, Marc, Agarwal, Rishabh, Castro, Pablo Samuel
–arXiv.org Artificial Intelligence
We introduce a value-based RL agent, which we 64 call BBF, that achieves super-human performance in the Atari 100K benchmark. BBF relies on scaling 16 the neural networks used for value estimation, as well as a number of other design choices that 4 enable this scaling in a sample-efficient manner. We conduct extensive analyses of these design 1 choices and provide insights for future work. We 2015 2017 2019 2021 2023 end with a discussion about updating the goalposts for sample-efficient RL research on the ALE. Figure 1: Environment samples to reach human-level performance, We make our code and data publicly available.
arXiv.org Artificial Intelligence
Nov-13-2023
- Country:
- North America > United States > Hawaii (0.14)
- Genre:
- Research Report > New Finding (0.67)
- Industry:
- Leisure & Entertainment > Games (1.00)
- Technology: