Reward learning from human preferences and demonstrations in Atari
Borja Ibarz, Jan Leike, Tobias Pohlen, Geoffrey Irving, Shane Legg, Dario Amodei
Neural Information Processing Systems
May-26-2025, 08:12:14 GMT
- Country:
  - North America > Canada (0.14)
- Genre:
  - Research Report > New Finding (0.46)
- Industry:
  - Education (0.68)
  - Leisure & Entertainment > Games (1.00)
- Technology: