Read and Reap the Rewards: Learning to Play Atari with the Help of Instruction Manuals

Oct-9-2024, 08:37:37 GMT–Neural Information Processing Systems

High sample complexity has long been a challenge for RL. On the other hand, humans learn to perform tasks not only from interaction or demonstrations, but also by reading unstructured text documents, e.g., instruction manuals. Instruction manuals and wiki pages are among the most abundant data that could inform agents of valuable features and policies or task-specific environmental dynamics and reward structures. Therefore, we hypothesize that the ability to utilize human-written instruction manuals to assist learning policies for specific tasks should lead to a more efficient and better-performing agent. We propose the Read and Reward framework.

instruction manual, play atari, read and reap, (5 more...)

Neural Information Processing Systems

Oct-9-2024, 08:37:37 GMT

Conferences Web Page

Add feedback

Industry:
- Leisure & Entertainment > Games > Computer Games (0.50)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.46)