Grounded Reinforcement Learning: Learning to Win the Game under Human Commands

Oct-10-2024, 13:45:45 GMT–Neural Information Processing Systems

We consider the problem of building a reinforcement learning (RL) agent that can both accomplish non-trivial tasks, like winning a real-time strategy game, and strictly follow high-level language commands from humans, like "attack", even if a command is sub-optimal. We call this novel yet important problem, Grounded Reinforcement Learning (GRL). Compared with other language grounding tasks, GRL is particularly non-trivial and cannot be simply solved by pure RL or behavior cloning (BC). From the RL perspective, it is extremely challenging to derive a precise reward function for human preferences since the commands are abstract and the valid behaviors are highly complicated and multi-modal. From the BC perspective, it is impossible to obtain perfect demonstrations since human strategies in complex games are typically sub-optimal.

grounded reinforcement learning, human command, real-time strategy game, (1 more...)

Neural Information Processing Systems

Oct-10-2024, 13:45:45 GMT

Conferences Web Page

Add feedback

Industry:
- Leisure & Entertainment > Games (0.70)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)