Grounded Reinforcement Learning: Learning to Win the Game under Human Commands Supplementary Materials
Neural Information Processing Systems
In this section, we describe the details of the MiniRTS environment and the human dataset. The data do not contain any personally identifiable information or offensive content.

Figure 1: MiniRTS [2] implements a rock-paper-scissors attack graph: each army type has some units it is effective against and some it is vulnerable to. For example, "swordman" restrains "spearman" but is restrained by "cavalry".

Figure 2: Building units can produce different army units using resources. "workshop" can produce "archer", "dragon" and "catapult", while other buildings can build one unit type. Only "peasant" ...

Game Units

There are 3 kinds of units in MiniRTS: resource units, building units, and army units.

Resource Units: Resource units are stationary and neutral. They cannot be constructed by anyone and are created at the beginning of a game. One mine action can gather resources from a resource unit, and the mined resources are necessary to build new building units or army units.
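The unit relations described above can be pictured as small lookup tables. The following Python sketch is illustrative only and is not the MiniRTS implementation: it encodes just the two attack relations and the one production rule stated in the text, and the names `RESTRAINS`, `PRODUCES`, `restrains`, and `ResourceUnit`, as well as the numeric resource amounts, are hypothetical.

```python
from dataclasses import dataclass

# Attack relations stated in the text only; the full MiniRTS graph
# contains more army types and edges than are listed here.
RESTRAINS = {
    "swordman": {"spearman"},
    "cavalry": {"swordman"},
}

# Production rule stated in the Figure 2 caption only.
PRODUCES = {
    "workshop": ("archer", "dragon", "catapult"),
}


def restrains(attacker: str, defender: str) -> bool:
    """Return True if `attacker` is effective against `defender`."""
    return defender in RESTRAINS.get(attacker, set())


@dataclass
class ResourceUnit:
    """A stationary, neutral unit created at the start of a game."""

    remaining: int  # hypothetical: the amount of resource left to mine

    def mine(self, amount: int) -> int:
        """Gather up to `amount` resources with one mine action."""
        gathered = min(amount, self.remaining)
        self.remaining -= gathered
        return gathered
```

For instance, `restrains("swordman", "spearman")` is true while `restrains("swordman", "cavalry")` is false, which mirrors the asymmetry of the rock-paper-scissors graph.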
Jan-25-2025, 21:06:43 GMT