A OpenXLand Components
Neural Information Processing Systems
In this environment, the agent is rewarded when the orange sphere makes contact with the blue pyramid. The orange sphere is elevated, so the agent must find it and use the ramps to reach it. The blue pyramid is not visible because it does not yet exist: the agent must first bring the orange sphere near the black rounded cube to spawn one. The environment also contains a grey pyramid that serves as a distractor. Importantly, if the agent brings the grey pyramid near the black rounded cube, both disappear, making it impossible for the agent to spawn a blue pyramid and therefore to obtain its reward.
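The object logic described above can be sketched as a small set of rules. This is an illustrative toy model, not the OpenXLand implementation: the `Rule` class, `bring_near`, and `solvable` are hypothetical names, navigation (elevation and ramps) is not modelled, and the sketch assumes that spawning the blue pyramid leaves the orange sphere and the black rounded cube in place, which the description does not state.

```python
from dataclasses import dataclass
from itertools import combinations


@dataclass(frozen=True)
class Rule:
    # Hypothetical rule: fires when objects `a` and `b` are brought near each other.
    a: str
    b: str
    removes: frozenset  # objects that disappear when the rule fires
    adds: frozenset     # objects that are spawned when the rule fires


RULES = (
    # Assumption: the sphere and the cube both persist after the spawn.
    Rule("orange sphere", "black rounded cube",
         frozenset(), frozenset({"blue pyramid"})),
    # From the description: the distractor and the cube both disappear.
    Rule("grey pyramid", "black rounded cube",
         frozenset({"grey pyramid", "black rounded cube"}), frozenset()),
)

# Reward is given when these two objects make contact.
GOAL = frozenset({"orange sphere", "blue pyramid"})


def bring_near(world, x, y):
    """Return the set of objects after x is brought near y."""
    if x not in world or y not in world:
        return world
    for rule in RULES:
        if {x, y} == {rule.a, rule.b}:
            return (world - rule.removes) | rule.adds
    return world


def solvable(world):
    """Search all rule firings to see whether both goal objects can coexist."""
    seen, frontier = {world}, [world]
    while frontier:
        state = frontier.pop()
        if GOAL <= state:
            return True
        for x, y in combinations(sorted(state), 2):
            nxt = bring_near(state, x, y)
            if nxt not in seen:
                seen.add(nxt)
                frontier.append(nxt)
    return False


start = frozenset({"orange sphere", "black rounded cube", "grey pyramid"})
after_spawn = bring_near(start, "orange sphere", "black rounded cube")
after_mistake = bring_near(start, "grey pyramid", "black rounded cube")
```

Under these assumptions, `solvable(start)` holds, while `solvable(after_mistake)` does not: once the black rounded cube is gone, no remaining rule can produce a blue pyramid, which is the irreversible failure the description warns about.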
May-28-2025, 07:43:45 GMT