OFCOURSE: A Multi-Agent Reinforcement Learning Environment for Order Fulfillment

Neural Information Processing Systems 

In particular, we model the integrated problem as a Markov game, wherein a team of agents learns a joint policy via interacting with a simulated environment.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found