OFCOURSE: A Multi-Agent Reinforcement Learning Environment for Order Fulfillment
–Neural Information Processing Systems
In particular, we model the integrated problem as a Markov game, wherein a team of agents learns a joint policy via interacting with a simulated environment.
Neural Information Processing Systems
Oct-8-2025, 20:59:21 GMT
- Country:
- Asia
- China > Zhejiang Province (0.04)
- Middle East > Jordan (0.04)
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- Asia
- Genre:
- Research Report (1.00)
- Industry:
- Retail (0.68)
- Transportation > Freight & Logistics Services (0.47)