AITopics | multi-agent reinforcement learning environment

OFCOURSE: A Multi-Agent Reinforcement Learning Environment for Order Fulfillment

Neural Information Processing Systems | Dec-25-2025, 22:55:22 GMT

The dramatic growth of global e-commerce has led to a surge in demand for efficient and cost-effective order fulfillment, which can improve customer service levels and seller competitiveness. However, managing order fulfillment is challenging because it involves a series of interdependent online sequential decision-making problems. Rather than solving these problems separately, as some recent studies have attempted, this paper proposes a multi-agent reinforcement learning method that solves them jointly, covering order handling, packing and pickup, storage, order consolidation, and last-mile delivery. In particular, we model the integrated problem as a Markov game in which a team of agents learns a joint policy by interacting with a simulated environment. Since no existing simulated environment supports the complete order fulfillment problem, we devise the Order Fulfillment COoperative mUlti-agent Reinforcement learning Scalable Environment (OFCOURSE) in the OpenAI Gym style, which supports reproduction and reuse for building customized applications. By constructing the fulfillment system in OFCOURSE, we optimize a joint policy that solves the integrated problem, facilitating sequential order-wise operations across all fulfillment units and minimizing the total cost of fulfilling all orders within the promised time. With OFCOURSE, we also demonstrate that the joint policy learned by multi-agent reinforcement learning outperforms the combination of locally optimal policies.
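To make the Gym-style, cooperative Markov-game framing in the abstract concrete, here is a minimal sketch of a toy two-unit fulfillment environment. It is not the OFCOURSE API: the class `ToyFulfillmentEnv`, the agent names, the cost constants, and the dynamics are all hypothetical and chosen only for illustration. It shows the general shape of such an environment: a `reset`/`step` interface, per-agent observations and actions, and a single team reward equal to the negative fulfillment cost.

```python
# Hypothetical toy environment; names, costs, and dynamics are illustrative
# assumptions and are NOT taken from OFCOURSE.
AGENTS = ("warehouse", "delivery")
HOLD_COST = 1.0      # cost per waiting order per step
PROCESS_COST = 2.0   # cost of processing one order in a unit
LATE_PENALTY = 10.0  # cost per order unfinished when the horizon is reached


class ToyFulfillmentEnv:
    """Two fulfillment units in series; all agents share one team reward."""

    def __init__(self, n_orders=5, horizon=20):
        self.n_orders = n_orders
        self.horizon = horizon

    def reset(self):
        self.t = 0
        self.queues = {"warehouse": self.n_orders, "delivery": 0}
        self.done_orders = 0
        return self._obs()

    def _obs(self):
        # Each agent observes its own queue length and the current time step.
        return {a: (self.queues[a], self.t) for a in AGENTS}

    def step(self, actions):
        """actions maps each agent to 0 (wait) or 1 (process one order)."""
        cost = 0.0
        # Handle the downstream unit first so an order advances at most
        # one unit per time step.
        if actions["delivery"] == 1 and self.queues["delivery"] > 0:
            self.queues["delivery"] -= 1
            self.done_orders += 1
            cost += PROCESS_COST
        if actions["warehouse"] == 1 and self.queues["warehouse"] > 0:
            self.queues["warehouse"] -= 1
            self.queues["delivery"] += 1
            cost += PROCESS_COST
        cost += HOLD_COST * sum(self.queues.values())
        self.t += 1
        finished = self.done_orders == self.n_orders
        timed_out = self.t >= self.horizon
        if timed_out and not finished:
            cost += LATE_PENALTY * (self.n_orders - self.done_orders)
        reward = -cost  # one shared reward: the cooperative setting
        return self._obs(), reward, finished or timed_out, {}


def run_episode(env, policy):
    """Roll out policy(agent_name, observation) -> action for one episode."""
    obs = env.reset()
    total, done = 0.0, False
    while not done:
        actions = {a: policy(a, obs[a]) for a in AGENTS}
        obs, reward, done, _ = env.step(actions)
        total += reward
    return total
```

A learner would replace the `policy` callable with per-agent policies trained on the shared reward; for example, `run_episode(ToyFulfillmentEnv(), lambda agent, obs: 1)` rolls out an always-process baseline.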

multi-agent reinforcement learning environment, name change, ofcourse, (5 more...)

Neural Information Processing Systems

Industry: Education (0.43)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)


OFCOURSE: A Multi-Agent Reinforcement Learning Environment for Order Fulfillment

Neural Information Processing Systems | Jan-19-2025, 04:00:37 GMT

The dramatic growth of global e-commerce has led to a surge in demand for efficient and cost-effective order fulfillment, which can improve customer service levels and seller competitiveness. However, managing order fulfillment is challenging because it involves a series of interdependent online sequential decision-making problems. Rather than solving these problems separately, as some recent studies have attempted, this paper proposes a multi-agent reinforcement learning method that solves them jointly, covering order handling, packing and pickup, storage, order consolidation, and last-mile delivery. In particular, we model the integrated problem as a Markov game in which a team of agents learns a joint policy by interacting with a simulated environment. Since no existing simulated environment supports the complete order fulfillment problem, we devise the Order Fulfillment COoperative mUlti-agent Reinforcement learning Scalable Environment (OFCOURSE) in the OpenAI Gym style, which supports reproduction and reuse for building customized applications.

multi-agent reinforcement learning environment, ofcourse, order fulfillment, (3 more...)

Neural Information Processing Systems

Industry: Education (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)


Tactics2D: A Multi-agent Reinforcement Learning Environment for Driving Decision-making

Li, Yueyuan, Zhang, Songan, Jiang, Mingyang, Chen, Xingyuan, Yang, Ming

arXiv.org Artificial Intelligence | Nov-18-2023

Tactics2D is an open-source multi-agent reinforcement learning library with a Python backend. Its goal is to provide researchers with a convenient toolset for developing decision-making algorithms for autonomous driving. The library includes diverse traffic scenarios implemented as gym-based environments, equipped with multi-sensory capabilities and traffic-rule violation detection. It also provides a reinforcement learning baseline tested with reasonable evaluation metrics. Tactics2D is highly modular and customizable. The source code of Tactics2D is available at https://github.com/WoodOxen/Tactics2D.

dataset, scenario, tactics2d, (11 more...)

arXiv.org Artificial Intelligence

2311.11058

Country:

Asia > China > Shanghai > Shanghai (0.05)
Europe > Germany (0.05)

Genre: Research Report (0.50)

Industry: Transportation > Ground > Road (0.93)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)



