Reward Dimension Reduction for Scalable Multi-Objective Reinforcement Learning
Giseung Park, Youngchul Sung
arXiv.org Artificial Intelligence
In this paper, we introduce a simple yet effective reward dimension reduction method to tackle the scalability challenges of multi-objective reinforcement learning algorithms. While most existing approaches focus on optimizing two to four objectives, their ability to scale to environments with more objectives remains uncertain. Our method uses dimension reduction to enhance learning efficiency and policy performance in multi-objective settings. Whereas most traditional dimension reduction methods are designed for static datasets, our approach is tailored to online learning and preserves Pareto-optimality after transformation. We propose a new training and evaluation framework for reward dimension reduction in multi-objective reinforcement learning and demonstrate the superiority of our method in environments including one with sixteen objectives, significantly outperforming existing online dimension reduction methods.
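The paper's exact transformation is not reproduced here, but one standard way a reward dimension reduction can preserve Pareto-optimality is to use a strictly positive linear map: if every entry of the projection matrix is positive, componentwise dominance between reward (or return) vectors survives the projection. The sketch below illustrates this property with hypothetical names (`W`, `reduce_reward`) and a 16-to-4 reduction matching the largest environment mentioned in the abstract; it is an illustration of the preservation argument, not the authors' method.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical reduction: project 16 reward dimensions down to 4 with a
# strictly positive linear map W. If every entry of W is positive, then
# r1 >= r2 componentwise (with at least one strict inequality) implies
# W @ r1 >= W @ r2 componentwise, so Pareto dominance is preserved.
n_objectives, k = 16, 4
W = rng.uniform(0.1, 1.0, size=(k, n_objectives))

def reduce_reward(r: np.ndarray) -> np.ndarray:
    """Map a 16-dim reward vector to the 4-dim reduced space."""
    return W @ r

# Construct two reward vectors where r1 Pareto-dominates r2.
r2 = rng.uniform(size=n_objectives)
r1 = r2 + rng.uniform(0.0, 0.5, size=n_objectives)

z1, z2 = reduce_reward(r1), reduce_reward(r2)
assert np.all(z1 >= z2)  # dominance survives the reduction
print(z1.shape)
```

In an online setting, `W` would be learned from the stream of observed rewards rather than fixed in advance; the positivity constraint is what keeps the reduced problem's Pareto front consistent with the original one.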
Feb-28-2025