Constrained Reinforcement Learning for Safe Heat Pump Control

Zhang, Baohe, Frison, Lilli, Brox, Thomas, Bödecker, Joschka

Sep-29-2024–arXiv.org Artificial Intelligence

Constrained Reinforcement Learning (RL) has emerged as a significant research area within RL, where integrating constraints with rewards is crucial for enhancing safety and performance across diverse control tasks. In the context of heating systems in the buildings, optimizing the energy efficiency while maintaining the residents' thermal comfort can be intuitively formulated as a constrained optimization problem. However, to solve it with RL may require large amount of data. Therefore, an accurate and versatile simulator is favored. In this paper, we propose a novel building simulator I4B which provides interfaces for different usages and apply a model-free constrained RL algorithm named constrained Soft Actor-Critic with Linear Smoothed Log Barrier function (CSAC-LB) to the heating optimization problem. Benchmarking against baseline algorithms demonstrates CSAC-LB's efficiency in data exploration, constraint satisfaction and performance.

algorithm, constraint, efficiency, (15 more...)

arXiv.org Artificial Intelligence

Sep-29-2024

arXiv.org PDF

Add feedback

Country:
- Oceania > Australia
  - New South Wales > Sydney (0.04)
- North America
  - United States
    - New York > New York County
      - New York City (0.04)
    - Massachusetts > Middlesex County
      - Cambridge (0.04)
    - Louisiana > Orleans Parish
      - New Orleans (0.04)
    - California
      - Los Angeles County > Long Beach (0.04)
      - Alameda County > Berkeley (0.04)
  - Canada > Quebec
    - Montreal (0.04)
- Europe
  - United Kingdom > England
    - Greater London > London (0.04)
  - Sweden > Stockholm
    - Stockholm (0.04)
  - Germany > Baden-Württemberg
    - Freiburg (0.04)
  - France > Hauts-de-France
    - Nord > Lille (0.04)

Genre:
- Research Report (0.82)

Industry:
- Construction & Engineering (1.00)
- Energy > Renewable
  - Geothermal > Geothermal Energy Systems and Facilities > Direct Use of Geothermal Energy > Geothermal Heating, Ventilation, and Air Conditioning (HVAC) System (0.44)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Optimization (1.00)
  - Machine Learning > Reinforcement Learning (1.00)