A Multi-Agent Reinforcement Learning Approach to Price and Comfort Optimization in HVAC-Systems

Christian Blad; Simon Bøgh; Carsten Kallesøe

doi:10.3390/en14227491

Christian Blad^*, Simon Bøgh, Carsten Kallesøe

^*Corresponding author

Publication: Contribution to journal › Journal article › Research › peer-reviewed

5 Citations (Scopus)

62 Downloads (Pure)

Abstract

This paper addresses the challenge of minimizing training time for the control of Heating, Ventilation, and Air-conditioning (HVAC) systems with online Reinforcement Learning (RL). This is done by developing a novel Multi-Agent Reinforcement Learning (MARL) approach for HVAC systems. The environment formed by the HVAC system is formulated as a Markov Game (MG) in a general-sum setting. The MARL algorithm has a decentralized structure in which only relevant states are shared between agents, and actions are shared in a sequence that is sensible from a system’s point of view. The simulation environment is a domestic house located in Denmark and designed to resemble an average house. The heat source in the house is an air-to-water heat pump, and the HVAC system is an Underfloor Heating (UFH) system. The house is subjected to weather changes from a data set collected in Copenhagen in 2006, spanning the entire year except for June, July, and August, when heating is not required. It is shown that: (1) compared with Single-Agent Reinforcement Learning (SARL), MARL can reduce training time by 70% for a four-temperature-zone UFH system; (2) the agent can learn and generalize over seasons; (3) the cost of heating can be reduced by 19%, equivalent to 750 kWh of electric energy per year for an average Danish domestic house, compared with a traditional control method; and (4) oscillations in the room temperature can be reduced by 40% when comparing the RL control methods with a traditional control method.
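
The decentralized structure described in the abstract, in which each agent sees only relevant states and actions are passed along in a sequence, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the `ZoneAgent` class, the threshold rule, the 22.0 °C setpoint, and the ordering of the agents are hypothetical and are not taken from the paper, which uses learned RL policies rather than fixed rules.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class ZoneAgent:
    """Hypothetical agent controlling the valve of one temperature zone."""
    setpoint: float  # desired room temperature in deg C (assumed value)

    def act(self, room_temp: float, upstream_actions: Sequence[int]) -> int:
        # Placeholder rule standing in for a learned policy:
        # open the valve (1) when the room is below the setpoint, else close it (0).
        # upstream_actions is received but not used by this toy rule; a learned
        # policy would treat it as part of its observation.
        return 1 if room_temp < self.setpoint else 0


def sequential_step(agents: List[ZoneAgent], room_temps: Sequence[float]) -> List[int]:
    """Let the agents act one after another, sharing earlier actions downstream."""
    actions: List[int] = []
    for agent, temp in zip(agents, room_temps):
        # Each agent sees only its own zone state plus the actions already taken.
        actions.append(agent.act(temp, tuple(actions)))
    return actions


agents = [ZoneAgent(setpoint=22.0) for _ in range(4)]  # four zones, as in the abstract
valve_actions = sequential_step(agents, [21.0, 22.5, 20.0, 23.0])
print(valve_actions)
```

The point of the sketch is only the information flow: no agent receives the full system state, and the order of the loop determines which actions are visible to which agent.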

Original language	English
Article number	7491
Journal	Energies
Volume	14
Issue number	22
Number of pages	19
ISSN	1996-1073
DOI	https://doi.org/10.3390/en14227491
Status	Published - Nov. 2021

Access to document

10.3390/en14227491 (License: CC BY 4.0)

energies-14-07491: publisher's published version, 4.8 MB (License: CC BY 4.0)

Reinforcement Learning Baseret Styring til Gulvvarme Systemer (Reinforcement Learning Based Control for Underfloor Heating Systems)
Bøgh, S. & Blad, C.
01/01/2019 → 31/12/2021
Projects: Project › Research

Citation formats

@article{13a3ef0004434acd8cbbeb6c52332a4c,
title = "A Multi-Agent Reinforcement Learning Approach to Price and Comfort Optimization in HVAC-Systems",
abstract = "This paper addresses the challenge of minimizing training time for the control of Heating, Ventilation, and Air-conditioning (HVAC) systems with online Reinforcement Learning (RL). This is done by developing a novel Multi-Agent Reinforcement Learning (MARL) approach for HVAC systems. The environment formed by the HVAC system is formulated as a Markov Game (MG) in a general-sum setting. The MARL algorithm has a decentralized structure in which only relevant states are shared between agents, and actions are shared in a sequence that is sensible from a system{\textquoteright}s point of view. The simulation environment is a domestic house located in Denmark and designed to resemble an average house. The heat source in the house is an air-to-water heat pump, and the HVAC system is an Underfloor Heating (UFH) system. The house is subjected to weather changes from a data set collected in Copenhagen in 2006, spanning the entire year except for June, July, and August, when heating is not required. It is shown that: (1) compared with Single-Agent Reinforcement Learning (SARL), MARL can reduce training time by 70\% for a four-temperature-zone UFH system; (2) the agent can learn and generalize over seasons; (3) the cost of heating can be reduced by 19\%, equivalent to 750 kWh of electric energy per year for an average Danish domestic house, compared with a traditional control method; and (4) oscillations in the room temperature can be reduced by 40\% when comparing the RL control methods with a traditional control method.",
keywords = "Reinforcement Learning, multi-agent RL, HVAC, Comfort, Energy, Artificial Intelligence, Underfloor heating, Energy in buildings, HVAC-systems, Deep reinforcement learning, Predictive analytics",
author = "Christian Blad and Simon B{\o}gh and Carsten Kalles{\o}e",
year = "2021",
month = nov,
doi = "10.3390/en14227491",
language = "English",
volume = "14",
journal = "Energies",
issn = "1996-1073",
publisher = "MDPI AG",
number = "22",
}

TY  - JOUR
T1  - A Multi-Agent Reinforcement Learning Approach to Price and Comfort Optimization in HVAC-Systems
AU  - Blad, Christian
AU  - Bøgh, Simon
AU  - Kallesøe, Carsten
PY  - 2021/11
Y1  - 2021/11
AB  - This paper addresses the challenge of minimizing training time for the control of Heating, Ventilation, and Air-conditioning (HVAC) systems with online Reinforcement Learning (RL). This is done by developing a novel Multi-Agent Reinforcement Learning (MARL) approach for HVAC systems. The environment formed by the HVAC system is formulated as a Markov Game (MG) in a general-sum setting. The MARL algorithm has a decentralized structure in which only relevant states are shared between agents, and actions are shared in a sequence that is sensible from a system’s point of view. The simulation environment is a domestic house located in Denmark and designed to resemble an average house. The heat source in the house is an air-to-water heat pump, and the HVAC system is an Underfloor Heating (UFH) system. The house is subjected to weather changes from a data set collected in Copenhagen in 2006, spanning the entire year except for June, July, and August, when heating is not required. It is shown that: (1) compared with Single-Agent Reinforcement Learning (SARL), MARL can reduce training time by 70% for a four-temperature-zone UFH system; (2) the agent can learn and generalize over seasons; (3) the cost of heating can be reduced by 19%, equivalent to 750 kWh of electric energy per year for an average Danish domestic house, compared with a traditional control method; and (4) oscillations in the room temperature can be reduced by 40% when comparing the RL control methods with a traditional control method.
KW  - Reinforcement Learning
KW  - multi-agent RL
KW  - HVAC
KW  - Comfort
KW  - Energy
KW  - Artificial Intelligence
KW  - Underfloor heating
KW  - Energy in buildings
KW  - HVAC-systems
KW  - Deep reinforcement learning
KW  - Predictive analytics
UR  - http://www.scopus.com/inward/record.url?scp=85119337181&partnerID=8YFLogxK
U2  - 10.3390/en14227491
DO  - 10.3390/en14227491
M3  - Journal article
SN  - 1996-1073
VL  - 14
JO  - Energies
JF  - Energies
IS  - 22
M1  - 7491
ER  - 
