Deep clustering of cooperative multi-agent reinforcement learning to optimize multi chiller HVAC systems for smart buildings energy management

Homod, Raad Z.; Yaseen, Zaher Mundher; Hussein, Ahmed Kadhim; Almusaed, Amjad; Alawi, Omer A.; Falah, Mayadah W.; Abdelrazek, Ali H.; Ahmed, Waqar; Eltaweel, Mahmoud

View/Open

JBE_D_22_07022_R2.pdf (PDF, 6Mb)

Author

Homod, Raad Z.

Yaseen, Zaher Mundher

Hussein, Ahmed Kadhim

Almusaed, Amjad

Alawi, Omer A.

Falah, Mayadah W.

Abdelrazek, Ali H.

Ahmed, Waqar

Eltaweel, Mahmoud

Abstract

Chillers are responsible for almost half of the total energy demand in buildings. Hence, the obligation of control systems of multi-chiller due to changes indoor environments is one of the most significant parts of a smart building. Such a controller is described as a nonlinear and multi-objective algorithm, and its fabrication is crucial to achieving the optimal balance between indoor thermal comfort and running a minimum number of chillers. This work proposes deep clustering of cooperative multi-agent reinforcement learning (DCCMARL) as well-suited to such system control, which supports centralized control by learning of agents. In MARL, since the learning of agents is based on discrete sets of actions and stats, this drawback significantly affects the model of agents for representing their actions with efficient performance. This drawback becomes considerably worse when increasing the number of agents, due to the increased complexity of solving MARL, which makes modeling policy very challenging. Therefore, the DCCMARL of multi-objective reinforcement learning is leveraging powerful frameworks of a hybrid clustering algorithm to deal with complexity and uncertainty, which is a critical factor that influences to the achievement of high levels of a performance action. The results showed that the ability of agents to manipulate the behavior of the smart building could improve indoor thermal conditions, as well as save energy up to 44.5% compared to conventional methods. It seems reasonable to conclude that agents' performance is influenced by what type of model structure.

Publication date

2023-04-15

Published in

Journal of Building Engineering (JOBE)

Published version

https://doi.org/10.1016/j.jobe.2022.105689

Metadata

Show full item record

University of Hertfordshire Research Archive