Deep clustering of cooperative multi-agent reinforcement learning to optimize multi chiller HVAC systems for smart buildings energy management
View/ Open
Author
Homod, Raad Z.
Yaseen, Zaher Mundher
Hussein, Ahmed Kadhim
Almusaed, Amjad
Alawi, Omer A.
Falah, Mayadah W.
Abdelrazek, Ali H.
Ahmed, Waqar
Eltaweel, Mahmoud
Attention
2299/25973
Abstract
Chillers are responsible for almost half of the total energy demand in buildings. Hence, the obligation of control systems of multi-chiller due to changes indoor environments is one of the most significant parts of a smart building. Such a controller is described as a nonlinear and multi-objective algorithm, and its fabrication is crucial to achieving the optimal balance between indoor thermal comfort and running a minimum number of chillers. This work proposes deep clustering of cooperative multi-agent reinforcement learning (DCCMARL) as well-suited to such system control, which supports centralized control by learning of agents. In MARL, since the learning of agents is based on discrete sets of actions and stats, this drawback significantly affects the model of agents for representing their actions with efficient performance. This drawback becomes considerably worse when increasing the number of agents, due to the increased complexity of solving MARL, which makes modeling policy very challenging. Therefore, the DCCMARL of multi-objective reinforcement learning is leveraging powerful frameworks of a hybrid clustering algorithm to deal with complexity and uncertainty, which is a critical factor that influences to the achievement of high levels of a performance action. The results showed that the ability of agents to manipulate the behavior of the smart building could improve indoor thermal conditions, as well as save energy up to 44.5% compared to conventional methods. It seems reasonable to conclude that agents' performance is influenced by what type of model structure.
Publication date
2023-04-15Published in
Journal of Building Engineering (JOBE)Published version
https://doi.org/10.1016/j.jobe.2022.105689Other links
http://hdl.handle.net/2299/25973Metadata
Show full item recordRelated items
Showing items related by title, author, creator and subject.
-
Identification of multi-component LOFAR sources with multi-modal deep learning
Alegre, Lara; Best, Philip; Sabater, Jose; Rottgering, Huub; Hardcastle, Martin; Williams, Wendy (2024-06-11)Modern high-sensitivity radio telescopes are discovering an increased number of resolved sources with intricate radio structures and fainter radio emissions. These sources often present a challenge because source detectors ... -
Multi-objective optimisation for minimum quantity lubrication assisted milling process based on hybrid response surface methodology and multi-objective genetic algorithm
Mumtaz, J.; Li, Z.; Imran, M.; Yue, L.; Jahanzaib, M.; Sarfraz, S.; Shehab, E.; Ismail, S. O.; Afzal , K. (2019-04-01)Parametric modelling and optimisation play an important role in choosing the best or optimal cutting conditions and parameters during machining to achieve the desirable results. However, analysis of optimisation of minimum ... -
Average transmit power of adaptive ZF very large multi-user and multi-antenna systems
Yue, Dian Wu; Sun, Yichuang (2015-04-01)In this paper, we investigate adaptive zero-forcing uplink transmission for very large multi-user multi-antenna systems in Rayleigh fading environments. We assume that the number of antennas at the base station (denoted ...