COMPARATIVE ANALYSIS OF MACHINE LEARNING MODELS FOR PREDICTIVE MAINTENANCE: CHALLENGES AND OPPORTUNITIES

Helal, Manal E., Vidalis, Stilianos, Ali, Baari, Foerster, Frank and Chiejina, Eric (2026) COMPARATIVE ANALYSIS OF MACHINE LEARNING MODELS FOR PREDICTIVE MAINTENANCE: CHALLENGES AND OPPORTUNITIES. Journal of Al-Azhar University Engineering Sector, 78 (21). pp. 128-150. ISSN 1687-8418
Copy

Predictive maintenance of Hard Disk Drives (HDDs) using machine learning is vitalfor preventing data loss and operational downtime in data centres. SequentialSMART (Self-Monitoring, Analysis, and Reporting Technology) data provides a richsource for forecasting impending failures. However, a clear consensus on the mosteffective modelling approach is absent, constituting a knowledge gap caused bystudies that focus on a single paradigm without a comprehensive, comparativebenchmark across multiple task types. This research aims to systematicallyevaluate and rank the performance of over 20 models across regression,classification, time-series forecasting, and anomaly detection tasks to identify theoptimal strategies for predicting HDD failures. We conducted an extensiveempirical study, employing tree-based models, deep learning architectures,including LSTMs (Long Short-Term Memory networks), GRUs (Gated RecurrentUnits), and Temporal Fusion Transformers (TFT), as well as various autoencoders.A full hyperparameter sweep was performed for time-series models to ensurerobust comparisons. Tree-based models excelled in static analysis, while deeplearning was superior for temporal sequences. The TFT model achieved thehighest average accuracy (86.12%) and best generalisation. The GRU modelattained the highest failure recall (92.52% peak accuracy). Conversely, anomalydetection methods demonstrated only moderate performance (65-73% accuracy).The key contribution of this work is a definitive, evidence-based model selectionframework that addresses this gap. The TFT architecture is identified as the mostrobust and efficient model for analysing temporal SMART data. Practitionersshould prioritise tree-based models for static tabular data, TFT for accurate andreliable forecasting, and GRU for maximising failure detection sensitivity, therebysignificantly enhancing predictive maintenance frameworks.

picture_as_pdf

picture_as_pdf
predMaint_azhar_2025_final.pdf
subject
Published Version
Available under Creative Commons: BY-SA 4.0

View Download

EndNote BibTeX Reference Manager Refer Atom Dublin Core METS MODS OpenURL ContextObject HTML Citation ASCII Citation OpenURL ContextObject in Span Data Cite XML MPEG-21 DIDL RIOXX2 XML
Export

Downloads