dc.contributor.author: AlShourbaji, Ibrahim
dc.contributor.author: Helian, Na
dc.contributor.author: Sun, Yi
dc.contributor.author: Alhameed, Mohammed
dc.date.accessioned: 2021-09-20T14:00:02Z
dc.date.available: 2021-09-20T14:00:02Z
dc.date.issued: 2021-09-17
dc.identifier.citation: AlShourbaji, I, Helian, N, Sun, Y & Alhameed, M 2021, 'Anovel HEOMGA Approach for Class Imbalance Problem in the Application of Customer Churn Prediction', SN Computer Science, vol. 2, 464. https://doi.org/10.1007/s42979-021-00850-y
dc.identifier.other: ORCID: /0000-0001-6687-0306/work/100505837
dc.identifier.uri: http://hdl.handle.net/2299/25062
dc.description: © The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd 2021. This is the accepted manuscript version of an article which has been published in final form at https://doi.org/10.1007/s42979-021-00850-y
dc.description.abstract: Balancing the classes is essential when learning from highly skewed datasets; otherwise, a learner may assign all instances to the negative class, resulting in a high false-negative rate. A precise balancing strategy is therefore required. Many researchers have investigated class imbalance using Machine Learning (ML) methods, which offer stronger generalization performance and interpretive capabilities than random sampling techniques, to handle the problem in the preprocessing phase, facilitate the learning process, and improve the performance of learners. In this research, an effective method called HEOMGA is presented, which combines the Heterogeneous Euclidean-Overlap Metric (HEOM) and a Genetic Algorithm (GA) to oversample the minority class. The HEOM is employed to define a fitness function for the GA. To assess the performance of the proposed HEOMGA method, three benchmark datasets from the UCI repository in the domain of customer churn prediction are examined using three different ML learners and evaluated with three performance metrics. The experimental results show the effectiveness of the proposed method compared to popular oversampling methods such as SMOTE, ADASYN, G-SMOTE, and Gaussian oversampling. According to the Wilcoxon signed-rank test, the HEOMGA method significantly outperformed the other oversampling methods in terms of recall, G-mean, and AUC.
dc.format.extent: 12
dc.format.extent: 525564
dc.language.iso: eng
dc.relation.ispartof: SN Computer Science
dc.subject: Class imbalance problem
dc.subject: Genetic algorithm
dc.subject: HEOM
dc.subject: Oversampling
dc.subject: classification system
dc.title: Anovel HEOMGA Approach for Class Imbalance Problem in the Application of Customer Churn Prediction
dc.contributor.institution: Centre for Computer Science and Informatics Research
dc.contributor.institution: School of Physics, Engineering & Computer Science
dc.contributor.institution: Department of Computer Science
dc.contributor.institution: Biocomputation Research Group
dc.description.status: Peer reviewed
dc.date.embargoedUntil: 2022-09-17
rioxxterms.versionofrecord: 10.1007/s42979-021-00850-y
rioxxterms.type: Journal Article/Review
herts.preservation.rarelyaccessed: true
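
Illustrative note on the abstract: the record does not say how HEOMGA turns HEOM into a GA fitness function, so the sketch below covers only the HEOM distance in its commonly cited form (overlap distance for nominal attributes, range-normalised absolute difference for numeric ones, distance 1 when a value is missing). The function name, argument layout, and sample values are assumptions made for illustration and are not taken from the paper.

```python
import math


def heom_distance(x, y, nominal, ranges):
    """Hypothetical helper: HEOM distance between two records.

    x, y    -- sequences of attribute values; None marks a missing value
    nominal -- set of indices of nominal (categorical) attributes
    ranges  -- dict mapping numeric attribute index to (max - min) in the data
    """
    total = 0.0
    for i, (a, b) in enumerate(zip(x, y)):
        if a is None or b is None:
            d = 1.0  # missing values count as maximally distant
        elif i in nominal:
            d = 0.0 if a == b else 1.0  # overlap metric
        else:
            r = ranges[i]
            d = abs(a - b) / r if r > 0 else 0.0  # range-normalised difference
        total += d * d
    return math.sqrt(total)


# Made-up records: one nominal attribute, two numeric, one missing value.
example = heom_distance(("yes", 10.0, None), ("no", 20.0, 5.0),
                        nominal={0}, ranges={1: 40.0, 2: 10.0})
```

Here the per-attribute distances are 1 (different categories), 0.25 (10 / 40), and 1 (missing value), so the result is the square root of 2.0625.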

