Sampling-based novel heterogeneous multi-layer stacking ensemble method for telecom customer churn prediction
Abstract
In recent times, customer churn has become one of the most significant issues in business-oriented sectors with telecommunication being no exception. Maintaining current customers is particularly valuable due to the high degree of rivalry among telecommunication companies and the costs of acquiring new ones. The early prediction of churned customers may help telecommunication companies to identify the causes of churn and design industrial tactics to address or mitigate the churn problem. Controlling customer churn by developing efficient and reliable customer churn prediction (CCP) solutions is essential to achieving this objective. Findings from existing CCP studies have shown that numerous methods, such as rule-based and machine-learning (ML) mechanisms, have been devised to solve the CCP problem. Nonetheless, the problems of adaptability and the resilience of rule-based CCP solutions are its major weaknesses, and the skewed pattern of churn datasets (class imbalance) is detrimental to the prediction performances of conventional ML models in CCP. Hence, this research developed a robust heterogeneous multi-layer stacking ensemble method (HMSE) for effective CCP. Specifically, in the HMSE method, the prediction prowess of five ML classifiers (Random Forest (RF), Bayesian network (BN), Support Vector Machine (SVM), K-Nearest Neighbour (KNN), and Repeated Incremental Pruning to Produce Error Reduction (RIPPER)) with distinct computational characteristics are ensembled based on stacking and the resulting model is further enhanced using a forest penalizing attribute (FPA) model. The synthetic minority oversampling technique (SMOTE) is integrated with the proposed HMSE to balance the skewed class label present in the original experimental datasets. Extensive tests were carried out to determine the effectiveness of the proposed HMSE and S-HMSE on standard telecom CCP datasets. Observed findings from the experimental results showed that HMSE and S-HMSE can effectively predict churners even with the class imbalance (skewed datasets) problem. In addition, comparison studies demonstrated that the suggested S-HMSE offered improved prediction performance and optimum solutions for CCP in the telecom sector in comparison with baseline classifiers, homogeneous ensemble methods, and current CCP approaches.
Citations
-
0
CrossRef
-
0
Web of Science
-
0
Scopus
Authors (8)
Cite as
Full text
- Publication version
- Accepted or Published Version
- DOI:
- Digital Object Identifier (open in new tab) 10.1016/j.sciaf.2024.e02223
- License
- open in new tab
Keywords
Details
- Category:
- Articles
- Type:
- artykuły w czasopismach
- Published in:
-
Scientific African
no. 24,
ISSN: - Language:
- English
- Publication year:
- 2024
- Bibliographic description:
- Usman-Hamza F. E., Balogun A. O., Amosa R. T., Capretz L. F., Mojeed H., Salihu S. A., Akintola A. G., Mabayoje M. A.: Sampling-based novel heterogeneous multi-layer stacking ensemble method for telecom customer churn prediction// Scientific African -,iss. Volume 24 (2024), s.e02223-
- DOI:
- Digital Object Identifier (open in new tab) 10.1016/j.sciaf.2024.e02223
- Sources of funding:
-
- Universiti Teknologi PETRONAS, under the STIRF Research Grant Scheme (015LA0-049)
- Verified by:
- Gdańsk University of Technology
seen 43 times
Recommended for you
Intelligent Decision Forest Models for Customer Churn Prediction
- F. E. Usman-Hamzah,
- A. O. Balogun,
- L. F. Capretz
- + 7 authors
Empirical analysis of tree-based classification models for customer churn prediction
- F. E. Usman-Hamza,
- A. O. Balogun,
- S. K. Nasiru
- + 6 authors
Performance Analysis of Machine Learning Methods with Class Imbalance Problem in Android Malware Detection
- A. G. Akintola,
- A. O. Balogun,
- H. Mojeed
- + 5 authors
Study of Multi-Class Classification Algorithms’ Performance on Highly Imbalanced Network Intrusion Datasets
- V. Bulavas,
- V. Marcinkevičius,
- J. Rumiński