Abstract
Customer churn is a vital and reoccurring problem facing most business industries, particularly the telecommunications industry. Considering the fierce competition among telecommunications firms and the high expenses of attracting and gaining new subscribers, keeping existing loyal subscribers becomes crucial. Early prediction of disgruntled subscribers can assist telecommunications firms in identifying the reasons for churn and in deploying applicable innovative policies to boost productivity, maintain market competitiveness, and reduce monetary damages. Controlling customer churn through the development of efficient and dependable customer churn prediction (CCP) solutions is imperative to attaining this goal. According to the outcomes of current CCP research, several strategies, including rule-based and machine-learning (ML) processes, have been proposed to handle the CCP phenomenon. However, the lack of flexibility and robustness of rule based CCP solutions is a fundamental shortcoming, and the lopsided distribution of churn datasets is deleterious to the efficacy of most traditional ML techniques in CCP. Regardless, ML-based CCP solutions have been reported to be more effective than other forms of CCP solutions. Unlike linear-based, instance-based, and function-based ML classifiers, tree-based ML classifiers are known to generate predictive models with high accuracy, high stability, and ease of interpretation. However, the deployment of tree-based classifiers for CCP is limited in most cases to the decision tree (DT) and random forest (RF). Hence, this research investigated the effectiveness of tree-based classifiers with diverse computational properties in CCP. Specifically, the CCP performances of diverse tree-based classifiers such as the single, ensemble, enhanced, and hybrid tree-based classifiers are investigated. Also, the effects of data quality problems such as the class imbalance problem (CIP) on the predictive performances of tree-based classifiers and their homogeneous ensemble variants on CCP were assessed. From the experimental results, it was observed that the investigated tree-based classifiers outperformed other forms of classifiers such as linear-based (Support Vector Machine (SVM)), instance-based (K-Nearest Neighbour (KNN)), Bayesian-based (Naïve Bayes (NB)) and function-based (MultiLayer Perceptron (MLP)) classifiers in most cases with or without the CIP.
Citations
-
2
CrossRef
-
0
Web of Science
-
3
Scopus
Authors (9)
Cite as
Full text
- Publication version
- Accepted or Published Version
- DOI:
- Digital Object Identifier (open in new tab) 10.1016/j.sciaf.2023.e02054
- License
- open in new tab
Keywords
Details
- Category:
- Articles
- Type:
- artykuły w czasopismach
- Published in:
-
Scientific African
no. 23,
ISSN: - Language:
- English
- Publication year:
- 2023
- Bibliographic description:
- Usman-Hamza F. E., Balogun A. O., Nasiru S. K., Capretz L. F., Mojeed H., Salihu S. A., Akintola A. G., Mabayoje M. A., Awotunde J. B.: Empirical analysis of tree-based classification models for customer churn prediction// Scientific African -,iss. 23 (2023), s.e02054-
- DOI:
- Digital Object Identifier (open in new tab) 10.1016/j.sciaf.2023.e02054
- Sources of funding:
-
- Free publication
- Verified by:
- Gdańsk University of Technology
seen 55 times
Recommended for you
Sampling-based novel heterogeneous multi-layer stacking ensemble method for telecom customer churn prediction
- F. E. Usman-Hamza,
- A. O. Balogun,
- R. T. Amosa
- + 5 authors
Intelligent Decision Forest Models for Customer Churn Prediction
- F. E. Usman-Hamzah,
- A. O. Balogun,
- L. F. Capretz
- + 7 authors
Performance Analysis of Machine Learning Methods with Class Imbalance Problem in Android Malware Detection
- A. G. Akintola,
- A. O. Balogun,
- H. Mojeed
- + 5 authors
Study of Multi-Class Classification Algorithms’ Performance on Highly Imbalanced Network Intrusion Datasets
- V. Bulavas,
- V. Marcinkevičius,
- J. Rumiński