Resampling and Ensemble Strategies for Churn Prediction


Creative Commons License

ÇELIK S., Tolun Tayalı S.

Bilişim Teknolojileri Dergisi, cilt.16, sa.4, ss.263-273, 2023 (Hakemli Dergi) identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 16 Sayı: 4
  • Basım Tarihi: 2023
  • Doi Numarası: 10.17671/gazibtd.1314870
  • Dergi Adı: Bilişim Teknolojileri Dergisi
  • Derginin Tarandığı İndeksler: Applied Science & Technology Source, Computer & Applied Sciences, TR DİZİN (ULAKBİM)
  • Sayfa Sayıları: ss.263-273
  • İstanbul Üniversitesi Adresli: Evet

Özet

Churn analysis is a customer relationship management analytics that companies implement to predict the customers who are likely to terminate doing business with them. The success of marketing efforts to retain the existing customers is possible only if probable churners are correctly specified beforehand. Therefore, having powerful models with high prediction capabilities that lead to a profit growth is crucial. The imbalanced nature of churn datasets negatively effects the classification performance of machine learning methods. This study examines resampling –over- and under-sampling- and ensemble learning –bagging, boosting, and stacking– strategies integrated with the cross-validation procedure on imbalanced churn prediction. The experimental results, which are compared to the results of Support Vector Machines taken as the benchmark, show that ensemble methods improve the prediction performances. Also, applying over-sampling achieves a noticeable performance in comparison with the under-sampling approach.