Comparative Analysis of Resampling and Feature Selection Methods for Employee Turnover Prediction


Yagmur G., Sarikaya B., Najaflou N.

31st IEEE Conference on Signal Processing and Communications Applications (SIU), İstanbul, Türkiye, 5 - 08 Temmuz 2023, (Tam Metin Bildiri) identifier

  • Yayın Türü: Bildiri / Tam Metin Bildiri
  • Doi Numarası: 10.1109/siu59756.2023.10224012
  • Basıldığı Şehir: İstanbul
  • Basıldığı Ülke: Türkiye
  • İstanbul Üniversitesi Adresli: Hayır

Özet

Employees are the most valuable assets of each company. Unexpected employee turnover imposes something between %30 and %150 of the employee's annual salary to the company. In this study, different data balancing methods were applied to regulate the imbalances in the data set and to handle imbalanced data problem. In addition, to reduce the number of features in the data set, RFE and Boruta feature selection techniques were applied to compare their performance. We applied prediction algorithms from 3 different categories including classic machine learning, ensemble methods and deep learning. Overall, oversampling method has been shown to perform better than undersampling. Among the algorithms, XGBOOST achieved the highest performance with %90.90 F1 Score.