Acta Informatica Pragensia 2025, 14(1), 112-127 | DOI: 10.18267/j.aip.2554256
Predicting Employee Turnover Using Machine Learning Techniques
- 1 Multidisciplinary Research Laboratory in Economics and Management, Faculty of Economics and Management, Sultan Moulay Slimane University, Beni Mellal, Morocco
- 2 Computer Science Department, Faculty of Sciences, University Sidi Mohamed Ben Abdallah, Fez, Morocco
Background: Employee turnover is a persistent issue in human resource management, leading to significant costs for organizations.
Objective: This study aims to identify the most effective machine learning model for predicting employee attrition, thereby providing organizations with a reliable tool to anticipate turnover and implement proactive retention strategies.
Methods: Nine machine learning algorithms are applied to a dataset of 1,470 employee records. After data preprocessing and splitting into training and test sets, the models are evaluated on metrics including accuracy, precision, recall, F1 score and AUC. Model performance is optimized through hyperparameter tuning, using grid search with cross-validation.
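As an illustration of the evaluation metrics named in the Methods, the following minimal sketch computes accuracy, precision, recall, F1 score and AUC for a binary attrition label in plain Python. It is not the authors' code: the function names, the 0.5 decision threshold and the eight toy labels and scores are all hypothetical and chosen only to make the definitions concrete.

```python
from typing import Dict, Sequence, Tuple


def confusion_counts(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[int, int, int, int]:
    """Return (tp, fp, fn, tn) for binary labels, where 1 means the employee left."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    return tp, fp, fn, tn


def classification_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Accuracy, precision, recall and F1 score from hard 0/1 predictions."""
    tp, fp, fn, tn = confusion_counts(y_true, y_pred)
    accuracy = (tp + tn) / len(y_true)
    # Guard against empty denominators when a class is never predicted or never present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = 2 * precision * recall / (precision + recall) if (precision + recall) else 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the fraction of (positive, negative) pairs ranked correctly; ties count as 0.5."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative example")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: true attrition labels and model scores for eight employees.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
scores = [0.9, 0.2, 0.6, 0.4, 0.5, 0.1, 0.8, 0.3]
# Assumed decision threshold of 0.5 to turn scores into hard predictions.
y_pred = [1 if s >= 0.5 else 0 for s in scores]

metrics = classification_metrics(y_true, y_pred)
auc = roc_auc(y_true, scores)
print(metrics, auc)
```

Accuracy, precision, recall and F1 depend on the chosen threshold, whereas AUC is computed from the raw scores and measures ranking quality across all thresholds. This is one reason a model can lead on accuracy while another shows a stronger AUC.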
Results: Logistic regression achieves the highest accuracy and precision, making it the top-performing model overall. Random forest provides a balanced performance with strong AUC, offering a robust alternative.
Conclusion: Human resources managers and directors should consider using logistic regression or random forest for predictive modelling of employee turnover, as these models have shown strong performance. Future research should employ causal analysis for deeper insights. Real-time monitoring and adaptive prediction could also enhance models, offering a dynamic approach to attrition management.
Keywords: Human resource management; HRM; Machine learning; Employee attrition; Prediction.
Received: October 23, 2024; Revised: November 13, 2024; Accepted: December 7, 2024; Prepublished online: January 12, 2025; Published: January 31, 2025
This is an open access article distributed under the terms of the Creative Commons Attribution 4.0 International License (CC BY 4.0), which permits use, distribution, and reproduction in any medium, provided the original publication is properly cited. No use, distribution or reproduction is permitted which does not comply with these terms.