Enhancing Customer Churn Prediction and Retention for E-Commerce
DOI:
https://doi.org/10.15662/IJARCST.2026.0903003Keywords:
Customer Churn Prediction, Machine Learning, Data Preprocessing, Feature Engineering, Predictive Analytics, Logistic Regression, Decision Tree, Random Forest, Gradient Boosting, Model Evaluation, Accuracy, Precision, Recall, F1-Score, Customer Retention, Personalized Marketing, E-commerce Analytics, Data Visualization, Business Intelligence, Interactive DashboardAbstract
Customer churn is a major challenge in e-commerce, as losing customers directly impacts revenue and business growth. This project presents an intelligent customer churn prediction and retention system that utilizes machine learning techniques to identify customers who are likely to discontinue their engagement. The system is designed as a complete data pipeline, incorporating stages such as data collection, preprocessing, feature engineering, model training, and prediction. Multiple machine learning algorithms, including Logistic Regression, Decision Tree, Random Forest, and Gradient Boosting, are implemented to analyze customer behavior and improve prediction accuracy. The system evaluates and compares model performance using metrics such as accuracy, precision, recall, and F1-score to ensure optimal results. Based on the prediction outcomes, a retention strategy module is integrated to recommend business actions such as personalized offers, loyalty rewards, and targeted marketing campaigns to retain customers. Furthermore, the system includes visualization and reporting features through an interactive dashboard, enabling users to easily interpret insights and make data-driven decisions. This project demonstrates the effective application of machine learning in real-world e-commerce scenarios to enhance customer retention, optimize business strategies, and improve overall operational efficiency.
References
1. T. Zhang, S. Moro, and R. F. Ramos, “A data-driven approach to improve customer churn prediction based on telecom customer segmentation,” Future Internet, vol. 14, no. 3, pp. 1–18, 2022.
2. K. Matuszelański and K. Kopczewska, “Customer churn in retail e-commerce business: Spatial and machine learning approach,” Journal of Theoretical and Applied Electronic Commerce Research, vol. 17, no. 1, pp. 1–20, 2022.
3. M. Bogaert and L. Delaere, “Ensemble methods in customer churn prediction: A comparative analysis of the state-of-the-art,” Mathematics, vol. 11, no. 5, pp. 1–25, 2023.
4. J. Li, “Identification of customer churn considering difficult case mining systems,” Systems, vol. 11, no. 7, pp. 325–338, 2023.
5. V. Chang, “Prediction of customer churn behavior in the telecommunication industry using machine learning models and algorithms,” Algorithms, vol. 17, no. 6, pp. 231–249, 2024.
6. N. V. Chawla, K. W. Bowyer, L. O. Hall, and W. P. Kegelmeyer, “SMOTE: Synthetic minority over-sampling technique,” Journal of Artificial Intelligence Research, vol. 16, pp. 321–357, 2002.
7. L. Breiman, “Random forests,” Machine Learning, vol. 45, no. 1, pp. 5–32, 2001.
8. J. H. Friedman, “Greedy function approximation: A gradient boosting machine,” Annals of Statistics, vol. 29, no. 5, pp. 1189–1232, 2001.
9. Sarma, N. R., Nisarga, S., Prashanth, R., Iram, R., & Mani, S. A. (2025). Food Recoginition and Calorie Estimation using Machine Learning. IJSAT-International Journal on Science and Technology, 16(4).
10. Seedha Devi, V., Selvi, D., Uma Maheshwari, K., & Yuvashree, G. (2026). Food linker: A smart system for global waste reduction. International Journal of Engineering & Extended Technologies Research (IJEETR), 8(3), 5012–5021. https://doi.org/10.15662/IJEETR.2026.0803002
11. Kumar, S. A., & Anand, L. (2025). A Novel EEG-Based Deep Learning Framework for Enhancing Communication in Locked-In Syndrome Using P300 Speller and Attention Mechanisms. KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 19(11), 3841-3855.
12. Seedha Devi, V., Harshini, R., Dhana Lakshmi, E., Gayathri, N., & Nithesha, P. (2026). Low-code mobile application builder with AI-assisted features using Flutter & Firebase. International Journal of Research Publications in Engineering, Technology and Management (IJRPETM), 9(3), 1001–1010. https://doi.org/10.15662/IJRPETM.2026.0903001
13. Murugeshwari, B., Sarukesi, K., & Jayakumar, C. (2010, March). An efficient method for knowledge hiding through database extension. In 2010 International Conference on Recent Trends in Information, Telecommunication and Computing (pp. 342-344). IEEE.
14. Sharma, K. P., Kumar, I., Singh, P. P., Anbazhagan, K., Albarakati, H. M., Bhatt, M. W., ... & Rana, A. (2024). Advancing spacecraft rendezvous and docking through safety reinforcement learning and ubiquitous learning principles. Computers in Human Behavior, 153, 108110.
15. Sugumar, R. (2025). Federated AI in Offline-First Mobile Health Architectures for Privacy-Preserving Clinical Intelligence. International Journal of Science, Research and Technology, 8(4), 14589-14600.
16. Rajasekar, M. (2025). Risk-Aware Generative AI and Machine Learning Frameworks for Privacy-Preserving Banking and Trade Analytics over Cloud and 5G Networks. International Journal of Computer Technology and Electronics Communication, 8(4), 11078-11086.
17. Mathew, A. Trust Is Not a Default Control: AI-Powered Social Engineering and the Need to Have New Governance.
18. Murugeshwari, B., Jothi, D., Hemalatha, B., & Pari, S. N. (2023). Trust Aware Privacy Preserving Routing Protocol for Wireless Adhoc Network. arXiv preprint arXiv:2304.14653.
19. Rajasekar, M., Nahar, G., Jagatheeswaran, S., Chinthamani, S. A. M., Mohammed, S. H., & Al-Hilali, A. (2024, May). The Roadmap to Classify Malware Using ML Algo Through IOT Based SN. In 2024 4th International Conference on Advance Computing and Innovative Technologies in Engineering (ICACITE) (pp. 127-130). IEEE.
20. Sugumar, R. (2025). Separating Technology and Trust: A Survey Analysis of Patients’ Attitudes toward AI-Assisted Healthcare Decision-Making. International Journal of Humanities and Information Technology, 7(01), 72-79.
21. Kumar, K. A., Vimal, V. R., & Karthick, V. (2024, June). Error Rate Increment of Quantum Computer using Quantity Computational Algorithm Comparing with Deutsch Jozsa Algorithm. In 2024 OPJU International Technology Conference (OTCON) on Smart Computing for Innovation and Advancement in Industry 4.0 (pp. 1-5). IEEE.
22. Vimal, V. R., Jayalakshmi, D., Narayanan, L. K., Hemavathi, R., & Loganayagi, S. (2024, November). 5G-Enabled Remote Healthcare Monitoring for Improved Patient Care. In 2024 International Conference on Recent Advances in Science and Engineering Technology (ICRASET) (pp. 1-5). IEEE.
23. Seedha Devi, V., Kumar, M. D., & Kumar, C. A. (2026). Flutter-based SOS alert and location tracking application with volunteer assist and rescue. International Journal of Research and Applied Innovations (IJRAI), 9(3), 521–530. https://doi.org/10.15662/IJRAI.2026.0903003
24. Rekha, V. B., Manchattahalli, U. R., Nithya, S., Devi, V. S., Anand, A. J., & Jacob, M. S. (2025, September). Data-Driven Predictive Modeling of Brain Responses under Morphological Priming Using Machine Learning. In 2025 International Conference on Electrical and Electronics Engineering (ICE3) (pp. 1-6). IEEE.
25. Saravanan, M., & Sivakumaran, T. S. (2016). Three phase dual input direct matrix converter for integration of two AC sources from wind turbines. Circuits Syst., 7, 3807-3817.
26. Alangaram, S., Praveen, S., Rajesh, V., & Sanjai, A. (2026). Sales guard AI-driven decision intelligence platform for business optimization. International Journal of Engineering & Extended Technologies Research (IJEETR), 8(3), 5022–5031. https://doi.org/10.15662/IJEETR.2026.0803003
27. Seedha Devi, V., Nivedha, S., Harisha, V., Mol, D. R., & Janaranjini, J. R. (2026). Enhanced prediction of PCOS and PCOD using deep learning for early diagnosis and clinical risk stratification. International Journal of Advanced Research in Computer Science & Technology (IJARCST), 9(3), 783–793.
28. Anand, L., Tyagi, R., & Mehta, V. (2024, January). Food recognition using deep learning for recipe and restaurant recommendation. In Proceedings of Eighth International Conference on Information System Design and Intelligent Applications (pp. 269-279). Singapore: Springer Nature Singapore.
29. Anbazhagan, K., Kumar, R., Thilagavathy, R., & Anuradha, D. (2024, March). Shortest Job First with Gateway-based Resource Management Strategy for Fog Enabled Cloud Computing. In 2024 4th International Conference on Data Engineering and Communication Systems (ICDECS) (pp. 1-6). IEEE.
30. Seedha Devi, V., Namitha, B., Divya Dharshini, J., & Livetha, K. (2026). A hybrid biometric and geo-fencing based smart attendance system. International Journal of Advanced Research in Computer Science and Technology (IJARCST), 9(3), 794–802. https://doi.org/10.15662/IJARCST.2026.0903002
31. MATHEW, A. (2025). BEYOND THE BURNER: THE SYSTEMIC RISKS OF DISPOSABLE EMAIL ECOSYSTEMS.
32. Seedha Devi, V., Harshini, R., Dhana Lakshmi, E., Gayathri, N., & Nithesha, P. (2026). Low-code mobile application builder with AI-assisted features using Flutter & Firebase. International Journal of Research Publications in Engineering, Technology and Management (IJRPETM), 9(3), 1001–1010.
33. F. Pedregosa et al., “Scikit-learn: Machine learning in Python,” Journal of Machine Learning Research, vol. 12, pp. 2825–2830, 2011.
34. T. Chen and C. Guestrin, “XGBoost: A scalable tree boosting system,” in Proc. 22nd ACM SIGKDD Int. Conf. Knowledge Discovery and Data Mining, 2016, pp. 785–794.
35. D. Dua and C. Graff, “UCI machine learning repository,” University of California, Irvine, 2019.
36. I. Guyon and A. Elisseeff, “An introduction to variable and feature selection,” Journal of Machine Learning Research, vol. 3, pp. 1157–1182, 2003.
37. T. Fawcett, “An introduction to ROC analysis,” Pattern Recognition Letters, vol. 27, no. 8, pp. 861–874, 2006.
38. J. Davis and M. Goadrich, “The relationship between precision-recall and ROC curves,” in Proc. 23rd Int. Conf. Machine Learning, 2006, pp. 233–240.
39. C. Cortes and V. Vapnik, “Support-vector networks,” Machine Learning, vol. 20, pp. 273–297, 1995.
40. T. Hastie, R. Tibshirani, and J. Friedman, The Elements of Statistical Learning. New York, NY, USA: Springer, 2009.


