CFP last date
20 February 2025
Reseach Article

Customer Churn Prediction using Machine Learning: A Case Study of E-commerce Data

by Jingyuan Li
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 186 - Number 48
Year of Publication: 2024
Authors: Jingyuan Li
10.5120/ijca2024924140

Jingyuan Li . Customer Churn Prediction using Machine Learning: A Case Study of E-commerce Data. International Journal of Computer Applications. 186, 48 ( Nov 2024), 22-25. DOI=10.5120/ijca2024924140

@article{ 10.5120/ijca2024924140,
author = { Jingyuan Li },
title = { Customer Churn Prediction using Machine Learning: A Case Study of E-commerce Data },
journal = { International Journal of Computer Applications },
issue_date = { Nov 2024 },
volume = { 186 },
number = { 48 },
month = { Nov },
year = { 2024 },
issn = { 0975-8887 },
pages = { 22-25 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume186/number48/customer-churn-prediction-using-machine-learning-a-case-study-of-e-commerce-data/ },
doi = { 10.5120/ijca2024924140 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-11-27T00:39:23.306950+05:30
%A Jingyuan Li
%T Customer Churn Prediction using Machine Learning: A Case Study of E-commerce Data
%J International Journal of Computer Applications
%@ 0975-8887
%V 186
%N 48
%P 22-25
%D 2024
%I Foundation of Computer Science (FCS), NY, USA
Abstract

In the highly competitive e-commerce industry, customer churn represents a major challenge to profitability and sustainability. This study aims to develop a robust predictive model for customer churn using a publicly available e-commerce dataset. The research leverages various machine learning algorithms, including Logistic Regression, Random Forest, XGBoost, and LightGBM, to compare performance. We address class imbalance with SMOTE and utilize SHAP and LIME for model interpretability. Our results demonstrate the effectiveness of the Random Forest model, achieving a ROC AUC of 0.9850. This study provides valuable insights into the factors driving customer churn, offering actionable recommendations for businesses to reduce churn rates and enhance customer retention strategies.

References
  1. Xia, G., & He, Q. (2018). The Research of Online Shopping Customer Churn Prediction Based on Integrated Learning. , 259-267. https://doi.org/10.2991/MECAE-18.2018.133.
  2. Gordini, N., & Veglio, V. (2017). Customers churn prediction and marketing retention strategies. An application of support vector machines based on the AUC parameter-selection technique in B2B e-commerce industry. Industrial Marketing Management, 62, 100-107. https://doi.org/10.1016/J.INDMARMAN.2016.08.003.
  3. Zimal, S., Shah, C., Borhude, S., Birajdar, A., & Patil, P. (2023). Customer Churn Prediction Using Machine Learning. International Journal for Research in Applied Science and Engineering Technology. https://doi.org/10.22214/ijraset.2023.49142.
  4. Peddarapu, R., Ameena, S., Yashaswini, S., Shreshta, N., & PurnaSahithi, M. (2022). Customer Churn Prediction using Machine Learning. 2022 6th International Conference on Electronics, Communication and Aerospace Technology, 1035-1040. https://doi.org/10.1109/ICECA55336.2022.10009093.
  5. Pondel, M., Wuczynski, M., Gryncewicz, W., Lysik, L., Hernes, M., Rot, A., & Kozina, A. (2021). Deep Learning for Customer Churn Prediction in E-Commerce Decision Support. , 3-12. https://doi.org/10.52825/bis.v1i.42.
  6. Zhang, X.; Guo, F.; Chen, T.; Pan, L.; Beliakov, G.; Wu, J. A Brief Survey of Machine Learning and Deep Learning Techniques for E-Commerce Research. J. Theor. Appl. Electron. Commer. Res. 2023, 18, 2188-2216. https://doi.org/10.3390/jtaer18040110
  7. Kaggle. (2021). E-commerce customer churn analysis and prediction dataset. Retrieved on September 20, 2024, from https://www.kaggle.com/datasets/ankitverma2010/ecommerce-customer-churn-analysis-and-prediction/data
Index Terms

Computer Science
Information Sciences
Machine Learning
E-commerce
Churn Prediction

Keywords

Customer Churn Random Forest XGBoost E-commerce Data