Predicting customer churn using machine learning: A case study in the software industry

  • Original Article
  • Published:
Journal of Marketing Analytics

Abstract

Customer churn is the phenomenon of customers discontinuing their relationship with a company. The problem cuts across many industries, including the software industry. This study uses Machine Learning to build a predictive model that identifies potential churners in a Portuguese software house. Six popular Machine Learning models (Random Forest, AdaBoost, Gradient Boosting Machine, Multilayer Perceptron Classifier, XGBoost, and Logistic Regression) were developed to assess which performs best. The experimental results show that boosting techniques such as XGBoost present the best predictive performance: the XGBoost model achieves a Recall of 0.85 and a ROC AUC of 0.86. Beyond model performance, the analysis of the model's feature importance revealed that factors such as the time to solve a support ticket, the type of application, the license age, and the number of incidents significantly influence customer churn. These insights can help the software industry identify key drivers of churn and prioritize retention efforts accordingly.
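The two metrics quoted in the abstract, Recall and ROC AUC, can be illustrated with a minimal sketch. This is not the authors' code or data: the helper names `recall` and `roc_auc`, the toy labels, the scores, and the 0.5 threshold are all assumptions made for illustration, using only the standard definitions of the two metrics.

```python
def recall(y_true, y_pred):
    """Fraction of actual churners (label 1) that the model flagged as churners."""
    true_pos = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    actual_pos = sum(1 for t in y_true if t == 1)
    if actual_pos == 0:
        raise ValueError("recall is undefined without positive examples")
    return true_pos / actual_pos


def roc_auc(y_true, scores):
    """Probability that a random churner is scored above a random non-churner.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("ROC AUC needs both positive and negative examples")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 1 = churned, 0 = retained (not from the study).
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
scores = [0.9, 0.2, 0.6, 0.4, 0.5, 0.1, 0.8, 0.3]

# Assumed decision threshold; Recall depends on it, ROC AUC does not.
threshold = 0.5
y_pred = [1 if s >= threshold else 0 for s in scores]

print(recall(y_true, y_pred))   # 0.75
print(roc_auc(y_true, scores))  # 0.9375
```

Recall matters in a churn setting because a missed churner is a lost chance to intervene, while ROC AUC summarizes how well the scores rank churners above non-churners across all thresholds.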

Funding

This work was supported by national funds through FCT (Fundação para a Ciência e a Tecnologia) under project UIDB/04152/2020, Centro de Investigação em Gestão de Informação (MagIC)/NOVA IMS.

Author information

Corresponding author

Correspondence to Nuno Antonio.

Ethics declarations

Conflict of interest

The authors declare that they have a relationship with the studied software house, but this relationship did not influence the work reported in this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

Dias, J.R., Antonio, N. Predicting customer churn using machine learning: A case study in the software industry. J Market Anal (2023). https://doi.org/10.1057/s41270-023-00269-9

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1057/s41270-023-00269-9

Keywords
