528 research outputs found
Review of Data Mining Techniques for Churn Prediction in Telecom
Telecommunication sector generates a huge amount of data due to increasing number of subscribers, rapidly renewable technologies; data based applications and other value added service. This data can be usefully mined for churn analysis and prediction. Significant research had been undertaken by researchers worldwide to understand the data mining practices that can be used for predicting customer churn. This paper provides a review of around 100 recent journal articles starting from year 2000 to present the various data mining techniques used in multiple customer based churn models. It then summarizes the existing telecom literature by highlighting the sample size used, churn variables employed and the findings of different DM techniques. Finally, we list the most popular techniques for churn prediction in telecom as decision trees, regression analysis and clustering, thereby providing a roadmap to new researchers to build upon novel churn management models
Review of Data Mining Techniques for Churn Prediction in Telecom
Telecommunication sector generates a huge amount of data due to increasing number of subscribers, rapidly renewable technologies; data based applications and other value added service. This data can be usefully mined for churn analysis and prediction. Significant research had been undertaken by researchers worldwide to understand the data mining practices that can be used for predicting customer churn. This paper provides a review of around 100 recent journal articles starting from year 2000 to present the various data mining techniques used in multiple customer based churn models. It then summarizes the existing telecom literature by highlighting the sample size used, churn variables employed and the findings of different DM techniques. Finally, we list the most popular techniques for churn prediction in telecom as decision trees, regression analysis and clustering, thereby providing a roadmap to new researchers to build upon novel churn management models
A SLR on Customer Dropout Prediction
Dropout prediction is a problem that is being addressed with machine learning algorithms;
thus, appropriate approaches to address the dropout rate are needed. The selection of an algorithm to predict
the dropout rate is only one problem to be addressed. Other aspects should also be considered, such as
which features should be selected and how to measure accuracy while considering whether the features are
appropriate according to the business context in which they are employed. To solve these questions, the
goal of this paper is to develop a systematic literature review to evaluate the development of existing studies
and to predict the dropout rate in contractual settings using machine learning to identify current trends and
research opportunities. The results of this study identify trends in the use of machine learning algorithms
in different business areas and in the adoption of machine learning algorithms, including which metrics are
being adopted and what features are being applied. Finally, some research opportunities and gaps that could
be explored in future research are presented.info:eu-repo/semantics/publishedVersio
A novel customer churn prediction model for the telecommunication industry using data transformation methods and feature selection
Data transformation (DT) is a process that transfers the original data into a
form which supports a particular classification algorithm and helps to analyze
the data for a special purpose. To improve the prediction performance we
investigated various data transform methods. This study is conducted in a
customer churn prediction (CCP) context in the telecommunication industry
(TCI), where customer attrition is a common phenomenon. We have proposed a
novel approach of combining data transformation methods with the machine
learning models for the CCP problem. We conducted our experiments on publicly
available TCI datasets and assessed the performance in terms of the widely used
evaluation measures (e.g. AUC, precision, recall, and F-measure). In this
study, we presented comprehensive comparisons to affirm the effect of the
transformation methods. The comparison results and statistical test proved that
most of the proposed data transformation based optimized models improve the
performance of CCP significantly. Overall, an efficient and optimized CCP model
for the telecommunication industry has been presented through this manuscript.Comment: 24 page
Cross-company customer churn prediction in telecommunication: A comparison of data transformation methods
© 2018 Elsevier Ltd Cross-Company Churn Prediction (CCCP) is a domain of research where one company (target) is lacking enough data and can use data from another company (source) to predict customer churn successfully. To support CCCP, the cross-company data is usually transformed to a set of similar normal distribution of target company data prior to building a CCCP model. However, it is still unclear which data transformation method is most effective in CCCP. Also, the impact of data transformation methods on CCCP model performance using different classifiers have not been comprehensively explored in the telecommunication sector. In this study, we devised a model for CCCP using data transformation methods (i.e., log, z-score, rank and box-cox) and presented not only an extensive comparison to validate the impact of these transformation methods in CCCP, but also evaluated the performance of underlying baseline classifiers (i.e., Naive Bayes (NB), K-Nearest Neighbour (KNN), Gradient Boosted Tree (GBT), Single Rule Induction (SRI) and Deep learner Neural net (DP)) for customer churn prediction in telecommunication sector using the above mentioned data transformation methods. We performed experiments on publicly available datasets related to the telecommunication sector. The results demonstrated that most of the data transformation methods (e.g., log, rank, and box-cox) improve the performance of CCCP significantly. However, the Z-Score data transformation method could not achieve better results as compared to the rest of the data transformation methods in this study. Moreover, it is also investigated that the CCCP model based on NB outperform on transformed data and DP, KNN and GBT performed on the average, while SRI classifier did not show significant results in term of the commonly used evaluation measures (i.e., probability of detection, probability of false alarm, area under the curve and g-mean)
- …