562 research outputs found
One-Class Classification: Taxonomy of Study and Review of Techniques
One-class classification (OCC) algorithms aim to build classification models
when the negative class is either absent, poorly sampled or not well defined.
This unique situation constrains the learning of efficient classifiers by
defining class boundary just with the knowledge of positive class. The OCC
problem has been considered and applied under many research themes, such as
outlier/novelty detection and concept learning. In this paper we present a
unified view of the general problem of OCC by presenting a taxonomy of study
for OCC problems, which is based on the availability of training data,
algorithms used and the application domains applied. We further delve into each
of the categories of the proposed taxonomy and present a comprehensive
literature review of the OCC algorithms, techniques and methodologies with a
focus on their significance, limitations and applications. We conclude our
paper by discussing some open research problems in the field of OCC and present
our vision for future research.Comment: 24 pages + 11 pages of references, 8 figure
A comparative study of tree-based models for churn prediction : a case study in the telecommunication sector
Dissertation presented as the partial requirement for obtaining a Master's degree in Statistics and Information Management, specialization in Marketing Research e CRMIn the recent years the topic of customer churn gains an increasing importance, which is the phenomena of the customers abandoning the company to another in the future. Customer churn plays an important role especially in the more saturated industries like telecommunication industry. Since the existing customers are very valuable and the acquisition cost of new customers is very high nowadays. The companies want to know which of their customers and when are they going to churn to another provider, so that measures can be taken to retain the customers who are at risk of churning. Such measures could be in the form of incentives to the churners, but the downside is the wrong classification of a churners will cost the company a lot, especially when incentives are given to some non-churner customers. The common challenge to predict customer churn will be how to pre-process the data and which algorithm to choose, especially when the dataset is heterogeneous which is very common for telecommunication companies’ datasets. The presented thesis aims at predicting customer churn for telecommunication sector using different decision tree algorithms and its ensemble models
Twitter Analysis to Predict the Satisfaction of Saudi Telecommunication Companies’ Customers
The flexibility in mobile communications allows customers to quickly switch from one service provider to
another, making customer churn one of the most critical challenges for the data and voice telecommunication
service industry. In 2019, the percentage of post-paid telecommunication customers in Saudi Arabia
decreased; this represents a great deal of customer dissatisfaction and subsequent corporate fiscal losses.
Many studies correlate customer satisfaction with customer churn. The Telecom companies have depended
on historical customer data to measure customer churn. However, historical data does not reveal current
customer satisfaction or future likeliness to switch between telecom companies. Current methods of analysing
churn rates are inadequate and faced some issues, particularly in the Saudi market.
This research was conducted to realize the relationship between customer satisfaction and customer churn
and how to use social media mining to measure customer satisfaction and predict customer churn.
This research conducted a systematic review to address the churn prediction models problems and their
relation to Arabic Sentiment Analysis. The findings show that the current churn models lack integrating
structural data frameworks with real-time analytics to target customers in real-time. In addition, the findings
show that the specific issues in the existing churn prediction models in Saudi Arabia relate to the Arabic
language itself, its complexity, and lack of resources.
As a result, I have constructed the first gold standard corpus of Saudi tweets related to telecom companies,
comprising 20,000 manually annotated tweets. It has been generated as a dialect sentiment lexicon extracted
from a larger Twitter dataset collected by me to capture text characteristics in social media. I developed a
new ASA prediction model for telecommunication that fills the detected gaps in the ASA literature and fits
the telecommunication field. The proposed model proved its effectiveness for Arabic sentiment analysis and
churn prediction. This is the first work using Twitter mining to predict potential customer loss (churn) in
Saudi telecom companies, which has not been attempted before. Different fields, such as education, have
different features, making applying the proposed model is interesting because it based on text-mining
A novel customer churn prediction model for the telecommunication industry using data transformation methods and feature selection
Data transformation (DT) is a process that transfers the original data into a
form which supports a particular classification algorithm and helps to analyze
the data for a special purpose. To improve the prediction performance we
investigated various data transform methods. This study is conducted in a
customer churn prediction (CCP) context in the telecommunication industry
(TCI), where customer attrition is a common phenomenon. We have proposed a
novel approach of combining data transformation methods with the machine
learning models for the CCP problem. We conducted our experiments on publicly
available TCI datasets and assessed the performance in terms of the widely used
evaluation measures (e.g. AUC, precision, recall, and F-measure). In this
study, we presented comprehensive comparisons to affirm the effect of the
transformation methods. The comparison results and statistical test proved that
most of the proposed data transformation based optimized models improve the
performance of CCP significantly. Overall, an efficient and optimized CCP model
for the telecommunication industry has been presented through this manuscript.Comment: 24 page
A SLR on Customer Dropout Prediction
Dropout prediction is a problem that is being addressed with machine learning algorithms;
thus, appropriate approaches to address the dropout rate are needed. The selection of an algorithm to predict
the dropout rate is only one problem to be addressed. Other aspects should also be considered, such as
which features should be selected and how to measure accuracy while considering whether the features are
appropriate according to the business context in which they are employed. To solve these questions, the
goal of this paper is to develop a systematic literature review to evaluate the development of existing studies
and to predict the dropout rate in contractual settings using machine learning to identify current trends and
research opportunities. The results of this study identify trends in the use of machine learning algorithms
in different business areas and in the adoption of machine learning algorithms, including which metrics are
being adopted and what features are being applied. Finally, some research opportunities and gaps that could
be explored in future research are presented.info:eu-repo/semantics/publishedVersio
- …