96 research outputs found
Developing strategies to retain organizational insurers using a clustering technique: evidence from the insurance industry
Formulating strategies to maintain policyholders is one of the main challenges for most insurance companies in Iran. The purpose of this article is to help marketing strategists of insurance companies predict insurees' churn and develop insurees retention strategies. Since the cost of maintaining an insurance policyholder is approximately one-eighth of the cost of attracting new ones, predicting their churn can help insurance companies adopt proper strategies in advance, which will definitely lead to saving marketing costs and maintaining the insurer's portfolio. Accordingly, the main question of this research is how to classify organizational insurees with the help of the clustering technique. This research is conducted in both qualitative and quantitative phases. In the qualitative phase, by conducting a semi-structured interview (interview protocol) with 15 experts in the insurance industry, the influential factors on policyholders' churn are identified. Then, based on the factors identified in the research literature and comparing them with the interview results, eight main factors are finalized. In the quantitative phase, in order to cluster the organizational insurees, 120 samples from the Iran Insurance Company are selected, and k-means is applied for clustering. Organizational insurees are divided into two groups according to the desired indicators. Using the results of clustering, insurees are divided into four groups, and effective marketing strategies are developed for each group. According to the results, the variable “health care insurance price” has the most effective role in separating the clusters at an error level of <0.01, and on the contrary, the variable “liability insurance amount” has the least important role at an error level of <0.978
Twitter Analysis to Predict the Satisfaction of Saudi Telecommunication Companies’ Customers
The flexibility in mobile communications allows customers to quickly switch from one service provider to
another, making customer churn one of the most critical challenges for the data and voice telecommunication
service industry. In 2019, the percentage of post-paid telecommunication customers in Saudi Arabia
decreased; this represents a great deal of customer dissatisfaction and subsequent corporate fiscal losses.
Many studies correlate customer satisfaction with customer churn. The Telecom companies have depended
on historical customer data to measure customer churn. However, historical data does not reveal current
customer satisfaction or future likeliness to switch between telecom companies. Current methods of analysing
churn rates are inadequate and faced some issues, particularly in the Saudi market.
This research was conducted to realize the relationship between customer satisfaction and customer churn
and how to use social media mining to measure customer satisfaction and predict customer churn.
This research conducted a systematic review to address the churn prediction models problems and their
relation to Arabic Sentiment Analysis. The findings show that the current churn models lack integrating
structural data frameworks with real-time analytics to target customers in real-time. In addition, the findings
show that the specific issues in the existing churn prediction models in Saudi Arabia relate to the Arabic
language itself, its complexity, and lack of resources.
As a result, I have constructed the first gold standard corpus of Saudi tweets related to telecom companies,
comprising 20,000 manually annotated tweets. It has been generated as a dialect sentiment lexicon extracted
from a larger Twitter dataset collected by me to capture text characteristics in social media. I developed a
new ASA prediction model for telecommunication that fills the detected gaps in the ASA literature and fits
the telecommunication field. The proposed model proved its effectiveness for Arabic sentiment analysis and
churn prediction. This is the first work using Twitter mining to predict potential customer loss (churn) in
Saudi telecom companies, which has not been attempted before. Different fields, such as education, have
different features, making applying the proposed model is interesting because it based on text-mining
Churn prediction models tested and evaluated in the Dutch indemnity industry
Due to global developments customer churn is getting a growing concern to the insurance industry. Technological improvements like the internet makes it much easier for customer to compare their policies, obtain new offers or even churn from one provider to another. The insurance industry therefore has become a heavily competitive market in which insurance companies have to compete to protect and expand their customer base in order to maintain or expand their market position. Thus, retaining customers is becoming more and more important and therefore finding customers who are most likely to leave is a central aspect. Many different techniques are available to identify customers who are most likely to leave, however which technique can be used best is often not clear. Research clarifies that the characteristics of the industry and/or dataset which is used are mostly assessing related to performance. In advance it is impossible to determine the best suited technique to use if previous research in which performance was tested has not been published. This study presents a data mining methodology in which the four most used prediction techniques in literature are tested and evaluated using a real life voluminous insurance company dataset to determine which technique performs best. Using the same dataset makes results comparable and clears out which technique performs best based on the insurance data domain characteristics
A Comprehensive Survey on Enterprise Financial Risk Analysis: Problems, Methods, Spotlights and Applications
Enterprise financial risk analysis aims at predicting the enterprises' future
financial risk.Due to the wide application, enterprise financial risk analysis
has always been a core research issue in finance. Although there are already
some valuable and impressive surveys on risk management, these surveys
introduce approaches in a relatively isolated way and lack the recent advances
in enterprise financial risk analysis. Due to the rapid expansion of the
enterprise financial risk analysis, especially from the computer science and
big data perspective, it is both necessary and challenging to comprehensively
review the relevant studies. This survey attempts to connect and systematize
the existing enterprise financial risk researches, as well as to summarize and
interpret the mechanisms and the strategies of enterprise financial risk
analysis in a comprehensive way, which may help readers have a better
understanding of the current research status and ideas. This paper provides a
systematic literature review of over 300 articles published on enterprise risk
analysis modelling over a 50-year period, 1968 to 2022. We first introduce the
formal definition of enterprise risk as well as the related concepts. Then, we
categorized the representative works in terms of risk type and summarized the
three aspects of risk analysis. Finally, we compared the analysis methods used
to model the enterprise financial risk. Our goal is to clarify current
cutting-edge research and its possible future directions to model enterprise
risk, aiming to fully understand the mechanisms of enterprise risk
communication and influence and its application on corporate governance,
financial institution and government regulation
Data mining using neural networks
Data mining is about the search for relationships and global patterns in large databases that are increasing in size. Data mining is beneficial for anyone who has a huge amount of data, for example, customer and business data, transaction, marketing, financial, manufacturing and web data etc. The results of data mining are also referred to as knowledge in the form of rules, regularities and constraints. Rule mining is one of the popular data mining methods since rules provide concise statements of potentially important information that is easily understood by end users and also actionable patterns. At present rule mining has received a good deal of attention and enthusiasm from data mining researchers since rule mining is capable of solving many data mining problems such as classification, association, customer profiling, summarization, segmentation and many others. This thesis makes several contributions by proposing rule mining methods using genetic algorithms and neural networks. The thesis first proposes rule mining methods using a genetic algorithm. These methods are based on an integrated framework but capable of mining three major classes of rules. Moreover, the rule mining processes in these methods are controlled by tuning of two data mining measures such as support and confidence. The thesis shows how to build data mining predictive models using the resultant rules of the proposed methods. Another key contribution of the thesis is the proposal of rule mining methods using supervised neural networks. The thesis mathematically analyses the Widrow-Hoff learning algorithm of a single-layered neural network, which results in a foundation for rule mining algorithms using single-layered neural networks. Three rule mining algorithms using single-layered neural networks are proposed for the three major classes of rules on the basis of the proposed theorems. The thesis also looks at the problem of rule mining where user guidance is absent. The thesis proposes a guided rule mining system to overcome this problem. The thesis extends this work further by comparing the performance of the algorithm used in the proposed guided rule mining system with Apriori data mining algorithm. Finally, the thesis studies the Kohonen self-organization map as an unsupervised neural network for rule mining algorithms. Two approaches are adopted based on the way of self-organization maps applied in rule mining models. In the first approach, self-organization map is used for clustering, which provides class information to the rule mining process. In the second approach, automated rule mining takes the place of trained neurons as it grows in a hierarchical structure
Cost-Sensitive Learning-based Methods for Imbalanced Classification Problems with Applications
Analysis and predictive modeling of massive datasets is an extremely significant problem that arises in many practical applications. The task of predictive modeling becomes even more challenging when data are imperfect or uncertain. The real data are frequently affected by outliers, uncertain labels, and uneven distribution of classes (imbalanced data). Such uncertainties create bias and make predictive modeling an even more difficult task. In the present work, we introduce a cost-sensitive learning method (CSL) to deal with the classification of imperfect data. Typically, most traditional approaches for classification demonstrate poor performance in an environment with imperfect data. We propose the use of CSL with Support Vector Machine, which is a well-known data mining algorithm. The results reveal that the proposed algorithm produces more accurate classifiers and is more robust with respect to imperfect data. Furthermore, we explore the best performance measures to tackle imperfect data along with addressing real problems in quality control and business analytics
Risk prediction of product-harm events using rough sets and multiple classifier fusion:an experimental study of listed companies in China
With the increasing of frequency and destructiveness of product-harm events, study on enterprise crisis management becomes essentially important, but little literature thoroughly explores the risk prediction method of product-harm event. In this study, an initial index system for risk prediction was built based on the analysis of the key drivers of the product-harm event's evolution; ultimately, nine risk-forecasting indexes were obtained using rough set attribute reduction. With the four indexes of cumulative abnormal returns as the input, fuzzy clustering was used to classify the risk level of a product-harm event into four grades. In order to control the uncertainty and instability of single classifiers in risk prediction, multiple classifier fusion was introduced and combined with self-organising data mining (SODM). Further, an SODM-based multiple classifier fusion (SB-MCF) model was presented for the risk prediction related to a product-harm event. The experimental results based on 165 Chinese listed companies indicated that the SB-MCF model improved the average predictive accuracy and reduced variation degree simultaneously. The statistical analysis demonstrated that the SB-MCF model significantly outperformed six widely used single classification models (e.g. neural networks, support vector machine, and case-based reasoning) and other six commonly used multiple classifier fusion methods (e.g. majority voting, Bayesian method, and genetic algorithm)
Feature selection in credit risk modeling: an international evidence
This paper aims to discover a suitable combination of contemporary feature selection techniques and robust prediction classifiers.
As such, to examine the impact of the feature selection method
on classifier performance, we use two Chinese and three other
real-world credit scoring datasets. The utilized feature selection
methods are the least absolute shrinkage and selection operator
(LASSO), multivariate adaptive regression splines (MARS). In contrast, the examined classifiers are the classification and regression
trees (CART), logistic regression (LR), artificial neural network
(ANN), and support vector machines (SVM). Empirical findings
confirm that LASSO’s feature selection method, followed by
robust classifier SVM, demonstrates remarkable improvement and
outperforms other competitive classifiers. Moreover, ANN also
offers improved accuracy with feature selection methods; LR only
can improve classification efficiency through performing feature
selection via LASSO. Nonetheless, CART does not provide any
indication of improvement in any combination. The proposed
credit scoring modeling strategy may use to develop policy, progressive ideas, operational guidelines for effective credit risk management of lending, and other financial institutions. The finding
of this study has practical value, as to date, there is no consensus
about the combination of feature selection method and prediction classifiers
Personality Identification from Social Media Using Deep Learning: A Review
Social media helps in sharing of ideas and information among people scattered around the world and thus helps in creating communities, groups, and virtual networks. Identification of personality is significant in many types of applications such as in detecting the mental state or character of a person, predicting job satisfaction, professional and personal relationship success, in recommendation systems. Personality is also an important factor to determine individual variation in thoughts, feelings, and conduct systems. According to the survey of Global social media research in 2018, approximately 3.196 billion social media users are in worldwide. The numbers are estimated to grow rapidly further with the use of mobile smart devices and advancement in technology. Support vector machine (SVM), Naive Bayes (NB), Multilayer perceptron neural network, and convolutional neural network (CNN) are some of the machine learning techniques used for personality identification in the literature review. This paper presents various studies conducted in identifying the personality of social media users with the help of machine learning approaches and the recent studies that targeted to predict the personality of online social media (OSM) users are reviewed
An Augmented Artificial Intelligence Approach for Chronic Diseases Prediction
Chronic diseases are increasing in prevalence and mortality worldwide. Early diagnosis has therefore become an important research area to enhance patient survival rates. Several research studies have reported classification approaches for specific disease prediction. In this paper, we propose a novel augmented artificial intelligence approach using an artificial neural network (ANN) with particle swarm optimization (PSO) to predict five prevalent chronic diseases including breast cancer, diabetes, heart attack, hepatitis, and kidney disease. Seven classification algorithms are compared to evaluate the proposed model's prediction performance. The ANN prediction model constructed with a PSO based feature extraction approach outperforms other state-of-the-art classification approaches when evaluated with accuracy. Our proposed approach gave the highest accuracy of 99.67%, with the PSO. However, the classification model's performance is found to depend on the attributes of data used for classification. Our results are compared with various chronic disease datasets and shown to outperform other benchmark approaches. In addition, our optimized ANN processing is shown to require less time compared to random forest (RF), deep learning and support vector machine (SVM) based methods. Our study could play a role for early diagnosis of chronic diseases in hospitals, including through development of online diagnosis systems
- …