406 research outputs found

    Analytics-as-a-service, Automated analytics, Data analytics, Experimental study, Novices

    Get PDF
    Generating insights and value from data hasbecome an important asset for organizations. At the sametime, the need for experts in analytics is increasing and thenumber of analytics applications is growing. Recently, a newtrend has emerged, i.e. analytics-as-a-service platforms, thatmakes it easier to apply analytics both for novice and expertusers.Inthisstudy,theauthorsapproachthesenew servicesbyconducting a full-factorial experiment where both inexperi-enced and experienced users take on an analytics task with ananalytics-as-a-service technology. The research proves thatalthough experts in analytics still significantly outperformnovices, these web-based platforms do offer an advantage toinexperienced users. Furthermore, the authors find that ana-lytics-as-a-service does not offer the same benefits acrossdifferent analytics tasks. That is, they observe better perfor-mance for supervised analytics tasks. Moreover, this studyindicates that there are significant differences between novi-ces. The most important distinction lies in the approach theytake on the task. Novices who follow a more complex,although structured, workflow behave more similarly toexperts and, thus, also perform better. The findings can aidmanagers in their hiring and training strategy with regards toboth business users and data scientists. Moreover, it can guidemanagers in the development of an enterprise-wide analyticsculture. Finally, the results can inform vendors about thedesign and development of these platforms

    ITER: An algorithm for predictive regression rule extraction. Data warehousing and knowledge discovery. Proceedings.

    Get PDF
    Various benchmarking studies have shown that artificial neural networks and support vector machines have a superior performance when compared to more traditional machine learning techniques. The main resistance against these newer techniques is based on their lack of interpretability: it is difficult for the human analyst to understand the motivation behind these models' decisions. Various rule extraction techniques have been proposed to overcome this opacity restriction. However, most of these extraction techniques are devised for classification and only few algorithms can deal with regression problems.

    Modeling customer loyalty using customer lifetime value.

    Get PDF
    The definition and modeling of customer loyalty have been central issues in customer relationship management since many years. Recent papers propose solutions to detect customers that are becoming less loyal, also called churners. The churner status is then defined as a function of the volume of commercial transactions. In the context of a Belgian retail financial service company, our first contribution will be to redefine the notion of customer's loyalty by considering it from a customer-centric point-of-view instead of a product-centric point-of-view. We will hereby use the customer lifetime value (CLV) defined as the discounted value of future marginal earnings, based on the customer's activity. Hence, a churner will be defined as someone whose CLV, thus the related marginal profit, is decreasing. As a second contribution, the loss incurred by the CLV decrease will be used to appraise the cost to misclassify a customer by introducing a new loss function. In the empirical study, we will compare the accuracy of various classification techniques commonly used in the domain of churn prediction, including two cost-sensitive classirfiers. Our final conclusion is that since profit is what really matters in a commercial environment, standard statistical accuracy measures or prediction need to be revised and a more profit oriented focus may be desirable.Churn prediction; Classification; Customer lifetime value; Prediction models;

    Post-processing of association rules.

    Get PDF
    In this paper, we situate and motivate the need for a post-processing phase to the association rule mining algorithm when plugged into the knowledge discovery in databases process. Major research effort has already been devoted to optimising the initially proposed mining algorithms. When it comes to effectively extrapolating the most interesting knowledge nuggets from the standard output of these algorithms, one is faced with an extreme challenge, since it is not uncommon to be confronted with a vast amount of association rules after running the algorithms. The sheer multitude of generated rules often clouds the perception of the interpreters. Rightful assessment of the usefulness of the generated output introduces the need to effectively deal with different forms of data redundancy and data being plainly uninteresting. In order to do so, we will give a tentative overview of some of the main post-processing tasks, taking into account the efforts that have already been reported in the literature.

    Using rule extraction to improve the comprehensibility of predictive models.

    Get PDF
    Whereas newer machine learning techniques, like artifficial neural net-works and support vector machines, have shown superior performance in various benchmarking studies, the application of these techniques remains largely restricted to research environments. A more widespread adoption of these techniques is foiled by their lack of explanation capability which is required in some application areas, like medical diagnosis or credit scoring. To overcome this restriction, various algorithms have been proposed to extract a meaningful description of the underlying `blackbox' models. These algorithms' dual goal is to mimic the behavior of the black box as closely as possible while at the same time they have to ensure that the extracted description is maximally comprehensible. In this research report, we first develop a formal definition of`rule extraction and comment on the inherent trade-off between accuracy and comprehensibility. Afterwards, we develop a taxonomy by which rule extraction algorithms can be classiffied and discuss some criteria by which these algorithms can be evaluated. Finally, an in-depth review of the most important algorithms is given.This report is concluded by pointing out some general shortcomings of existing techniques and opportunities for future research.Models; Model; Algorithms; Criteria; Opportunities; Research; Learning; Neural networks; Networks; Performance; Benchmarking; Studies; Area; Credit; Credit scoring; Behavior; Time;

    A modified Pareto/NBD approach for predicting customer lifetime value.

    Get PDF
    Valuing customers is a central issue for any commercial activity. The customer lifetime value (CLV) is the discounted value of the future profits that this customer yields to the company. In order to compute the CLV, one needs to predict the future number of transactions a customer will make and the profit of these transactions. With the Pareto/NBD model, the future number of transactions of a customer can be predicted, and the CLV is then computed as a discounted product between this number and the expected profit per transaction. Usually, the number of transactions and the future profits per transaction are estimated separately. This study proposes an alternative. We show that the dependence between the number of transactions and their profitability can be used to increase the accuracy of the prediction of the CLV. This is illustrated with a new empirical case from the retail banking sector.Customer lifetime value; Value; Yield; Companies; Order; Model; Product; Expected;

    A modified Pareto/NBD approach for predicting customer lifetime value.

    Get PDF
    Systems; Applications; Customer lifetime value; Value;

    Modeling churn using customer lifetime value.

    Get PDF
    The definition and modeling of customer loyalty have been central issues in customer relationship management since many years. Recent papers propose solutions to detect customers that are becoming less loyal, also called churners. The churner status is then defined as a function of the volume of commercial transactions. In the context of a Belgian retail financial service company, our first contribution is to redefine the notion of customer loyalty by considering it from a customer-centric viewpoint instead of a productcentric one. We hereby use the customer lifetime value (CLV) defined as the discounted value of future marginal earnings, based on the customer's activity. Hence, a churner is defined as someone whose CLV, thus the related marginal profit, is decreasing. As a second contribution, the loss incurred by the CLV decrease is used to appraise the cost to misclassify a customer by introducing a new loss function. In the empirical study, we compare the accuracy of various classification techniques commonly used in the domain of churn prediction, including two cost-sensitive classifiers. Our final conclusion is that since profit is what really matters in a commercial environment, standard statistical accuracy measures for prediction need to be revised and a more profit oriented focus may be desirable.Data mining; Decision support systems; Marketing; Churn prediction;

    Software defect prediction based on association rule classification.

    Get PDF
    In software defect prediction, predictive models are estimated based on various code attributes to assess the likelihood of software modules containing errors. Many classification methods have been suggested to accomplish this task. However, association based classification methods have not been investigated so far in this context. This paper assesses the use of such a classification method, CBA2, and compares it to other rule based classification methods. Furthermore, we investigate whether rule sets generated on data from one software project can be used to predict defective software modules in other, similar software projects. It is found that applying the CBA2 algorithm results in both accurate and comprehensible rule sets.Software defect prediction; Association rule classification; CBA2; AUC;
    corecore