1,070 research outputs found

    Expanding the Knowledge Base for More Effective Data Mining

    Get PDF
    Traditionally, data mining, as part of the knowledge discovery process, relies solely on the information contained in the database to generate patterns. Recently, there has been some recognition in the field that expanding the knowledge passed to the pattern generation phase by including other domain knowledge, may have beneficial effects of the interestingness and actionability of the resulting patterns. In this paper, we present a new knowledge discovery method that uses additional decision rules and the analytic hierarchy process (AHP) to conceptualize and structure the domain, thus capturing a broader notion of domain knowledge upon which data mining can be applied. Based on design science guidelines, we design, develop and implement our method within the domain of a brain trauma intensive care unit

    Data Driven Data Mining to Domain Driven Data Mining

    Get PDF
    In the preceding decade data mining has came into sight as one of the largely energetic areas in information technology Traditional data mining is seriously dependent on data itself and relies on data oriented methodologies So there is a universal necessity in bridging the space among academia and trade is to provide all-purpose domain-related matters in surrounding real-life applications Domain-Driven Data Mining try to build up general principles methodologies and techniques for modelling and reconciling wide-ranging domain-related factors and synthesized ubiquitous intelligence adjacent problem domains with the data mining course of action and discovering knowledge to hold up business decision-makin

    Modeling interestingness of streaming association rules as a benefit-maximizing classification problem

    Get PDF
    Cataloged from PDF version of article.In a typical application of association rule learning from market basket data, a set of transactions for a fixed period of time is used as input to rule learning algorithms. For example, the well-known Apriori algorithm can be applied to learn a set of association rules from such a transaction set. However, learning association rules from a set of transactions is not a one time only process. For example, a market manager may perform the association rule learning process once every month over the set of transactions collected through the last month. For this reason, we will consider the problem where transaction sets are input to the system as a stream of packages. The sets of transactions may come in varying sizes and in varying periods. Once a set of transactions arrive, the association rule learning algorithm is executed on the last set of transactions, resulting in new association rules. Therefore, the set of association rules learned will accumulate and increase in number over time, making the mining of interesting ones out of this enlarging set of association rules impractical for human experts. We refer to this sequence of rules as "association rule set stream" or "streaming association rules" and the main motivation behind this research is to develop a technique to overcome the interesting rule selection problem. A successful association rule mining system should select and present only the interesting rules to the domain experts. However, definition of interestingness of association rules on a given domain usually differs from one expert to another and also over time for a given expert. This paper proposes a post-processing method to learn a subjective model for the interestingness concept description of the streaming association rules. The uniqueness of the proposed method is its ability to formulate the interestingness issue of association rules as a benefit-maximizing classification problem and obtain a different interestingness model for each user. In this new classification scheme, the determining features are the selective objective interestingness factors related to the interestingness of the association rules, and the target feature is the interestingness label of those rules. The proposed method works incrementally and employs user interactivity at a certain level. It is evaluated on a real market dataset. The results show that the model can successfully select the interesting ones. (C) 2008 Elsevier B.V. All rights reserved

    Data Mining as Support to Knowledge Management in Marketing

    Get PDF
    Background: Previous research has shown success of data mining methods in marketing. However, their integration in a knowledge management system is still not investigated enough. Objectives: The purpose of this paper is to suggest an integration of two data mining techniques: neural networks and association rules in marketing modeling that could serve as an input to knowledge management and produce better marketing decisions. Methods/Approach: Association rules and artificial neural networks are combined in a data mining component to discover patterns and customers\u27 profiles in frequent item purchases. The results of data mining are used in a web-based knowledge management component to trigger ideas for new marketing strategies. The model is tested by an experimental research. Results: The results show that the suggested model could be efficiently used to recognize patterns in shopping behaviour and generate new marketing strategies. Conclusions: The scientific contribution lies in proposing an integrative data mining approach that could present support to knowledge management. The research could be useful to marketing and retail managers in improving the process of their decision making, as well as to researchers in the area of marketing modelling. Future studies should include more samples and other data mining techniques in order to test the model generalization ability

    CAPRE: A New Methodology for Product Recommendation Based on Customer Actionability and Profitability

    Get PDF
    International audienceRecommender systems can apply knowledge discovery techniques to the problem of making product recommendations. This aims to establish a customer loyalty strategy and thus to optimize the customer life time value. In this paper we propose CAPRE, a data-mining based methodology for recommender systems based on the analysis of turnover for customers of specific products. Contrary to classical recommender systems, CAPRE does not aspire to predict a customer's behavior but to influence that behavior. By measuring the actionability and profitability of customers, we have the ability to focus on customers that can afford to spend larger sums of money in the target business. CAPRE aggregates rules to extract characteristic purchasing behaviors, and then analyzes the counter-examples to detect the most actionable and profitable customers. We measure the effectiveness of CAPRE by performing a cross-validation on the MovieLens benchmark. The methodology is applied to over 10,000 individual customers and 100,000 products for the customer relationship management of VM Matériaux company, thus assisting the salespersons' objective to increase the customer value

    Interpretable Predictions of Tree-based Ensembles via Actionable Feature Tweaking

    Full text link
    Machine-learned models are often described as "black boxes". In many real-world applications however, models may have to sacrifice predictive power in favour of human-interpretability. When this is the case, feature engineering becomes a crucial task, which requires significant and time-consuming human effort. Whilst some features are inherently static, representing properties that cannot be influenced (e.g., the age of an individual), others capture characteristics that could be adjusted (e.g., the daily amount of carbohydrates taken). Nonetheless, once a model is learned from the data, each prediction it makes on new instances is irreversible - assuming every instance to be a static point located in the chosen feature space. There are many circumstances however where it is important to understand (i) why a model outputs a certain prediction on a given instance, (ii) which adjustable features of that instance should be modified, and finally (iii) how to alter such a prediction when the mutated instance is input back to the model. In this paper, we present a technique that exploits the internals of a tree-based ensemble classifier to offer recommendations for transforming true negative instances into positively predicted ones. We demonstrate the validity of our approach using an online advertising application. First, we design a Random Forest classifier that effectively separates between two types of ads: low (negative) and high (positive) quality ads (instances). Then, we introduce an algorithm that provides recommendations that aim to transform a low quality ad (negative instance) into a high quality one (positive instance). Finally, we evaluate our approach on a subset of the active inventory of a large ad network, Yahoo Gemini.Comment: 10 pages, KDD 201

    Marketing Insight: The Construct, Antecedents, Implications, and Empirical Testing

    Get PDF
    While firms’ data are exponentially growing, the level of marketing insight within firms is not. Insight is becoming a buzzword and dissipating its value due to the lack of conceptual understanding. This research develops and tests a marketing insight nomological network to answer how firms can generate marketing insights and what are the consequences of managing marketing insights. The research findings are relevant for the literature because (1) define the term theoretical domain, (2) lead companies to increase their chances to generate marketing insights and (3) establish the activities to improve the positive financial effect of marketing insight generation
    corecore