2,820 research outputs found

    Web Usage Mining with Evolutionary Extraction of Temporal Fuzzy Association Rules

    Get PDF
    In Web usage mining, fuzzy association rules that have a temporal property can provide useful knowledge about when associations occur. However, there is a problem with traditional temporal fuzzy association rule mining algorithms. Some rules occur at the intersection of fuzzy sets' boundaries where there is less support (lower membership), so the rules are lost. A genetic algorithm (GA)-based solution is described that uses the flexible nature of the 2-tuple linguistic representation to discover rules that occur at the intersection of fuzzy set boundaries. The GA-based approach is enhanced from previous work by including a graph representation and an improved fitness function. A comparison of the GA-based approach with a traditional approach on real-world Web log data discovered rules that were lost with the traditional approach. The GA-based approach is recommended as complementary to existing algorithms, because it discovers extra rules. (C) 2013 Elsevier B.V. All rights reserved

    An initial state of design and development of intelligent knowledge discovery system for stock exchange database

    Get PDF
    Data mining is a challenging matter in research field for the last few years.Researchers are using different techniques in data mining.This paper discussed the initial state of Design and Development Intelligent Knowledge Discovery System for Stock Exchange (SE) Databases. We divide our problem in two modules.In first module we define Fuzzy Rule Base System to determined vague information in stock exchange databases.After normalizing massive amount of data we will apply our proposed approach, Mining Frequent Patterns with Neural Networks.Future prediction (e.g., political condition, corporation factors, macro economy factors, and psychological factors of investors) perform an important rule in Stock Exchange, so in our prediction model we will be able to predict results more precisely.In second module we will generate clustering algorithm. Generally our clustering algorithm consists of two steps including training and running steps.The training step is conducted for generating the neural network knowledge based on clustering.In running step, neural network knowledge based is used for supporting the Module in order to generate learned complete data, transformed data and interesting clusters that will help to generate interesting rules

    Fuzzy-Granular Based Data Mining for Effective Decision Support in Biomedical Applications

    Get PDF
    Due to complexity of biomedical problems, adaptive and intelligent knowledge discovery and data mining systems are highly needed to help humans to understand the inherent mechanism of diseases. For biomedical classification problems, typically it is impossible to build a perfect classifier with 100% prediction accuracy. Hence a more realistic target is to build an effective Decision Support System (DSS). In this dissertation, a novel adaptive Fuzzy Association Rules (FARs) mining algorithm, named FARM-DS, is proposed to build such a DSS for binary classification problems in the biomedical domain. Empirical studies show that FARM-DS is competitive to state-of-the-art classifiers in terms of prediction accuracy. More importantly, FARs can provide strong decision support on disease diagnoses due to their easy interpretability. This dissertation also proposes a fuzzy-granular method to select informative and discriminative genes from huge microarray gene expression data. With fuzzy granulation, information loss in the process of gene selection is decreased. As a result, more informative genes for cancer classification are selected and more accurate classifiers can be modeled. Empirical studies show that the proposed method is more accurate than traditional algorithms for cancer classification. And hence we expect that genes being selected can be more helpful for further biological studies

    Information fusion from multiple databases using meta-association rules

    Get PDF
    Nowadays, data volume, distribution, and volatility make it difficult to search global patterns by applying traditional Data Mining techniques. In the case of data in a distributed environment, sometimes a local analysis of each dataset separately is adequate but some other times a global decision is needed by the analysis of the entire data. Association rules discovering methods typically require a single uniform dataset and managing with the entire set of distributed data is not possible due to its size. To address the scenarios in which satisfying this requirement is not practical or even feasible, we propose a new method for fusing information, in the form of rules, extracted from multiple datasets. The proposed model produces meta-association rules, i.e. rules in which the antecedent or the consequent may contain rules as well, for finding joint correlations among trends found individually in each dataset. In this paper, we describe the formulation and the implementation of two alternative frameworks that obtain, respectively, crisp meta-rules and fuzzy meta-rules. We compare our proposal with the information obtained when the datasets are not separated, in order to see the main differences between traditional association rules and meta-association rules. We also compare crisp and fuzzy methods for meta-association rule mining, observing that the fuzzy approach offers several advantages: it is more accurate since it incorporates the strength or validity of the previous information, produces a more manageable set of rules for human inspection, and allows the incorporation of contextual information to the mining process expressed in a more human-friendly format

    A Spatio-Temporal Framework for Managing Archeological Data

    Get PDF
    Space and time are two important characteristics of data in many domains. This is particularly true in the archaeological context where informa- tion concerning the discovery location of objects allows one to derive important relations between findings of a specific survey or even of different surveys, and time aspects extend from the excavation time, to the dating of archaeological objects. In recent years, several attempts have been performed to develop a spatio-temporal information system tailored for archaeological data. The first aim of this paper is to propose a model, called Star, for repre- senting spatio-temporal data in archaeology. In particular, since in this domain dates are often subjective, estimated and imprecise, Star has to incorporate such vague representation by using fuzzy dates and fuzzy relationships among them. Moreover, besides to the topological relations, another kind of spatial relations is particularly useful in archeology: the stratigraphic ones. There- fore, this paper defines a set of rules for deriving temporal knowledge from the topological and stratigraphic relations existing between two findings. Finally, considering the process through which objects are usually manually dated by archeologists, some existing automatic reasoning techniques may be success- fully applied to guide such process. For this purpose, the last contribution regards the translation of archaeological temporal data into a Fuzzy Temporal Constraint Network for checking the overall data consistency and reducing the vagueness of some dates based on their relationships with other ones

    Fuzzy association rules for biological data analysis: A case study on yeast

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Last years' mapping of diverse genomes has generated huge amounts of biological data which are currently dispersed through many databases. Integration of the information available in the various databases is required to unveil possible associations relating already known data. Biological data are often imprecise and noisy. Fuzzy set theory is specially suitable to model imprecise data while association rules are very appropriate to integrate heterogeneous data.</p> <p>Results</p> <p>In this work we propose a novel fuzzy methodology based on a fuzzy association rule mining method for biological knowledge extraction. We apply this methodology over a yeast genome dataset containing heterogeneous information regarding structural and functional genome features. A number of association rules have been found, many of them agreeing with previous research in the area. In addition, a comparison between crisp and fuzzy results proves the fuzzy associations to be more reliable than crisp ones.</p> <p>Conclusion</p> <p>An integrative approach as the one carried out in this work can unveil significant knowledge which is currently hidden and dispersed through the existing biological databases. It is shown that fuzzy association rules can model this knowledge in an intuitive way by using linguistic labels and few easy-understandable parameters.</p

    Mining fuzzy association rules in large databases with quantitative attributes.

    Get PDF
    by Kuok, Chan Man.Thesis (M.Phil.)--Chinese University of Hong Kong, 1997.Includes bibliographical references (leaves 74-77).Abstract --- p.iAcknowledgments --- p.iiiChapter 1 --- Introduction --- p.1Chapter 1.1 --- Data Mining --- p.2Chapter 1.2 --- Association Rule Mining --- p.3Chapter 2 --- Background --- p.6Chapter 2.1 --- Framework of Association Rule Mining --- p.6Chapter 2.1.1 --- Large Itemsets --- p.6Chapter 2.1.2 --- Association Rules --- p.8Chapter 2.2 --- Association Rule Algorithms For Binary Attributes --- p.11Chapter 2.2.1 --- AIS --- p.12Chapter 2.2.2 --- SETM --- p.13Chapter 2.2.3 --- "Apriori, AprioriTid and AprioriHybrid" --- p.15Chapter 2.2.4 --- PARTITION --- p.18Chapter 2.3 --- Association Rule Algorithms For Numeric Attributes --- p.20Chapter 2.3.1 --- Quantitative Association Rules --- p.20Chapter 2.3.2 --- Optimized Association Rules --- p.23Chapter 3 --- Problem Definition --- p.25Chapter 3.1 --- Handling Quantitative Attributes --- p.25Chapter 3.1.1 --- Discrete intervals --- p.26Chapter 3.1.2 --- Overlapped intervals --- p.27Chapter 3.1.3 --- Fuzzy sets --- p.28Chapter 3.2 --- Fuzzy association rule --- p.31Chapter 3.3 --- Significance factor --- p.32Chapter 3.4 --- Certainty factor --- p.36Chapter 3.4.1 --- Using significance --- p.37Chapter 3.4.2 --- Using correlation --- p.38Chapter 3.4.3 --- Significance vs. Correlation --- p.42Chapter 4 --- Steps For Mining Fuzzy Association Rules --- p.43Chapter 4.1 --- Candidate itemsets generation --- p.44Chapter 4.1.1 --- Candidate 1-Itemsets --- p.45Chapter 4.1.2 --- Candidate k-Itemsets (k > 1) --- p.47Chapter 4.2 --- Large itemsets generation --- p.48Chapter 4.3 --- Fuzzy association rules generation --- p.49Chapter 5 --- Experimental Results --- p.51Chapter 5.1 --- Experiment One --- p.51Chapter 5.2 --- Experiment Two --- p.53Chapter 5.3 --- Experiment Three --- p.54Chapter 5.4 --- Experiment Four --- p.56Chapter 5.5 --- Experiment Five --- p.58Chapter 5.5.1 --- Number of Itemsets --- p.58Chapter 5.5.2 --- Number of Rules --- p.60Chapter 5.6 --- Experiment Six --- p.61Chapter 5.6.1 --- Varying Significance Threshold --- p.62Chapter 5.6.2 --- Varying Membership Threshold --- p.62Chapter 5.6.3 --- Varying Confidence Threshold --- p.63Chapter 6 --- Discussions --- p.65Chapter 6.1 --- User guidance --- p.65Chapter 6.2 --- Rule understanding --- p.67Chapter 6.3 --- Number of rules --- p.68Chapter 7 --- Conclusions and Future Works --- p.70Bibliography --- p.7

    ARM-AMO: An Efficient Association Rule Mining Algorithm Based on Animal Migration Optimization

    Get PDF
    The file attached to this record is the author's final peer reviewed version. The Publisher's final version can be found by following the DOI linkAssociation rule mining (ARM) aims to find out association rules that satisfy predefined minimum support and confidence from a given database. However, in many cases ARM generates extremely large number of association rules, which are impossible for end users to comprehend or validate, thereby limiting the usefulness of data mining results. In this paper, we propose a new mining algorithm based on Animal Migration Optimization (AMO), called ARM-AMO, to reduce the number of association rules. It is based on the idea that rules which are not of high support and unnecessary are deleted from the data. Firstly, Apriori algorithm is applied to generate frequent itemsets and association rules. Then, AMO is used to reduce the number of association rules with a new fitness function that incorporates frequent rules. It is observed from the experiments that, in comparison with the other relevant techniques, ARM-AMO greatly reduces the computational time for frequent item set generation, memory for association rule generation, and the number of rules generated

    ARM-AMO: An Efficient Association Rule Mining Algorithm Based on Animal Migration Optimization

    Get PDF
    The file attached to this record is the author's final peer reviewed version. The Publisher's final version can be found by following the DOI linkAssociation rule mining (ARM) aims to find out association rules that satisfy predefined minimum support and confidence from a given database. However, in many cases ARM generates extremely large number of association rules, which are impossible for end users to comprehend or validate, thereby limiting the usefulness of data mining results. In this paper, we propose a new mining algorithm based on Animal Migration Optimization (AMO), called ARM-AMO, to reduce the number of association rules. It is based on the idea that rules which are not of high support and unnecessary are deleted from the data. Firstly, Apriori algorithm is applied to generate frequent itemsets and association rules. Then, AMO is used to reduce the number of association rules with a new fitness function that incorporates frequent rules. It is observed from the experiments that, in comparison with the other relevant techniques, ARM-AMO greatly reduces the computational time for frequent item set generation, memory for association rule generation, and the number of rules generated
    corecore