214,920 research outputs found

    Principal Component Analysis Untuk Analisa Pola Tangkapan Ikan Di Indonesia

    Get PDF
    Different kinds of fish in Indonesia is very much known to exist more than 80 species of fish caught in the waters of Indonesia. To find out which type of fish caught necessary analysis of the data pattern catches so as to know what kind of fish are caught. Search pattern or associative relationships of large-scale data that are closely related to data mining. Analysis of the association or the association rule mining is a data mining technique to discover the rules of associative between a combination of items. In the association rule method, there are two processes, namely the process of generating Frequent Itemset and trenching association rules. Frequent Itemset Generation is a process to get itemset interconnected and has a value of association based on the value of support and confidence. The algorithm used to generate the frequent itemset is Apriori Algorithm.Apriori algorithm has a weakness in the appropriate feature extraction that is used to attribute causing rule that formed a research banyak.dalam bebasis applying apriori algorithm principal component analysis to obtain a more optimal rule. After experiments using apriori algorithm with a magnitude Φ = 30, min Support 80% and 80% Confidence min rule formed results totaled 82 rules. While the second experiment was done by using an algorithm based on principal component analysis priori the magnitude Φ = 30, min Support 80% and 80% Confidence min formed results amounted to 12 rules to fully lift the ratio of

    PRINCIPAL COMPONENT ANALYSIS UNTUK ANALISA POLA TANGKAPAN IKAN DI INDONESIA

    Get PDF
    Different kinds of fish in Indonesia is very much known to exist more than 80 species of fish caught in the waters of Indonesia. To find out which type of fish caught necessary analysis of the data pattern catches so as to know what kind of fish are caught. Search pattern or associative relationships of large-scale data that are closely related to data mining. Analysis of the association or the association rule mining is a data mining technique to discover the rules of associative between a combination of items. In the association rule method, there are two processes, namely the process of generating Frequent Itemset and trenching association rules. Frequent Itemset Generation is a process to get itemset interconnected and has a value of association based on the value of support and confidence. The algorithm used to generate the frequent itemset is Apriori Algorithm. Apriori algorithm has a weakness in the appropriate feature extraction that is used to attribute causing rule that formed a research a lot in based applying apriori algorithm principal component analysis to obtain a more optimal rule. After experiments using apriori algorithm with a magnitude Φ = 30, min Support 80% and 80% Confidence min rule formed results totaled 82 rules. While the second experiment was done by using an algorithm based on principal component analysis prior the magnitude Φ = 30, min Support 80% and 80% Confidence min formed results amounted to 12 rules to fully lift the ratio of

    A review of associative classification mining

    Get PDF
    Associative classification mining is a promising approach in data mining that utilizes the association rule discovery techniques to construct classification systems, also known as associative classifiers. In the last few years, a number of associative classification algorithms have been proposed, i.e. CPAR, CMAR, MCAR, MMAC and others. These algorithms employ several different rule discovery, rule ranking, rule pruning, rule prediction and rule evaluation methods. This paper focuses on surveying and comparing the state-of-the-art associative classification techniques with regards to the above criteria. Finally, future directions in associative classification, such as incremental learning and mining low-quality data sets, are also highlighted in this paper

    Evaluation and optimization of frequent association rule based classification

    Get PDF
    Deriving useful and interesting rules from a data mining system is an essential and important task. Problems such as the discovery of random and coincidental patterns or patterns with no significant values, and the generation of a large volume of rules from a database commonly occur. Works on sustaining the interestingness of rules generated by data mining algorithms are actively and constantly being examined and developed. In this paper, a systematic way to evaluate the association rules discovered from frequent itemset mining algorithms, combining common data mining and statistical interestingness measures, and outline an appropriated sequence of usage is presented. The experiments are performed using a number of real-world datasets that represent diverse characteristics of data/items, and detailed evaluation of rule sets is provided. Empirical results show that with a proper combination of data mining and statistical analysis, the framework is capable of eliminating a large number of non-significant, redundant and contradictive rules while preserving relatively valuable high accuracy and coverage rules when used in the classification problem. Moreover, the results reveal the important characteristics of mining frequent itemsets, and the impact of confidence measure for the classification task

    QCBA: Postoptimization of Quantitative Attributes in Classifiers based on Association Rules

    Full text link
    The need to prediscretize numeric attributes before they can be used in association rule learning is a source of inefficiencies in the resulting classifier. This paper describes several new rule tuning steps aiming to recover information lost in the discretization of numeric (quantitative) attributes, and a new rule pruning strategy, which further reduces the size of the classification models. We demonstrate the effectiveness of the proposed methods on postoptimization of models generated by three state-of-the-art association rule classification algorithms: Classification based on Associations (Liu, 1998), Interpretable Decision Sets (Lakkaraju et al, 2016), and Scalable Bayesian Rule Lists (Yang, 2017). Benchmarks on 22 datasets from the UCI repository show that the postoptimized models are consistently smaller -- typically by about 50% -- and have better classification performance on most datasets
    • …
    corecore