6,348 research outputs found
The Rule Extraction of Numerical Association Rule Mining Using Hybrid Evolutionary Algorithm
The topic of Particle Swarm Optimization (PSO) has recently gained popularity. Researchers has used it to solve difficulties related to job scheduling, evaluation of stock markets and association rule mining optimization. However, the PSO method often encounters the problem of getting trapped in the local optimum. Some researchers proposed a solution to over come that problem using combination of PSO and Cauchy distribution because this performance proved to reach the optimal rules. In this paper, we focus to adopt the combination for solving association rule mining (ARM) optimization problem in numerical dataset. Therefore, the aim of this research is to extract the rule of numerical ARM optimization problem for certain multi-objective functions such as support, confidence, and amplitude. This method is called PARCD. It means that PSO for numerical association rule mining problem with Cauchy Distribu- tion. PARCD performed better results than other methods such as MOPAR, MODENAR, GAR, MOGAR and RPSOA
Improved optimization of numerical association rule mining using hybrid particle swarm optimization and cauchy distribution
Particle Swarm Optimization (PSO) has been applied to solve optimization problems in various fields, such as Association Rule Mining (ARM) of numerical problems. However, PSO often becomes trapped in local optima. Consequently, the results do not represent the overall optimum solutions. To address this limitation, this study aims to combine PSO with the Cauchy distribution (PARCD), which is expected to increase the global optimal value of the expanded search space. Furthermore, this study uses multiple objective functions, i.e., support, confidence, comprehensibility, interestingness and amplitude. In addition, the proposed method was evaluated using benchmark datasets, such as the Quake, Basket ball, Body fat, Pollution, and Bolt datasets. Evaluation results were compared to the results obtained by previous studies. The results indicate that the overall values of the objective functions obtained using the proposed PARCD approach are satisfactory
Improving a multi-objective evolutionary algorithm to discover quantitative association rules
This work aims at correcting flaws existing in multi-objective evolutionary
schemes to discover quantitative association rules, specifically those based on the wellknown
non-dominated sorting genetic algorithm-II (NSGA-II). In particular, a
methodology is proposed to find the most suitable configurations based on the set of
objectives to optimize and distance measures to rank the non-dominated solutions. First,
several quality measures are analyzed to select the best set of them to be optimized.
Furthermore, different strate-gies are applied to replace the crowding distance used by
NSGA-II to sort the solutions for each Pareto-front since such distance is not suitable for
handling many-objective problems. The proposed enhancements have been integrated into
the multi-objective algorithm called MOQAR. Several experiments have been carried out
to assess the algorithm’s performance by using different configuration settings, and the best
ones have been compared to other existing algorithms. The results obtained show a
remarkable performance of MOQAR in terms of quality measures.Ministerio de Ciencia y Tecnología TIN2011-28956-C02Ministerio de Ciencia y Tecnología TIN2014- 55894-C2-RJunta de Andalucia P12-TIC-1728Universidad Pablo de Olavide APPB81309
Inferring Gene-Gene Associations from Quantitative Association Rules
The microarray technique is able to monitor the
change in concentration of RNA in thousands of genes simultaneously.
The interest in this technique has grown exponentially
in recent years and the difficulties in analyzing data from
such experiments, which are characterized by the high number
of genes to be analyzed in relation to the low number of
experiments or samples available. Microarray experiments
are generating datasets that can help in reconstructing gene
networks. One of the most important problems in network
reconstruction is finding, for each gene in the network, which
genes can affect it and how. Association Rules are an approach
of unsupervised learning to relate attributes to each other.
In this work we use Quantitative Association Rules in order
to define interrelations between genes. These rules work with
intervals on the attributes, without discretizing the data before
and they are generated by a multi-objective evolutionary
algorithm. In most cases the extracted rules confirm the existing
knowledge about cell-cycle gene expression, while hitherto
unknown relationships can be treated as new hypotheses.Ministerio de Ciencia y Tecnología TIN2007-68084-C-00Junta de Andalucía P07-TIC-0261
Penguins Search Optimisation Algorithm for Association Rules Mining
Association Rules Mining (ARM) is one of the most popular and well-known approaches for the decision-making process. All existing ARM algorithms are time consuming and generate a very large number of association rules with high overlapping. To deal with this issue, we propose a new ARM approach based on penguins search optimisation algorithm (Pe-ARM for short). Moreover, an efficient measure is incorporated into the main process to evaluate the amount of overlapping among the generated rules. The proposed approach also ensures a good diversification over the whole solutions space. To demonstrate the effectiveness of the proposed approach, several experiments have been carried out on different datasets and specifically on the biological ones. The results reveal that the proposed approach outperforms the well-known ARM algorithms in both execution time and solution quality
Automatic Finding Trapezoidal Membership Functions in Mining Fuzzy Association Rules Based on Learning Automata
Association rule mining is an important data mining technique used for discovering relationships among all data items. Membership functions have a significant impact on the outcome of the mining association rules. An important challenge in fuzzy association rule mining is finding an appropriate membership functions, which is an optimization issue. In the most relevant studies of fuzzy association rule mining, only triangle membership functions are considered. This study, as the first attempt, used a team of continuous action-set learning automata (CALA) to find both the appropriate number and positions of trapezoidal membership functions (TMFs). The spreads and centers of the TMFs were taken into account as parameters for the research space and a new approach for the establishment of a CALA team to optimize these parameters was introduced. Additionally, to increase the convergence speed of the proposed approach and remove bad shapes of membership functions, a new heuristic approach has been proposed. Experiments on two real data sets showed that the proposed algorithm improves the efficiency of the extracted rules by finding optimized membership functions
Enhancing predictive crime mapping model using association rule mining for geographical and demographic structure
This research project is to enhanced predictive crime mapping model with data mining technique to predict the possible rate of crime occurrence. Few specific objectives are stated in order to achieve the aim of this research project. This project proposed a data mining technique called Association Rule Mining. Basically Association Rule Mining is to investigate the rules according to the predefined parameter. This technique considered useful if it can satisfy both minimum confidence and support. Apriori is a popular algorithm in finding frequent set of items in data and association rule. Dataset of Communities and Crime from UCI Machine Learning Repository is used in order to setup the experiment. 60% of the dataset is used for training to generate association rules by using WEKA. The association rules generated shows the prediction of the rate of crime occurrence. The other 40% of the dataset is used to test generated rules. A simple program of C++ is implemented using Microsoft Visual Studio to test generated rules until accuracy of performance is obtained. At the end of the project, generated rules tested and come out with difference accuracy according to predefined minimum support
- …