590 research outputs found
Automatic Finding Trapezoidal Membership Functions in Mining Fuzzy Association Rules Based on Learning Automata
Association rule mining is an important data mining technique used for discovering relationships among all data items. Membership functions have a significant impact on the outcome of the mining association rules. An important challenge in fuzzy association rule mining is finding an appropriate membership functions, which is an optimization issue. In the most relevant studies of fuzzy association rule mining, only triangle membership functions are considered. This study, as the first attempt, used a team of continuous action-set learning automata (CALA) to find both the appropriate number and positions of trapezoidal membership functions (TMFs). The spreads and centers of the TMFs were taken into account as parameters for the research space and a new approach for the establishment of a CALA team to optimize these parameters was introduced. Additionally, to increase the convergence speed of the proposed approach and remove bad shapes of membership functions, a new heuristic approach has been proposed. Experiments on two real data sets showed that the proposed algorithm improves the efficiency of the extracted rules by finding optimized membership functions
Recommended from our members
A niching memetic algorithm for simultaneous clustering and feature selection
Clustering is inherently a difficult task, and is made even more difficult when the selection of relevant features is also an issue. In this paper we propose an approach for simultaneous clustering and feature selection using a niching memetic algorithm. Our approach (which we call NMA_CFS) makes feature selection an integral part of the global clustering search procedure and attempts to overcome the problem of identifying less promising locally optimal solutions in both clustering and feature selection, without making any a priori assumption about the number of clusters. Within the NMA_CFS procedure, a variable composite representation is devised to encode both feature selection and cluster centers with different numbers of clusters. Further, local search operations are introduced to refine feature selection and cluster centers encoded in the chromosomes. Finally, a niching method is integrated to preserve the population diversity and prevent premature convergence. In an experimental evaluation we demonstrate the effectiveness of the proposed approach and compare it with other related approaches, using both synthetic and real data
A systematic review of data quality issues in knowledge discovery tasks
Hay un gran crecimiento en el volumen de datos porque las organizaciones capturan permanentemente la cantidad colectiva de datos para lograr un mejor proceso de toma de decisiones. El desafío mas fundamental es la exploración de los grandes volúmenes de datos y la extracción de conocimiento útil para futuras acciones por medio de tareas para el descubrimiento del conocimiento; sin embargo, muchos datos presentan mala calidad. Presentamos una revisión sistemática de los asuntos de calidad de datos en las áreas del descubrimiento de conocimiento y un estudio de caso aplicado a la enfermedad agrícola conocida como la roya del café.Large volume of data is growing because the organizations are continuously capturing the collective amount of data for better decision-making process. The most fundamental challenge is to explore the large volumes of data and extract useful knowledge for future actions through knowledge discovery tasks, nevertheless many data has poor quality. We presented a systematic review of the data quality issues in knowledge discovery tasks and a case study applied to agricultural disease named coffee rust
Enhancing the scalability of a genetic algorithm to discover quantitative association rules in large-scale datasets
Association rule mining is a well-known methodology to discover significant and apparently hidden relations among
attributes in a subspace of instances from datasets. Genetic algorithms have been extensively used to find interesting association
rules. However, the rule-matching task of such techniques usually requires high computational and memory requirements. The use
of efficient computational techniques has become a task of the utmost importance due to the high volume of generated data
nowadays. Hence, this paper aims at improving the scalability of quantitative association rule mining techniques based on
genetic algorithms to handle large-scale datasets without quality loss in the results obtained. For this purpose, a new
representation of the individuals, new genetic operators and a windowing-based learning scheme are proposed to achieve
successfully such challenging task. Specifically, the proposed techniques are integrated into the multi-objective evolutionary
algorithm named QARGA-M to assess their performances. Both the standard version and the enhanced one of QARGA-M have
been tested in several datasets that present different number of attributes and instances. Furthermore, the proposed methodologies
have been integrated into other existing techniques based in genetic algorithms to discover quantitative association rules. The
comparative analysis performed shows significant improvements of QARGA-M and other existing genetic algorithms in terms of
computational costs without losing quality in the results when the proposed techniques are applied.Ministerio de Ciencia y Tecnología TIN2011- 28956-C02-02Junta de Andalucía TIC-7528Junta de Andalucía P12-TIC-1728Universidad Pablo de Olavide APPB81309
Meta-learning computational intelligence architectures
In computational intelligence, the term \u27memetic algorithm\u27 has come to be associated with the algorithmic pairing of a global search method with a local search method. In a sociological context, a \u27meme\u27 has been loosely defined as a unit of cultural information, the social analog of genes for individuals. Both of these definitions are inadequate, as \u27memetic algorithm\u27 is too specific, and ultimately a misnomer, as much as a \u27meme\u27 is defined too generally to be of scientific use. In this dissertation the notion of memes and meta-learning is extended from a computational viewpoint and the purpose, definitions, design guidelines and architecture for effective meta-learning are explored. The background and structure of meta-learning architectures is discussed, incorporating viewpoints from psychology, sociology, computational intelligence, and engineering. The benefits and limitations of meme-based learning are demonstrated through two experimental case studies -- Meta-Learning Genetic Programming and Meta- Learning Traveling Salesman Problem Optimization. Additionally, the development and properties of several new algorithms are detailed, inspired by the previous case-studies. With applications ranging from cognitive science to machine learning, meta-learning has the potential to provide much-needed stimulation to the field of computational intelligence by providing a framework for higher order learning --Abstract, page iii
Recommendation Systems Based on Association Rule Mining for a Target Object by Evolutionary Algorithms
Recommender systems are designed for offering products to the potential customers. Collaborative Filtering is known as a common way in Recommender systems which offers recommendations made by similar users in the case of entering time and previous transactions. Low accuracy of suggestions due to a database is one of the main concerns about collaborative filtering recommender systems. In this field, numerous researches have been done using associative rules for recommendation systems to improve accuracy but runtime of rule-based recommendation systems is high and cannot be used in the real world. So, many researchers suggest using evolutionary algorithms for finding relative best rules at runtime very fast. The present study investigated the works done for producing associative rules with higher speed and quality. In the first step Apriori-based algorithm will be introduced which is used for recommendation systems and then the Particle Swarm Optimization algorithm will be described and the issues of these 2 work will be discussed. Studying this research could help to know the issues in this research field and produce suggestions which have higher speed and quality
Self learning neuro-fuzzy modeling using hybrid genetic probabilistic approach for engine air/fuel ratio prediction
Machine Learning is concerned in constructing models which can learn and make predictions based on data. Rule extraction from real world data that are usually tainted with noise, ambiguity, and uncertainty, automatically requires feature selection. Neuro-Fuzzy system (NFS) which is known with its prediction performance has the difficulty in determining the proper number of rules and the number of membership functions for each rule. An enhanced hybrid Genetic Algorithm based Fuzzy Bayesian
classifier (GA-FBC) was proposed to help the NFS in the rule extraction. Feature selection was performed in the rule level overcoming the problems of the FBC which depends on the frequency of the features leading to ignore the patterns of small classes. As dealing with a real world problem such as the Air/Fuel Ratio (AFR) prediction, a multi-objective problem is adopted. The GA-FBC uses mutual information entropy, which considers the relevance between feature attributes and class attributes. A fitness function is proposed to deal with multi-objective problem without weight using a new composition method. The model was compared to other learning algorithms for NFS such as Fuzzy c-means (FCM) and grid partition algorithm. Predictive accuracy and the complexity of the Fuzzy Rule Base System (FRBS) including number of rules and number of terms in each rule were taken as terms of evaluation. It was also compared to the original GA-FBC depending on the
frequency not on Mutual Information (MI). Experimental results using Air/Fuel Ratio
(AFR) data sets show that the new model participates in decreasing the average number of attributes in the rule and sometimes in increasing the average performance compared to other models. This work facilitates in achieving a self-generating FRBS from real data. The GA-FBC can be used as a new direction in machine learning research. This research contributes in controlling automobile emissions in helping the
reduction of one of the most causes of pollution to produce greener environment
A Framework for Leveraging Artificial Intelligence in Project Management
Dissertation presented as the partial requirement for obtaining a Master's degree in Information Management, specialization in Information Systems and Technologies ManagementThis dissertation aims to support the project manager in their daily tasks. As we use artificial
intelligence (AI) and machine learning (ML) in everyday life, it is necessary to include them in business
and change traditional ways of working. For the purpose of this study, it is essential to understand
challenges and areas of project management and how artificial intelligence can contribute to them. A
theoretical overview, applying the knowledge of project management, will show a holistic view of the
current situation in the enterprises. The research is about artificial intelligence applications in project
management, the common activities in project management, the biggest challenges, and how AI and
ML can support it. Understanding project managers help create a framework that will contribute to
optimizing their tasks. After designing and developing the framework for applying artificial intelligence
to project management, the project managers were asked to evaluate. This study is essential to
increase awareness among the stakeholders and enterprises on how automation of the processes can
be improved and how AI and ML can decrease the possibility of risk and cost along with improving the
happiness and efficiency of the employees
- …