11 research outputs found

    Attributes regrouping in Fuzzy Rule Based Classification Systems: an intra-classes approach

    Get PDF
    International audienceFuzzy rule-based classification systems (FRBCS) are able to build linguistic interpretable models, they automatically generate fuzzy if-then rules and use them to classify new observations. However, in these supervised learning systems, a high number of predictive attributes leads to an exponential increase of the number of generated rules. Moreover the antecedent conditions of the obtained rules are very large since they contain all the attributes that describe the examples. Therefore the accuracy of these systems as well as their interpretability degraded. To address this problem, we propose to use ensemble methods for FRBCS where the decisions of different classifiers are combined in order to form the final classification model. We are interested in particular in ensemble methods which split the attributes into subgroups and treat each subgroup separately. We propose to regroup attributes by correlation search among the training set elements that belongs to the same class, such an intra-classes correlation search allows to characterize each class separately. Several experiences were carried out on various data. The results show a reduction in the number of rules and of antecedents without altering accuracy, on the contrary classification rates are even improved

    Критерії навчання нечіткого класифікатора на основі відстані між головними конкурентами

    Get PDF
    Класифікація це віднесення об`єкта за деякими ознаками до одного з класів. До класифікації зводяться різноманітні задачі прийняття рішень в інженерії, економіці, медицині, соціології та в інших областях. В нечітких класифікаторах залежність «входи – вихід» описуються за допомогою лінгвістичних правил , антецеденти яких містять нечіткі терми «низький», «середній», «високий» тощо. Для підвищення безпомилковості нечіткий класифікатор навчають за експериментальними даними. В даній роботі запропоновано нові критерії навчання нечіткого класифікатора, які враховують різницю належностей нечіткого висновку лише до головних конкурентів. За правильної класифікації головним конкурентом прийнятого рішення є клас, що має другий за величиною ступінь належності. У випадку неправильної класифікації помилково прийняте рішення є головним конкурентом правильного класу. Проведені комп`ютерні експерименти із навчання нечіткого класифікатора для розпізнавання трьох сортів італійських вин засвідчили суттєву перевагу нових критеріїв. Серед нових критеріїв помірну перевагу має критерій на основі квадратичної відстані між головними конкурентами з штрафом за помилкове рішення. Нові критерії можуть застосовуватися не лише для навчання нечітких класифікаторів, але і для навчання деяких інших моделей, наприклад, нейронних мереж.The classification problem is the assignment an object with certain features to one of classes. Various engineering, management, economic,political, medical, sport, and other problems are reduced to classification. In fuzzy classifiers «inputs – output» relation is described by linguistic rules. Antecedents of these rules contain fuzzy terms «low», «average», «high» etc. To increase the correctness it is necessary totune the fuzzy classifier on experimental data. The new criteria for fuzzy classifier learning that take into account the difference ofmembership degrees to the main competitors only are proposed. When the classification is correct, the main competitor of the decision is theclass with the second largest membership degree. In cases of misclassification the wrong decision is the main competitor to the correct class.Computer experiments with learning the fuzzy classifier of 3 kinds of Italian wines recognition showed a significant advantage of the newcriteria. Among new learning criteria the criterion in the form of squared distance between main competitors with the penalty for wrongdecision has minor advantage. New criteria can be used not only for tuning fuzzy classifiers but for tuning some other models, such as neuralnetworks

    A HEDGE ALGEBRAS BASED CLASSIFICATION REASONING METHOD WITH MULTI-GRANULARITY FUZZY PARTITIONING

    Get PDF
    During last years, lots of the fuzzy rule based classifier (FRBC) design methods have been proposed to improve the classification accuracy and the interpretability of the proposed classification models. Most of them are based on the fuzzy set theory approach in such a way that the fuzzy classification rules are generated from the grid partitions combined with the pre-designed fuzzy partitions using fuzzy sets. Some mechanisms are studied to automatically generate fuzzy partitions from data such as discretization, granular computing, etc. Even those, linguistic terms are intuitively assigned to fuzzy sets because there is no formalisms to link inherent semantics of linguistic terms to fuzzy sets. In view of that trend, genetic design methods of linguistic terms along with their (triangular and trapezoidal) fuzzy sets based semantics for FRBCs, using hedge algebras as the mathematical formalism, have been proposed. Those hedge algebras-based design methods utilize semantically quantifying mapping values of linguistic terms to generate their fuzzy sets based semantics so as to make use of fuzzy sets based-classification reasoning methods proposed in design methods based on fuzzy set theoretic approach for data classification. If there exists a classification reasoning method which bases merely on semantic parameters of hedge algebras, fuzzy sets-based semantics of the linguistic terms in fuzzy classification rule bases can be replaced by semantics - based hedge algebras. This paper presents a FRBC design method based on hedge algebras approach by introducing a hedge algebra- based classification reasoning method with multi-granularity fuzzy partitioning for data classification so that the semantic of linguistic terms in rule bases can be hedge algebras-based semantics. Experimental results over 17 real world datasets are compared to existing methods based on hedge algebras and the state-of-the-art fuzzy sets theoretic-based approaches, showing that the proposed FRBC in this paper is an effective classifier and produces good results

    Diabetes Prediction by Optimizing the Nearest Neighbor Algorithm Using Genetic Algorithm

    Get PDF
    Introduction: Diabetes or diabetes mellitus is a metabolic disorder in body when the body does not produce insulin, and produced insulin cannot function normally. The presence of various signs and symptoms of this disease makes it difficult for doctors to diagnose. Data mining allows analysis of patients’ clinical data for medical decision making. The aim of this study was to provide a model for increasing the accuracy of diabetes prediction. Method: In this study, the medical records of 1151 patients with diabetes were studied, with 19 features. Patients’ information were collected from the UCI standard database. Each patient has been followed for at least one year. Genetic Algorithm (GA) and the nearest neighbor algorithm were used to provide diabetes prediction model. Results: It was revealed that the prediction accuracy of the proposed model equals 0.76. Also, for the methods of Naïve Bayes, Multi-layer perceptron (MLP) neural network, and support vector machine (SVM), the prediction accuracy was 0.62, 0.65, and 0.75, respectively. Conclusion: In predicting diabetes, the proposed model has the lowest error rate and the highest accuracy compared to the other models. Naïve Bayes method has the highest error rate and the lowest accuracy

    Automatic Finding Trapezoidal Membership Functions in Mining Fuzzy Association Rules Based on Learning Automata

    Get PDF
    Association rule mining is an important data mining technique used for discovering relationships among all data items. Membership functions have a significant impact on the outcome of the mining association rules. An important challenge in fuzzy association rule mining is finding an appropriate membership functions, which is an optimization issue. In the most relevant studies of fuzzy association rule mining, only triangle membership functions are considered. This study, as the first attempt, used a team of continuous action-set learning automata (CALA) to find both the appropriate number and positions of trapezoidal membership functions (TMFs). The spreads and centers of the TMFs were taken into account as parameters for the research space and a new approach for the establishment of a CALA team to optimize these parameters was introduced. Additionally, to increase the convergence speed of the proposed approach and remove bad shapes of membership functions, a new heuristic approach has been proposed. Experiments on two real data sets showed that the proposed algorithm improves the efficiency of the extracted rules by finding optimized membership functions

    Automatic synthesis of fuzzy systems: An evolutionary overview with a genetic programming perspective

    Get PDF
    Studies in Evolutionary Fuzzy Systems (EFSs) began in the 90s and have experienced a fast development since then, with applications to areas such as pattern recognition, curve‐fitting and regression, forecasting and control. An EFS results from the combination of a Fuzzy Inference System (FIS) with an Evolutionary Algorithm (EA). This relationship can be established for multiple purposes: fine‐tuning of FIS's parameters, selection of fuzzy rules, learning a rule base or membership functions from scratch, and so forth. Each facet of this relationship creates a strand in the literature, as membership function fine‐tuning, fuzzy rule‐based learning, and so forth and the purpose here is to outline some of what has been done in each aspect. Special focus is given to Genetic Programming‐based EFSs by providing a taxonomy of the main architectures available, as well as by pointing out the gaps that still prevail in the literature. The concluding remarks address some further topics of current research and trends, such as interpretability analysis, multiobjective optimization, and synthesis of a FIS through Evolving methods

    May I Ask a Follow-up Question? Understanding the Benefits of Conversations in Neural Network Explainability

    Full text link
    Research in explainable AI (XAI) aims to provide insights into the decision-making process of opaque AI models. To date, most XAI methods offer one-off and static explanations, which cannot cater to the diverse backgrounds and understanding levels of users. With this paper, we investigate if free-form conversations can enhance users' comprehension of static explanations, improve acceptance and trust in the explanation methods, and facilitate human-AI collaboration. Participants are presented with static explanations, followed by a conversation with a human expert regarding the explanations. We measure the effect of the conversation on participants' ability to choose, from three machine learning models, the most accurate one based on explanations and their self-reported comprehension, acceptance, and trust. Empirical results show that conversations significantly improve comprehension, acceptance, trust, and collaboration. Our findings highlight the importance of customized model explanations in the format of free-form conversations and provide insights for the future design of conversational explanations

    Data Mining in Smart Grids

    Get PDF
    Effective smart grid operation requires rapid decisions in a data-rich, but information-limited, environment. In this context, grid sensor data-streaming cannot provide the system operators with the necessary information to act on in the time frames necessary to minimize the impact of the disturbances. Even if there are fast models that can convert the data into information, the smart grid operator must deal with the challenge of not having a full understanding of the context of the information, and, therefore, the information content cannot be used with any high degree of confidence. To address this issue, data mining has been recognized as the most promising enabling technology for improving decision-making processes, providing the right information at the right moment to the right decision-maker. This Special Issue is focused on emerging methodologies for data mining in smart grids. In this area, it addresses many relevant topics, ranging from methods for uncertainty management, to advanced dispatching. This Special Issue not only focuses on methodological breakthroughs and roadmaps in implementing the methodology, but also presents the much-needed sharing of the best practices. Topics include, but are not limited to, the following: Fuzziness in smart grids computing Emerging techniques for renewable energy forecasting Robust and proactive solution of optimal smart grids operation Fuzzy-based smart grids monitoring and control frameworks Granular computing for uncertainty management in smart grids Self-organizing and decentralized paradigms for information processin
    corecore