3 research outputs found

    Finding Good Attribute Subsets for Improved Decision Trees Using a Genetic Algorithm Wrapper; a Supervised Learning Application in the Food Business Sector for Wine Type Classification

    Get PDF
    This study aims to provide a method that will assist decision makers in managing large datasets, eliminating the decision risk and highlighting significant subsets of data with certain weight. Thus, binary decision tree (BDT) and genetic algorithm (GA) methods are combined using a wrapping technique. The BDT algorithm is used to classify data in a tree structure, while the GA is used to identify the best attribute combinations from a set of possible combinations, referred to as generations. The study seeks to address the problem of overfitting that may occur when classifying large datasets by reducing the number of attributes used in classification. Using the GA, the number of selected attributes is minimized, reducing the risk of overfitting. The algorithm produces many attribute sets that are classified using the BDT algorithm and are assigned a fitness number based on their accuracy. The fittest set of attributes, or chromosomes, as well as the BDTs, are then selected for further analysis. The training process uses the data of a chemical analysis of wines grown in the same region but derived from three different cultivars. The results demonstrate the effectiveness of this innovative approach in defining certain ingredients and weights of wine’s origin

    Fostering Sustainable Aquaculture: Mitigating Fish Mortality Risks Using Decision Trees Classifiers

    No full text
    A proposal has been put forward advocating a data-driven strategy that employs classifiers from data mining to foresee and categorize instances of fish mortality. This addresses the increasing concerns regarding the death rates in caged fish environments because of the unsustainable fish farming techniques employed and environmental variables involved. The aim of this research is to enhance the competitiveness of Greek fish farming through the development of an intelligent system that is able to diagnose fish diseases in farms. This system concurrently addresses medication and dosage issues. To achieve this, a comprehensive dataset derived from various aquaculture sources was used, including various factors such as the geographic locations, farming techniques, and indicative parameters such as the water quality, climatic conditions, and fish biological characteristics. The main objective of the research was to categorize fish mortality cases through predictive models. Advanced data mining classification methods, specifically decision trees (DTs), were used for the comparison, aiming to recognize the most appropriate method with high precision and recall rates in predicting fish death rates. To ensure the reliability of the results, a methodical evaluation process was adopted, including cross-validation and a classification performance assessment. In addition, a statistical analysis was performed to gain insights into the factors that identify the correlations between the various factors affecting fish mortality. This analysis contributes to the development of targeted conservation and restoration action strategies. The research results have important implications for sustainable management actions, enabling stakeholders to proactively address issues and monitor aquaculture practices. This proactive approach ensures the protection of farmed fish quantities while meeting global seafood requirements. The data mining using a classification approach coincides with the general context of the UN sustainability goals, reducing the losses in seafood management and production when dealing with the consequences of climate change

    Finding Good Attribute Subsets for Improved Decision Trees Using a Genetic Algorithm Wrapper; a Supervised Learning Application in the Food Business Sector for Wine Type Classification

    No full text
    This study aims to provide a method that will assist decision makers in managing large datasets, eliminating the decision risk and highlighting significant subsets of data with certain weight. Thus, binary decision tree (BDT) and genetic algorithm (GA) methods are combined using a wrapping technique. The BDT algorithm is used to classify data in a tree structure, while the GA is used to identify the best attribute combinations from a set of possible combinations, referred to as generations. The study seeks to address the problem of overfitting that may occur when classifying large datasets by reducing the number of attributes used in classification. Using the GA, the number of selected attributes is minimized, reducing the risk of overfitting. The algorithm produces many attribute sets that are classified using the BDT algorithm and are assigned a fitness number based on their accuracy. The fittest set of attributes, or chromosomes, as well as the BDTs, are then selected for further analysis. The training process uses the data of a chemical analysis of wines grown in the same region but derived from three different cultivars. The results demonstrate the effectiveness of this innovative approach in defining certain ingredients and weights of wine’s origin
    corecore