17 research outputs found

    Using an interpretable Machine Learning approach to study the drivers of International Migration

    Full text link
    Globally increasing migration pressures call for new modelling approaches in order to design effective policies. It is important to have not only efficient models to predict migration flows but also to understand how specific parameters influence these flows. In this paper, we propose an artificial neural network (ANN) to model international migration. Moreover, we use a technique for interpreting machine learning models, namely Partial Dependence Plots (PDP), to show that one can well study the effects of drivers behind international migration. We train and evaluate the model on a dataset containing annual international bilateral migration from 19601960 to 20102010 from 175175 origin countries to 3333 mainly OECD destinations, along with the main determinants as identified in the migration literature. The experiments carried out confirm that: 1) the ANN model is more efficient w.r.t. a traditional model, and 2) using PDP we are able to gain additional insights on the specific effects of the migration drivers. This approach provides much more information than only using the feature importance information used in previous works

    Cost-based filtering algorithms for a capacitated lot sizing problem and the constrained arborescence problem

    No full text
    Constraint Programming (CP) is a paradigm derived from artificial intelligence, operational research and algorithmics that can be used to solve combinatorial problems. CP solves problems by interleaving search (assign a value to an unassigned variable) and propagation. Constraint propagation aims at removing/filtering inconsistent values from the domains of the variables in order to reduce the search space of the problem. In this thesis, we develop filtering algorithms for two complex combinatorial optimization problems: a Capacitated Lot Sizing Problem (CLSP) and the Constrained Arborescence Problem (CAP). Each of these problems has many variants and practical applications. The CLSP is the problem of finding an optimal production plan for single or multiple items while satisfying demands of clients and respecting resource restrictions. The CLSP finds important applications in production planning. In this thesis, we introduce a CLSP in CP. In many lot sizing and scheduling problems, in particular when the planning horizon is discrete and finite, there are stocking costs to be minimized. These costs depend on the time spent between the production of an order and its delivery. We focus on developing specialized filtering algorithms to handle the stocking cost part of a class of the CLSP. We propose the global optimization constraint StockingCost when the per-period stocking cost is the same for all orders; and its generalized version, the IDStockingCost constraint (ID stands for Item Dependent). In this thesis, we also deal with a well-known problem in graph theory: the Minimum Weight Arborescence (MWA) problem. Consider a directed graph in which we distinguish one vertex r as the root. An MWA rooted at r is a directed spanning tree rooted at r with minimum total weight. We focus on the CAP that requires one to find an arborescence that satisfies some side constraints and that has minimum weight. The CAP has many real life applications in telecommunication networks, computer networks, transportation problems, scheduling problems, etc. After sensitivity analysis of the MWA, we introduce the CAP in CP. We propose a dedicated global optimization constraint to handle any variant of the CAP in CP: the MinArborescence constraint. All the proposed filtering algorithms are analyzed theoretically and evaluated experimentally. The different experimental evaluations of these propagators against the state-of-the-art propagators show their respective efficiencies.(FSA - Sciences de l'ingénieur) -- UCL, 201

    The item dependent stocking cost constraint

    No full text
    In a previous work we introduced a global StockingCost constraint to compute the total number of periods between the production periods and the due dates in a multi-order capacitated lot-sizing problem. Here we consider a more general case in which each order can have a different per period stocking cost and the goal is to minimise the total stocking cost. In addition the production capacity, limiting the number of orders produced in a given period, is allowed to vary over time.We propose an efficient filtering algorithm in O(n log n) where n is the number of orders to produce. On a variant of the capacitated lot-sizing problem, we demonstrate experimentally that our new filtering algorithm scales well and is competitive wrt the StockingCost constraint when the stocking cost is the same for all orders

    Finding optimum climatic parameters for high tomato yield in Benin (West Africa) using frequent pattern growth algorithm.

    No full text
    Tomato is one of the most appreciated vegetables in the world. Predicting its yield and optimizing its culture is important for global food security. This paper addresses the challenge of finding optimum climatic values for a high tomato yield. The Frequent Pattern Growth (FPG) algorithm was considered to establish the associations between six climate variables: minimum and maximum temperatures, maximum humidity, sunshine (Sun), rainfall, and evapotranspiration (ET), collected over 26 years in the three agro-ecological Zones of Benin. Monthly climate data were aggregated with yield data over the same period. After aggregation, the data were transformed into 'low', 'medium', and 'high' attributes using the threshold values defined. Then, the rules were generated using the minimum support set to 0.2 and the confidence to 0.8. Only the rules with the consequence 'high yield' were screened. The best yield patterns were observed in the Guinean Zone, followed by the Sudanian. The results indicated that high tomato yield was associated with low ET in all areas considered. Minimum and maximum temperatures, maximum humidity, and Sun were medium in every Zone. Moreover, rainfall was high in the Sudanian Zone, unlike the other regions where it remained medium. These results are useful in assessing climate variability's impact on tomato production. Thus, they can help farmers make informed decisions on cultivation practices to optimize production in a changing environment. In addition, the findings of this study can be considered in other regions and adapted to other crops

    Rules from Sudanian Zone.

    No full text
    Tomato is one of the most appreciated vegetables in the world. Predicting its yield and optimizing its culture is important for global food security. This paper addresses the challenge of finding optimum climatic values for a high tomato yield. The Frequent Pattern Growth (FPG) algorithm was considered to establish the associations between six climate variables: minimum and maximum temperatures, maximum humidity, sunshine (Sun), rainfall, and evapotranspiration (ET), collected over 26 years in the three agro-ecological Zones of Benin. Monthly climate data were aggregated with yield data over the same period. After aggregation, the data were transformed into ‘low’, ‘medium’, and ‘high’ attributes using the threshold values defined. Then, the rules were generated using the minimum support set to 0.2 and the confidence to 0.8. Only the rules with the consequence ‘high yield’ were screened. The best yield patterns were observed in the Guinean Zone, followed by the Sudanian. The results indicated that high tomato yield was associated with low ET in all areas considered. Minimum and maximum temperatures, maximum humidity, and Sun were medium in every Zone. Moreover, rainfall was high in the Sudanian Zone, unlike the other regions where it remained medium. These results are useful in assessing climate variability’s impact on tomato production. Thus, they can help farmers make informed decisions on cultivation practices to optimize production in a changing environment. In addition, the findings of this study can be considered in other regions and adapted to other crops.</div

    Deep learning methods for biotic and abiotic stresses detection and classification in fruits and vegetables: State of the art and perspectives

    No full text
    Deep Learning (DL), a type of Machine Learning, has gained significant interest in many fields, including agriculture. This paper aims to shed light on deep learning techniques used in agriculture for abiotic and biotic stress detection in fruits and vegetables, their benefits, and the challenges faced by users. Scientific papers were collected from Web of Science, Scopus, Google Scholar, Springer, and Directory of Open Access Journals (DOAJ) using combinations of specific keywords such as:’Deep Learning’ OR’Artificial Intelligence’ in combination with fruit disease’, vegetable disease’, ‘fruit stress', OR ‘vegetable stress' following PRISMA guidelines. From the initial 818 papers identified using the keywords, 132 were reviewed after excluding books, reviews, and the irrelevant. The recovered scientific papers were from 2003 to 2022; 93 % addressed biotic stress on fruits and vegetables. The most common biotic stresses on species are fungal diseases (grey spots, brown spots, black spots, downy mildew, powdery mildew, and anthracnose). Few studies were interested in abiotic stresses (nutrient deficiency, water stress, light intensity, and heavy metal contamination). Deep Learning and Convolutional Neural Networks were the most used keywords, with GoogleNet (18.28%), ResNet50 (16.67%), and VGG16 (16.67%) as the most used architectures. Fifty-two percent of the data used to compile these models come from the fields, followed by data obtained online. Precision problems due to unbalanced classes and the small size of some databases were also analyzed. We provided the research gaps and some perspectives from the reviewed papers. Further research works are required for a deep understanding of the use of machine learning techniques in fruit and vegetable studies: collection of large datasets according to different scenarios on fruit and vegetable diseases, evaluation of the effect of climatic variability on the fruit and vegetable yield using AI methods and more abiotic stress studies

    Correlation analysis between variables.

    No full text
    A: Correlation between predictors and response variable. B: Correlation between predictors.</p

    Threshold values of variables for Sudanian Zone.

    No full text
    Tomato is one of the most appreciated vegetables in the world. Predicting its yield and optimizing its culture is important for global food security. This paper addresses the challenge of finding optimum climatic values for a high tomato yield. The Frequent Pattern Growth (FPG) algorithm was considered to establish the associations between six climate variables: minimum and maximum temperatures, maximum humidity, sunshine (Sun), rainfall, and evapotranspiration (ET), collected over 26 years in the three agro-ecological Zones of Benin. Monthly climate data were aggregated with yield data over the same period. After aggregation, the data were transformed into ‘low’, ‘medium’, and ‘high’ attributes using the threshold values defined. Then, the rules were generated using the minimum support set to 0.2 and the confidence to 0.8. Only the rules with the consequence ‘high yield’ were screened. The best yield patterns were observed in the Guinean Zone, followed by the Sudanian. The results indicated that high tomato yield was associated with low ET in all areas considered. Minimum and maximum temperatures, maximum humidity, and Sun were medium in every Zone. Moreover, rainfall was high in the Sudanian Zone, unlike the other regions where it remained medium. These results are useful in assessing climate variability’s impact on tomato production. Thus, they can help farmers make informed decisions on cultivation practices to optimize production in a changing environment. In addition, the findings of this study can be considered in other regions and adapted to other crops.</div
    corecore