267,998 research outputs found

    Feature-based search space characterisation for data-driven adaptive operator selection

    Get PDF
    Combinatorial optimisation problems are known as unpredictable and challenging due to their nature and complexity. One way to reduce the unpredictability of such problems is to identify features and the characteristics that can be utilised to guide the search using domain-knowledge and act accordingly. Many problem solving algorithms use multiple complementary operators in patterns to handle such unpredictable cases. A well-characterised search space may help to evaluate the problem states better and select/apply a neighbourhood operator to generate more productive new problem states that allow for a smoother path to the final/optimum solutions. This applies to the algorithms that use multiple operators to solve problems. However, the remaining challenge is determining how to select an operator in an optimal way from the set of operators while taking the search space conditions into consideration. Recent research shows the success of adaptive operator selection to address this problem. However, efficiency and scalability issues still persist in this regard. In addition, selecting the most representative features remains crucial in addressing problem complexity and inducing commonality for transferring experience across domains. This paper investigates if a problem can be represented by a number of features identified by landscape analysis, and whether an adaptive operator selection scheme can be constructed using Machine Learning (ML) techniques to address the efficiency and scalability issues. The proposed method determines the optimal categorisation by analysing the predictivity of a set of features using the most well-known supervised ML techniques. The identified set of features is then used to construct an adaptive operator selection scheme. The findings of the experiments demonstrate that supervised ML algorithms are highly effective when building adaptable operator selectors

    History matching and production optimization under uncertainties – Application of closed-loop reservoir management

    Get PDF
    There is an intensive investigation reported in the literature regarding the development of robust methods to improve the economical performance during the production management of petroleum fields. One paradigm that emerged in the last decade and has been calling the attention of various research groups is known as closed-loop reservoir management. The closed-loop entails the application of history matching and production optimization in a near-continuous feedback process. This work presents a closed-loop workflow constructed with ensemble-based methods. The proposed workflow consists of three components: history matching, model selection and production optimization. For history matching, we use the method known as ensemble smoother with multiple data assimilation. For model selection, we propose a procedure grounded on the calculation of distances defined in a metric space and a minimization procedure to determine the optimal set of representative models. For production optimization, we use the ensemble-based optimization method. We investigate the performance of each method separately before testing the complete closed-loop in a benchmark problem based on Namorado field in Campos Basis, Brazil. The results showed the effectiveness of the proposed methods to form a robust closed-loop workflow.Indisponível

    A competing Markov model for cracking prediction on civil structures

    Get PDF
    Cracks on the surface of civil structures (e.g. pavement sections, concrete structures) progress in several formations and under different deterioration mechanisms. In monitoring practice, it is often that cracking type with its worst damage level is selected as a representative condition state, while other cracking types and their damage levels are neglected in records, remaining as hidden information. Therefore, the practice in monitoring has a potential to conceal with a bias selection process, which possibly result in not optimal intervention strategies. In overcoming these problems, our paper presents a non-homogeneous Markov hazard model, with competing hazard rates. Cracking condition states are classified in three types (longitudinal crack, horizontal crack, and alligator crack), with three respective damage levels. The dynamic selection of cracking condition states are undergone a competing process of cracking types and damage levels. We apply a numerical solution using Bayesian estimation and Markov Chain Monte Carlo method to solve the problem of high-order integration of complete likelihood function. An empirical study on a data-set of Japanese pavement system is presented to demonstrate the applicability and contribution of the model

    Local feature selection for multiple instance learning with applications.

    Get PDF
    Feature selection is a data processing approach that has been successfully and effectively used in developing machine learning algorithms for various applications. It has been proven to effectively reduce the dimensionality of the data and increase the accuracy and interpretability of machine learning algorithms. Conventional feature selection algorithms assume that there is an optimal global subset of features for the whole sample space. Thus, only one global subset of relevant features is learned. An alternative approach is based on the concept of Local Feature Selection (LFS), where each training sample can have its own subset of relevant features. Multiple Instance Learning (MIL) is a variation of traditional supervised learning, also known as single instance learning. In MIL, each object is represented by a set of instances, or a bag. While bags are labeled, the labels of their instances are unknown. The ambiguity of the instance labels makes the feature selection for MIL challenging. Although feature selection in traditional supervised learning has been researched extensively, there are only a few methods for the MIL framework. Moreover, localized feature selection for MIL has not been researched. This dissertation focuses on developing a local feature selection method for the MIL framework. Our algorithm, called Multiple Instance Local Salient Feature Selection (MI-LSFS), searches the feature space to find the relevant features within each bag. We also propose a new multiple instance classification algorithm, called MILES-LFS, that integrates information learned by MI-LSFS during the feature selection process to identify a reduced subset of representative bags and instances. We show that using a more focused subset of prototypes can improve the performance while significantly reducing the computational complexity. Other applications of the proposed MI-LSFS include a new method that uses our MI-LSFS algorithm to explore and investigate the features learned by a Convolutional Neural Network (CNN) model; a visualization method for CNN models, called Gradient-weighted Sample Activation Map (Grad-SAM), that uses the locally learned features of each sample to highlight their relevant and salient parts, and a novel explanation method, called Classifier Explanation by Local Feature Selection (CE-LFS), to explain the decisions of trained models. The proposed MI-LSFS and its applications are validated using several synthetic and real data sets. We report and compare quantitative measures such as Rand Index, Area Under Curve (AUC), and accuracy. We also provide qualitative measures by visualizing and interpreting the selected features and their effects

    A hierarchical Mamdani-type fuzzy modelling approach with new training data selection and multi-objective optimisation mechanisms: A special application for the prediction of mechanical properties of alloy steels

    Get PDF
    In this paper, a systematic data-driven fuzzy modelling methodology is proposed, which allows to construct Mamdani fuzzy models considering both accuracy (precision) and transparency (interpretability) of fuzzy systems. The new methodology employs a fast hierarchical clustering algorithm to generate an initial fuzzy model efficiently; a training data selection mechanism is developed to identify appropriate and efficient data as learning samples; a high-performance Particle Swarm Optimisation (PSO) based multi-objective optimisation mechanism is developed to further improve the fuzzy model in terms of both the structure and the parameters; and a new tolerance analysis method is proposed to derive the confidence bands relating to the final elicited models. This proposed modelling approach is evaluated using two benchmark problems and is shown to outperform other modelling approaches. Furthermore, the proposed approach is successfully applied to complex high-dimensional modelling problems for manufacturing of alloy steels, using ‘real’ industrial data. These problems concern the prediction of the mechanical properties of alloy steels by correlating them with the heat treatment process conditions as well as the weight percentages of the chemical compositions
    corecore